Global state and snapshot recording algorithms

Ajay D. Kshemkalyani; Mukesh Singhal

doi:10.1017/CBO9780511805318.005

4 - Global state and snapshot recording algorithms

Published online by Cambridge University Press: 05 June 2012

Ajay D. Kshemkalyani, University of Illinois, Chicago
Mukesh Singhal, University of Kentucky

Summary

Recording the global state of a distributed system on the fly is an important paradigm for analyzing, testing, or verifying properties of distributed executions. Unfortunately, a distributed system has neither a globally shared memory nor a global clock, and its message transfer delays are finite but unpredictable, which makes this problem non-trivial.

This chapter first defines consistent global states (also called consistent snapshots) and discusses the issues that must be addressed to compute consistent distributed snapshots. It then presents several algorithms that determine such snapshots on the fly for different types of networks, classified by the properties of their communication channels: FIFO, non-FIFO, and causal delivery.

Introduction

A distributed computing system consists of spatially separated processes that do not share a common memory and communicate asynchronously with each other by message passing over communication channels. Each component of a distributed system has a local state. The state of a process is characterized by the state of its local memory and a history of its activity. The state of a channel is characterized by the set of messages sent along the channel less the messages received along the channel. The global state of a distributed system is a collection of the local states of its components.
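The definitions above can be made concrete with a minimal sketch. It is not taken from the chapter: the names `Channel`, `state`, and `global_state` are hypothetical, and the model assumes each channel simply keeps a log of what was sent and what was received. It shows only how a channel state falls out as "messages sent less messages received" and how a global state collects the component states. It does not address how to record such a state consistently.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Channel:
    """One-directional channel (illustrative model, not the chapter's notation)."""
    sent: list = field(default_factory=list)      # messages sent along the channel
    received: list = field(default_factory=list)  # messages already received

    def state(self) -> Counter:
        # Channel state: messages sent less messages received, i.e. in transit.
        # Counter is used as a multiset so identical payloads are counted separately.
        return Counter(self.sent) - Counter(self.received)


def global_state(process_states: dict, channels: dict) -> dict:
    """Collect the local states of all components (processes and channels)."""
    return {
        "processes": dict(process_states),
        "channels": {name: ch.state() for name, ch in channels.items()},
    }


# Two processes p1 and p2 joined by a channel c12 (p1 -> p2); values are made up.
c12 = Channel()
c12.sent += ["m1", "m2", "m3"]
c12.received += ["m1"]

gs = global_state({"p1": {"x": 5}, "p2": {"y": 7}}, {"c12": c12})
print(gs["channels"]["c12"])  # m2 and m3 are still in transit
```

In this toy model every log is read at the same instant, which is exactly what a real distributed system cannot do without a shared memory or a global clock.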

Recording the global state of a distributed system is an important paradigm with applications in several aspects of distributed system design.

Type: Chapter
Information: Distributed Computing: Principles, Algorithms, and Systems, pp. 87-125

DOI: https://doi.org/10.1017/CBO9780511805318.005

Publisher: Cambridge University Press

Print publication year: 2008
