did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9781118549438

Building Dependable Distributed Systems

by
  • ISBN13:

    9781118549438

  • ISBN10:

    1118549430

  • Edition: 1st
  • Format: Hardcover
  • Copyright: 2014-03-17
  • Publisher: Wiley-Scrivener

Note: Supplemental materials are not guaranteed with Rental or Used book purchases.

Purchase Benefits

  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $223.94 Save up to $67.18
  • Rent Book $156.76
    Add to Cart Free Shipping Icon Free Shipping

    TERM
    PRICE
    DUE
    USUALLY SHIPS IN 3-4 BUSINESS DAYS
    *This item is part of an exclusive publisher rental program and requires an additional convenience fee. This fee will be reflected in the shopping cart.

Supplemental Materials

What is included with this book?

Summary

A one-volume guide to the most essential techniques for designing and building dependable distributed systems

Instead of covering a broad range of research works for each dependability strategy, this useful reference focuses on only a selected few (usually the most seminal works, the most practical approaches, or the first publication of each approach), explaining each in depth, usually with a comprehensive set of examples. Each technique is dissected thoroughly enough so that readers who are not familiar with dependable distributed computing can actually grasp the technique after studying the book.

Building Dependable Distributed Systems consists of eight chapters. The first introduces the basic concepts and terminology of dependable distributed computing, and also provides an overview of the primary means of achieving dependability. Checkpointing and logging mechanisms, which are the most commonly used means of achieving limited degree of fault tolerance, are described in the second chapter. Works on recovery-oriented computing, focusing on the practical techniques that reduce the fault detection and recovery times for Internet-based applications, are covered in chapter three. Chapter four outlines the replication techniques for data and service fault tolerance. This chapter also pays particular attention to optimistic replication and the CAP theorem. Chapter five explains a few seminal works on group communication systems. Chapter six introduces the distributed consensus problem and covers a number of Paxos family algorithms in depth. The Byzantine generals problem and its latest solutions, including the seminal Practical Byzantine Fault Tolerance (PBFT) algorithm and a number of its derivatives, are introduced in chapter seven. The final chapter details the latest research results surrounding application-aware Byzantine fault tolerance, which represents an important step forward in the practical use of Byzantine fault tolerance techniques.

Author Biography

Wenbing Zhao received his PhD in electrical and computer engineering from the University of California, Santa Barbara, in 2002. Currently, he is an Associate Professor in the Department of Electrical and Computer Engineering at Cleveland State University. Dr. Zhao has more than 80 academic publications to his credit, and three of his recent research papers in the area of dependable distributed computing have won best paper awards. Dr. Zhao also has a U.S. patent on consistent time service for fault-tolerant distributed systems.

Table of Contents

List of Figures xiii

List of Tables xxi

Acknowledgements xxiii

Preface xxv

References xxviii

1 Introduction to Dependable Distributed Computing 1

1.1 Basic Concepts and Terminologies 2

1.2 Means to Achieve Dependability 9

References 13

2 Logging and Checkpointing 15

2.1 System Model 16

2.2 Checkpoint-Based Protocols 21

2.3 Log Based Protocols 34

References 54

3 Recovery-Oriented Computing 57

3.1 System Model 59

3.2 Fault Detection and Localization 62

3.3 Microreboot 83

3.4 Overcoming Operator Errors 87

References 93

4 Data and Service Replication 97

4.1 Service Replication 99

4.2 Data Replication 105

4.3 Optimistic Replication 111

4.4 CAP Theorem 131

References 138

5 Group Communication Systems 141

5.1 System Model 143

5.2 Sequencer Based Group Communication System 146

5.3 Sender Based Group Communication System 160

5.4 Vector Clock Based Group Communication System 186

References 191

6 Consensus and the Paxos Algorithms 193

6.1 The Consensus Problem

6.2 The Paxos Algorithm 196

6.3 Multi-Paxos 206

6.4 Dynamic Paxos 210

6.5 Fast Paxos 221

6.6 Implementations of the Paxos Family Algorithms 229

References 236

7 Byzantine Fault Tolerance 239

7.1 The Byzantine Generals Problem 240

7.2 Practical Byzantine Fault Tolerance 255

7.3 Fast Byzantine Agreement 271

7.4 Speculative Byzantine Fault Tolerance 271

References 284

















Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program