Distributed Systems Under Failure

Start here: first entry.

  1. State Machine Replication: Log Design, Snapshots, and Compaction

    January 1, 2017 · 3 min

    Adversarial-first deep dive (January 2017): State Machine Replication: Log Design, Snapshots, and Compaction.

  2. Consensus Under Partial Synchrony: From Paxos to Raft

    February 1, 2017 · 4 min

    Correctness-focused deep dive (February 2017): Consensus Under Partial Synchrony: From Paxos to Raft.

  3. BFT from First Principles: Safety, Liveness, and Quorums

    March 1, 2017 · 4 min

    Adversarial-first deep dive (March 2017): BFT from First Principles: Safety, Liveness, and Quorums.

  4. Membership & Reconfiguration: Changing the Set Without Breaking Safety

    April 1, 2017 · 3 min

    Adversarial-first deep dive (April 2017): Membership & Reconfiguration: Changing the Set Without Breaking Safety.

  5. Consistency Models: Linearizability, Serializability, and What You Actually Need

    May 1, 2017 · 4 min

    Design memo (May 2017): Consistency Models: Linearizability, Serializability, and What You Actually Need.

  6. Transactions: 2PC, 3PC, and Coordinators You Can't Trust

    June 1, 2017 · 3 min

    Design memo (June 2017): Transactions: 2PC, 3PC, and Coordinators You Can't Trust.

  7. Gossip & Epidemic Dissemination: Fast, Probabilistic, and Weird

    July 1, 2017 · 3 min

    Threat-model-first analysis (July 2017): Gossip & Epidemic Dissemination: Fast, Probabilistic, and Weird.

  8. Geo-Replication: Latency Budgets and Cross-Region Failure Modes

    August 1, 2017 · 4 min

    Threat-model-first analysis (August 2017): Geo-Replication: Latency Budgets and Cross-Region Failure Modes.

  9. Queues & Streams: Exactly-Once Semantics Without Lying to Yourself

    September 1, 2017 · 4 min

    Design memo (September 2017): Queues & Streams: Exactly-Once Semantics Without Lying to Yourself.

  10. Rate Limiting and Fairness: Protecting Critical Paths

    October 1, 2017 · 4 min

    Design memo (October 2017): Rate Limiting and Fairness: Protecting Critical Paths.

  11. Designing for Network Partitions: Degraded Modes That Still Make Sense

    November 1, 2017 · 4 min

    Engineering notebook entry (November 2017): Designing for Network Partitions: Degraded Modes That Still Make Sense.

  12. A Minimal TLA+ Workflow for Distributed Protocols

    December 1, 2017 · 4 min

    Spec-driven research note (December 2017): A Minimal TLA+ Workflow for Distributed Protocols.