Problem:
At Tenerife, the KLM crew began their takeoff roll believing they had clearance while the Pan Am aircraft was still taxiing on the same runway. At Cloudflare, separate control surfaces believed unrelated changes were safe. In both cases, the actors' views of the shared state had diverged.
State Divergence Causes Disasters
Why It Matters:
Distributed systems fail when participants act on different views of reality.
Impact:
A safe local decision becomes unsafe globally.
Insight:
A central authority must define the single runtime safety state shared by all parts of the system.
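As a rough illustration of the insight above, here is a minimal Python sketch of one authority that owns the safety state and grants clearance to one actor at a time. The class name SafetyAuthority, its methods, and the actor names are hypothetical, chosen only for this example. It is not a description of how RCP works.

```python
import threading


class SafetyAuthority:
    """Hypothetical central authority holding the one shared safety state."""

    def __init__(self):
        self._lock = threading.Lock()
        self._holder = None  # actor currently cleared to proceed, if any

    def request(self, actor):
        """Grant clearance only if no other actor currently holds it."""
        with self._lock:
            if self._holder is None:
                self._holder = actor
                return True
            return self._holder == actor

    def release(self, actor):
        """Give up clearance; ignored unless the caller is the holder."""
        with self._lock:
            if self._holder == actor:
                self._holder = None

    def holder(self):
        """Return the actor that currently holds clearance, or None."""
        with self._lock:
            return self._holder


authority = SafetyAuthority()
first_granted = authority.request("actor_a")   # granted: nobody holds clearance
second_granted = authority.request("actor_b")  # refused: actor_a still holds it
print(first_granted, second_granted)  # prints: True False
```

Because every actor asks the same authority before acting, no two actors can each hold a private belief that they are cleared. The second request is refused until the first actor releases.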
Want to see how RCP solves this?
Email us at bparanj@zepho.com.