This conference talk from SREcon25 Americas explores the critical process of testing Disaster Recovery Plans (DRPs) through tabletop exercises. Josh Simon from the University of Michigan provides a comprehensive overview of what DRPs should contain and why they're essential for organizational resilience. Learn how to develop and implement collaborative discussion-based thought experiments to effectively test your disaster recovery procedures. Discover best practices and common pitfalls when writing and testing DRPs, along with valuable insights on designing services with reliability and recovery in mind. Gain practical knowledge about ensuring your organization's critical technology infrastructure, systems, and applications can be recovered efficiently after a disaster strikes.
Overview
Syllabus
SREcon25 Americas - Running DRP Tabletop Exercises
Taught by
USENIX