
Udemy Special: Ends May 28!
Learn Data Science. Courses starting at $12.99.
Get Deal
This keynote presentation from DevOpsDays NYC 2018 explores how real-world safety systems can inform better software failure management. Discover how Tanya Reilly draws parallels between physical safety measures like fire partitions, smoke alarms, sprinkler systems, and fire escapes to software contingency planning. Learn why many failures still come as surprises despite having procedures in place, and gain insights on how to better design systems that expect and manage failure. The talk examines practical approaches to reducing damage, redirecting traffic, prioritizing requests, and creating more effective documented procedures based on lessons from physical safety infrastructure.