Overview
Explore a conference talk from SREcon24 Americas on measuring reliability culture to optimize tradeoffs, presented by Kathryn (Casey) Bouskill from Meta. Learn how Meta shifted its approach from "move fast and break things" to "move fast with stable infrastructure" by focusing on the cultural aspects of reliability work. Discover systematic methods for measuring on-the-ground perspectives of reliability work, so that decisions about what level of reliability to target are well informed. Gain actionable insights on evaluating the underlying reliability culture, measuring reliability sentiment with data, identifying barriers and facilitators to performing the work, and prioritizing reliability efforts holistically in line with cultural values. Understand how to balance competing demands and growing pressure to optimize for efficiency within a reliability culture.
Syllabus
SREcon24 Americas - Measuring Reliability Culture to Optimize Tradeoffs: Perspectives from an...
Taught by
USENIX