Overview
Syllabus
Intro
Service ownership, defined
Obstacles to successful service ownership
Distributed tracing, defined
Relationships matter
Traces = raw material, not finished product
Centralized documentation
Why is documentation important?
Iterating toward ownership
More context -- mitigating facter
Dynamic alert delivery
Handling alerts
Improving postmortems
Postmortems are documentation
Why is improving oncall important?
Determining SLOS
Derive internal SLOs using tracing
Why are SLOs important?
3-piece puzzle review
Making changes
Ownership = Accountability + Agency
Taught by
USENIX