Overview
This course focuses on tracing real-time distributed systems, specifically in the context of latency-sensitive data streaming in large-scale distributed systems. The learning outcomes include understanding how to model asynchronous message flows, track long-running sessions, handle fan-in/fan-out data flows, and reduce storage costs while capturing granular tracing data. The course teaches concepts such as tracing architecture, triage debugging, and utilizing tracing to diagnose issues and measure service level indicators. The intended audience for this course includes professionals working with real-time data streaming systems and those interested in improving observability in distributed systems.
Syllabus
Intro
Overview
What is the Bloomberg Terminal?
About Bloomberg
Example - A Search Engine
(Possible) Architecture View
Trace Concepts. Span Model
Tracing Architecture (One Interpretation)
Using Trace • Triage Debugging
Tracing Model & Implementation
A Trading System
Architecture - Simple View
System Characteristics
How we traced it?
What does it look like?
The interesting bit
What we do with it?
Message Fan-Out (broadcast)
#2 Splitting Messages
Message Conflation
What's Next?
We are hiring!
Taught by
USENIX