Overview
This course on server fleet management using Camunda covers the learning outcomes and goals of achieving hands-off capacity management through multiple workflows with asynchronous pauses and synchronizations. The course teaches how to integrate Camunda with various platforms and components within an organization's infrastructure, along with best practices for achieving the final state. The individual skills taught include understanding the impact of hardware failures, designing automation solutions, and overcoming challenges in hardware maintenance. The teaching method involves a talk by a Site Reliability Engineer from LinkedIn, sharing real-world examples and insights. The intended audience for this course includes IT professionals, system administrators, and individuals interested in server management and automation.
Syllabus
Intro
Outline
Introduction
Technology Scale
Espresso Hardware
Impact of hardware failure
Failure frequency
Challenges with HW recovery
Requirements for automation
Features needed
Solution
Entry barriers
Automate part 2
Initial design 1
Design 2 (Final design)
Metrics
Success
Part 3 in production
Challenges with HPM
Our deployment
Future
Taught by
Camunda