Why I Love Kubernetes Failure Stories and You Should Too

Why I Love Kubernetes Failure Stories and You Should Too

GOTO Conferences via YouTube Direct link

Intro

1 of 42

1 of 42

Intro

Class Central Classrooms beta

YouTube playlists curated by Class Central.

Classroom Contents

Why I Love Kubernetes Failure Stories and You Should Too

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 ZALANDO AT A GLANCE
  3. 3 2019: DEVELOPERS USING KUBERNETES
  4. 4 INGRESS ERRORS
  5. 5 COREDNS OOMKILL
  6. 6 STOP THE BLEEDING: INCREASE MEMORY LIMIT
  7. 7 INCREASE IN MEMORY USAGE
  8. 8 CONTRIBUTING FACTORS
  9. 9 CUSTOMER IMPACT
  10. 10 IAM RETURNING 404
  11. 11 NUMBER OF PODS
  12. 12 ROUTES FROM API SERVER
  13. 13 API SERVER DOWN
  14. 14 INNOCENT MANIFEST
  15. 15 INCIDENT #2: LESSONS LEARNED
  16. 16 CLUSTER DOWN?
  17. 17 THE TRIGGER
  18. 18 CLUSTER LIFECYCLE MANAGER (CLM)
  19. 19 CLUSTER CHANNELS
  20. 20 FLANNEL ERRORS
  21. 21 RBAC CHANGES
  22. 22 NETWORK SPLIT
  23. 23 CREDENTIALS QUEUE
  24. 24 WHAT HAPPENED
  25. 25 SLACK
  26. 26 DISABLING CPU THROTTLING
  27. 27 RACE CONDITIONS..
  28. 28 COMMON PITFALLS
  29. 29 READINESS & LIVENESS PROBES
  30. 30 RESOURCE REQUESTS & LIMITS
  31. 31 AWS EKS IN PRODUCTION
  32. 32 AUTOMATED E2E TESTS
  33. 33 MONITORING
  34. 34 OPENTRACING
  35. 35 UPGRADE TO KUBERNETES 1.14
  36. 36 EMERGENCY ACCESS SERVICE
  37. 37 KUBERNETES FAILURE STORIES
  38. 38 INTERNAL TICKETS BASED ON FAILURE STORIES
  39. 39 FACTFULNESS
  40. 40 WHY KUBERNETES?
  41. 41 COMPLEXITY FOR GOOGLE-SCALE INFRA?
  42. 42 OPEN SOURCE & MORE

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.