
AI Infrastructure Rack Management: Exploring Scalable Solutions Through Open-source Collaboration
Open Compute Project via YouTube
Overview

Coursera Plus Monthly Sale:
All Certificates & Courses 40% Off!
Grab it
This 14-minute presentation by Brian Vandecoevering (American Megatrends - Senior Director) and Anil Agrawal (Meta Platform Corp - Hardware Systems Engineer) explores how AI infrastructure is transforming data center rack management. Learn how AI workload complexities and distributed system interdependencies necessitate more resilient, diverse, and responsive infrastructure solutions beyond current OpenRMC and BMC designs. Discover approaches for efficient, scalable, and heterogeneous rack management solutions through open-source collaboration, with practical datacenter use cases for both current and near-future implementations. The talk emphasizes developing comprehensive manageability solutions that extend to compute components, peripherals, accelerators, racks, and cooling infrastructure to meet the evolving demands of AI workloads.
Syllabus
AI Infrastructure Rack Management: Exploring Scalable Solutions Through Open-source Collaboration
Taught by
Open Compute Project