Classroom Contents
Building an Auto-scaling AI Inference Service - From Setup to Deployment
- 1 - Introduction to AI Inference Scaling
- 2 - Video Agenda Overview
- 3 - Different Inference Approaches
- 4 - Understanding GPU Utilization
- 5 - Setting Up One-Click Templates
- 6 - Docker Image Configuration
- 7 - Building Auto-Scaling Service
- 8 - Model Configuration Settings
- 9 - Load Testing and Metrics
- 10 - Scaling Manager Implementation
- 11 - Setting Up API Endpoint
- 12 - Conclusion and Future Topics