Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Running AI Inference on Google Kubernetes Engine - Anthropic's Approach with Claude

Google Cloud Tech via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
Discover how Anthropic leverages Google Kubernetes Engine (GKE) to run inference for Claude, achieving cost efficiency and high performance using Google tensor processing units (TPUs) and NVIDIA graphics processing units. Learn about Anthropic's improved price-performance on TPU v5e, GKE's advanced management capabilities for simplified Day-2 maintenance, and the exceptional support provided by Google Cloud. Explore topics such as customer-triggered maintenance, Cube for Claude, Google TPU, Kubernetes orchestration, cost-efficient inference, GPU recommendations, and GKE features in this 29-minute conference talk from Google Cloud Next 2024.

Syllabus

Introduction
About Anthropic
Customer triggered maintenance
Cube for Claude
Google TPU
Kubernetes Orchestration
Cost Efficient Inference
GPU Recommendations
GKE

Taught by

Google Cloud Tech

Reviews

Start your review of Running AI Inference on Google Kubernetes Engine - Anthropic's Approach with Claude

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.