Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google

Build a Multimodal Live Streaming Agent with ADK

Google via YouTube

Overview

Coursera Plus Monthly Sale:
All Certificates & Courses 40% Off!
Grab it
This 21-minute tutorial from Google demonstrates how to create multimodal AI agents capable of real-time interaction through live streaming. Learn the challenging process of building live agents directly with Gemini models' Live Streaming API, including complex function calling management implementation from scratch. Then discover how the Agent Development Kit (ADK) significantly simplifies development by handling core components like live queue requests and event management automatically. The video includes comprehensive code walkthroughs and demonstrations of both approaches, providing practical insights for developers looking to build AI agents with real-time visual and audio processing capabilities. Complete with chapter markers covering architecture explanations, code examples, live demos, and access to documentation and sample code repositories for immediate implementation.

Syllabus

0:00 - Intro
1:09 - Live Agent Architecture - Gemini API
3:38 - Code - Gemini Live API
5:04 - Demo - Gemini Live API
6:45 - Live Agent Architecture - ADK
8:44 - Code - ADK
15:49 - Demo - ADK
18:22 - Recap and resources
20:24 - Outro

Taught by

Google Developers

Reviews

Start your review of Build a Multimodal Live Streaming Agent with ADK

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.