Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Dia 1.6B TTS for NotebookLM Podcasts - Exploring Text-to-Speech Technology

Sam Witteveen via YouTube

Overview

FLASH SALE: Ends May 22!
Udemy online courses up to 85% off.
Get Deal
This 13-minute tutorial video explores Dia, a new text-to-speech (TTS) system developed by Nari Labs, and demonstrates how it can be used to create podcasts similar to NotebookLM. Follow along as Sam Witteveen examines various articles about Dia from TechCrunch, VentureBeat, and Hacker News, before exploring the Nari Labs website and relevant research papers like SoundStorm and Parakeet. Learn about the Google TPU Research Cloud that supported this project, and watch a practical demonstration using Colab to implement the Dia 1.6B model available on Hugging Face. The video includes links to all necessary resources including the Colab notebook, Hugging Face repository, and GitHub page for those wanting to experiment with this TTS technology themselves.

Syllabus

00:00 Intro / TechCrunch Article
00:13 Venturbeat Article
00:25 Hacker News
00:37 Nari Labs Site
01:07 Toby Kim Tweet or X Post
01:33 SoundStorm Paper
01:52 Parakeet
02:21 Google TPU Research Cloud
02:52 Dia 1.5B Hugging Face Space
03:31 Colab Demo

Taught by

Sam Witteveen

Reviews

Start your review of Dia 1.6B TTS for NotebookLM Podcasts - Exploring Text-to-Speech Technology

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.