AudioGen- Textually Guided Audio Generation - Paper Explained

AudioGen- Textually Guided Audio Generation - Paper Explained

Aleksa Gordić - The AI Epiphany via YouTube Direct link

Intro

1 of 13

1 of 13

Intro

Class Central Classrooms beta

YouTube playlists curated by Class Central.

Classroom Contents

AudioGen- Textually Guided Audio Generation - Paper Explained

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Why is text-to-audio hard?
  3. 3 Comparison with VQ-GAN
  4. 4 Comparison with SoundStream
  5. 5 AudioGen overview
  6. 6 Deep dive: audio representation, LSTM
  7. 7 Losses explained
  8. 8 Complex-valued STFTs
  9. 9 Audio Language Modeling
  10. 10 Multi-stream audio inputs
  11. 11 Data and augmentations
  12. 12 Results
  13. 13 Outro

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.