Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

On the Biology of a Large Language Model - Part 1

Yannic Kilcher via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
This 54-minute video explores Anthropic's Transformer Circuit Blog Post, diving deep into the internal mechanisms of Claude 3.5 Haiku, Anthropic's lightweight production model. Examine how the model functions in various contexts through circuit tracing methodology, as presented by Yannic Kilcher. Learn about the research conducted by Jack Lindsey, Wes Gurnee, Emmanuel Ameisen, and numerous other contributors who investigated the inner workings of large language models. The video serves as the first part of a series analyzing what could be considered the "biology" of these AI systems, referencing the detailed technical paper available at transformer-circuits.pub. Discover how researchers are approaching AI systems with techniques reminiscent of biological research to understand their internal operations and behaviors.

Syllabus

On the Biology of a Large Language Model (Part 1)

Taught by

Yannic Kilcher

Reviews

Start your review of On the Biology of a Large Language Model - Part 1

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.