This 54-minute video explores Anthropic's Transformer Circuit Blog Post, diving deep into the internal mechanisms of Claude 3.5 Haiku, Anthropic's lightweight production model. Examine how the model functions in various contexts through circuit tracing methodology, as presented by Yannic Kilcher. Learn about the research conducted by Jack Lindsey, Wes Gurnee, Emmanuel Ameisen, and numerous other contributors who investigated the inner workings of large language models. The video serves as the first part of a series analyzing what could be considered the "biology" of these AI systems, referencing the detailed technical paper available at transformer-circuits.pub. Discover how researchers are approaching AI systems with techniques reminiscent of biological research to understand their internal operations and behaviors.
Overview
Syllabus
On the Biology of a Large Language Model (Part 1)
Taught by
Yannic Kilcher