In this 59-minute talk from the Simons Institute, Jacob Steinhardt of UC Berkeley explores the concept of using AI systems to understand other AI systems, focusing on safety-guaranteed Large Language Models. Discover approaches for scalable AI evaluation and analysis, examining how we might leverage artificial intelligence tools to better comprehend, assess, and ensure the safety of increasingly complex AI systems. The presentation addresses critical questions about AI oversight and the potential for creating more reliable verification methods for advanced AI capabilities.
Overview
Syllabus
Scalably Understanding AI With AI
Taught by
Simons Institute