Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Stanford University

Beyond Benchmarks – Building a Science of AI Measurement

Stanford University via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
Join this Stanford University seminar where Sanmi Koyejo explores the need for more rigorous AI evaluation methods beyond static benchmarks. Learn how critical domains require better approaches to assess AI capabilities and safety. Discover a measurement framework that combines psychometric principles with modern AI evaluation needs, featuring techniques from Item Response Theory, amortized computation, and predictability analysis. Through safety assessment and capability measurement case studies, see how these methods can create more reliable, scalable, and meaningful evaluation systems for AI. The presentation builds toward transforming AI evaluation from benchmark collections into a rigorous measurement science capable of effectively guiding research, deployment, and policy decisions. Recorded on March 19, 2025 at Stanford University, this 73-minute seminar provides valuable insights for anyone interested in the future of AI assessment methodologies.

Syllabus

HAI Seminar with Sanmi Koyejo: Beyond Benchmarks – Building a Science of AI Measurement

Taught by

Stanford HAI

Reviews

Start your review of Beyond Benchmarks – Building a Science of AI Measurement

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.