Overview
This lecture by Daniel Murfet from Timaeus explores the theoretical and empirical aspects of singular learning theory and its applications to AI alignment. Delve into how singular learning theory can be utilized to develop safety-guaranteed large language models. The presentation, part of the Simons Institute's series on AI safety, examines mathematical frameworks that could help ensure AI systems behave reliably and align with human values.
Syllabus
Theoretical And Empirical Aspects Of Singular Learning Theory For AI Alignment
Taught by
Simons Institute