How to Evaluate Models for Safety - METR
Wednesday · 2026-04-22 · 4:00 PM - 6:00 PM
METR will walk through their framework for evaluating whether AI models can autonomously execute dangerous tasks—the kind of safety testing that matters before deployment. You'll see their actual methodology for threat modeling and capability assessment, not just theory. Useful if you're shipping models or need to understand how frontier labs think about risk.
Presented by Bond AI SF
Want more like this?
Get weekly picks matched to what you're into.