AI Research Circle: AI Alignment, Unpacked
Monday · 2026-03-16 · 7:00 PM - 8:45 PM
research-papers · ai-alignment · constitutional-ai
This session explores AI alignment through Anthropic's Constitutional AI framework. We'll dig into the 84-page constitution that guides Claude's behavior, the difference between rules and dispositions, and how Constitutional AI compares to RLHF. Co-facilitated by Anup Gosavi and Emily Hough-Kovacs. Pre-read: "Constitutional AI: Harmlessness from AI Feedback" (Bai et al., 2022).