AI Research Circle: AI Alignment, Unpacked

Monday · 2026-03-16 · 7:00 PM - 8:45 PM

research-papers · ai-alignment · constitutional-ai

This session explores AI alignment through Anthropic's Constitutional AI framework. We'll dig into the 84-page constitution that guides Claude's behavior, the difference between rules and dispositions, and how Constitutional AI compares to RLHF. Co-facilitated by Anup Gosavi and Emily Hough-Kovacs. Pre-read: "Constitutional AI: Harmlessness from AI Feedback" (Bai et al., 2022).
