Anthropic head of alignment stress-testing
Evan Hubinger is Head of Alignment Stress-Testing at Anthropic, where he works on AI safety and alignment. He previously worked at MIRI and OpenAI, studied mathematics and computer science at Harvey Mudd College, and is known for work on inner alignment, deceptive alignment, and alignment stress-testing.