Atlas / People / Detail

Evan Hubinger

Evan Hubinger is Head of Alignment Stress-Testing at Anthropic, where he works on AI safety and alignment. He previously worked at MIRI and OpenAI, studied mathematics and computer science at Harvey Mudd College, and is known for work on inner alignment, deceptive alignment, and alignment stress-testing.

Anthropic head of alignment stress-testing1 organizations2 reports

Profile status: updated

updated 4 public sources

AI safetyalignmentalignment stress-testing

Current frame

Anthropic head of alignment stress-testing