LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / People / Detail

Dylan Hadfield-Menell

Dylan Hadfield-Menell is a research scientist at Anthropic. His homepage says he studies how to understand and align increasingly capable AI systems, with additional interests in game theory and inverse reinforcement learning. MIT CSAIL's profile says he completed a PhD in computer science at the University of California, Berkeley advised by Stuart Russell.

Research Scientist at Anthropic1 organizations1 reports

Profile status: updated

Dylan Hadfield-Menell portrait
Suggest a correction
Suggest a source

Trust signals

Profile completeness77%
Public sources3
Official sources3
Last reviewedNot reviewed yet
Official homepage Scholar profile Structured work Structured education
updated 3 public sources
inverse-reinforcement-learningai-alignmentgame-theory

Current frame

Research Scientist at Anthropic

Education

University of California, Berkeley PhD · Computer Science

Work

Anthropic Research Scientist

Public links

website Personal website openreview OpenReview website MIT CSAIL profile

Organizations

core Anthropic

Reports

Alignment and Safety Constitutional Classifiers++: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Official and primary sources

Dylan Hadfield-Menell Official source · homepage · Dylan Hadfield-Menell Dylan Hadfield-Menell | OpenReview Official source · openreview · OpenReview Dylan Hadfield-Menell | MIT CSAIL Official source · team_page · MIT Computer Science and Artificial Intelligence Laboratory

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy · Terms