Latest review note
Added a personal homepage, a Google Scholar link, and a concise bio verified against the Anthropic hidden objectives research page.
PhD student at the University of Oxford working on AI safety, including scalable oversight and interpretability.
Profile status: updated