LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / People / Detail

Nicholas Schiefer

Member of Technical Staff at Anthropic and cofounder of Oulipo Labs, working on language model safety, evaluations, and scientific forecasting.

Researcher1 organizations8 reports

Profile status: updated

Suggest a correction
Suggest a source

Contributions are treated as untrusted leads. Public changes happen only after review against public sources.

Trust signals

Profile completeness41%
Public sources1
Official sources1
CountryUnknown
Last reviewedMar 13, 2026
Review outcomeUpdated
Official homepage
updated Unknown location 1 public sources

Latest review note

Added verified personal homepage and English bio from Nicholas Schiefer's public site.

Public links

website Personal homepage

Organizations

core Anthropic

Reports

Alignment and RLHF Constitutional AI: Harmlessness from AI Feedback Alignment and RLHF Collective Constitutional AI: Aligning a Language Model with Public Input Alignment and Safety Many-shot Jailbreaking Alignment and Safety Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Alignment and Safety Auditing language models for hidden objectives Alignment and Safety Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Alignment and Safety Constitutional Classifiers++: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Interpretability Tracing the thoughts of a large language model

Official and primary sources

https://nschiefer.github.io/ Official source ยท homepage

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.