I am a research scientist at Anthropic, where I started and lead the societalimpacts team. Prior to that, I was the founding research director at the Stanford Institute for Human Centered AI (HAI). I did my PhD in Computational Neuroscience at NYU and obtained my BS in Electrical Engineering and Computer Science (EECS) from Berkeley. For fun, I surf, play bass with my band, and read widely across the humanities and social sciences. Here's some of my and some
COLMlast author
Apr 2025NeurIPS
Dec 2024arXivfirst author
Feb 2023arXivfirst author
Aug 2022arXivmiddle author - evals
Dec 2022arXivmiddle author - evals
Apr 2022NeurIPS Algorithmic Fairness Workshoplast author
Dec 2024Anthropic Research Bloglast author
Oct 2024COLMlast author
Oct 2024Anthropic Research Bloglast author
Apr 2024Anthropic Research Blogfirst author
Oct 2023ACLmiddle author - evals
July 2023TMLRmiddle author - evals
May 2022Anthropic Research Bloglast author
Dec 2025Anthropic Research Bloglast author
Dec 2025arXivlast author
Feb 2025Anthropic Economic Indexlast author
Feb 2025Nature Human Behaviourmiddle author
Oct 2025FAccTlast author
Jun 2024Anthropic Research Blog
Jun 2024arXivlast author
Jun 2023Anthropic Research Bloglast author
Jun 2025FAccTfirst author
Jun 2022arXiv
Mar 2021arXivlast author
Feb 2021arXivfirst author
Feb 2016Neural Computationfirst author
Oct 2014NeurIPSfirst author
Dec 2010The Verge
Dec 2025Societal Impacts Team
The New York Times
Oct 2023Collective Constitutional AI: Aligning a Language Model with Public Input
Time Magazine
Dec 2023Collective Constitutional AI: Aligning a Language Model with Public Input
The Ezra Klein Show
Apr 2024Measuring the Persuasiveness of Language Models
Quanta Magazine
Mar 2023Predictability and Surprise in Large Generative Models
ZDNet
Dec 2025Introducing Anthropic Interviewer: What 1,250 professionals told us about working with AI
The Verge
Dec 2025Introducing Anthropic Interviewer: What 1,250 professionals told us about working with AI
Inc
Dec 2025How AI Is Transforming Work at Anthropic
Axios
Dec 2025How AI Is Transforming Work at Anthropic
VentureBeat
Apr 2025Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions
Axios
Feb 2025Which Economic Tasks are Performed with AI? Evidence from Millions of Claude Conversations
Platformer
Dec 2024Clio: Privacy-Preserving Insights into Real-World AI Use
VentureBeat
Dec 2023Evaluating and Mitigating Discrimination in Language Model Decisions
The New York Times
Aug 2023Red Teaming Language Models to Reduce Harms
MIT Technology Review
Mar 2023The Capacity for Moral Self-Correction in Large Language Models
VentureBeat
Feb 2021Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models
Nature
Aug 2019Starfish: Open Source Image Based Transcriptomics and Proteomics Tools