Anthropic has hired Jan Leike, a former safety lead at OpenAI, to head the company's new "superalignment" team.
Leading AI researcher Jan Leike, who quit OpenAI earlier this month after openly criticizing the company's approach to AI safety, has joined rival Anthropic to lead a newly formed "superalignment" team.
Leike stated in a post on X that his team at Anthropic will concentrate on “scalable oversight,” “weak-to-strong generalization,” and automated alignment research, among other areas of AI safety and security.
As Leike’s team grows, researchers at Anthropic working on scalable oversight—methods to regulate the behavior of large-scale AI in predictable and desirable ways—will report directly to Leike, according to a source familiar with the situation who spoke with TechCrunch.
The mission of Leike's team sounds much like that of OpenAI's recently disbanded Superalignment team, which Leike co-led and which was tasked with solving the core technical challenges of controlling superintelligent AI within four years. The team, however, was frequently hamstrung by OpenAI's leadership.
Anthropic has repeatedly attempted to portray itself as more safety-conscious than OpenAI.
Dario Amodei, the CEO of Anthropic, was previously the vice president of research at OpenAI and is said to have split with the company over disagreements about its direction, namely its increasing commercial emphasis. To create Anthropic, Amodei brought along several former OpenAI staff members, including the company's onetime policy lead, Jack Clark.