Anthropic hires former OpenAI safety lead to head up new team

3 1 minute read

Jan Leike, a leading AI researcher who earlier this month resigned from OpenAI before publicly criticizing the company’s approach to AI safety, has joined OpenAI rival Anthropic to lead a new “superalignment” team.

In a post on X, Leike said that his team at Anthropic will focus on various aspects of AI safety and security, specifically “scalable oversight,” “weak-to-strong generalization” and automated alignment research.

I’m excited to join @AnthropicAI to continue the superalignment mission!

My new team will work on scalable oversight, weak-to-strong generalization, and automated alignment research.

If you’re interested in joining, my dms are open.

— Jan Leike (@janleike) May 28, 2024

A source familiar with the matter tells TechCrunch that Leike will report directly to Jared Kaplan, Anthropic’s chief science officer, and that Anthropic researchers currently working on scalable oversight — techniques to control large-scale AI’s behavior in predictable and desirable ways — will move to report to Leike as Leike’s team spins up.

✨🪩 Woo! 🪩✨

Jan’s led some seminally important work on technical AI safety and I’m thrilled to be working with him! We’ll be leading twin teams aimed at different parts of the problem of aligning AI systems at human level and beyond. https://t.co/aqSFTnOEG0

— Sam Bowman (@sleepinyourhat) May 28, 2024

In many ways, Leike’s team sounds similar in mission to OpenAI’s recently dissolved Superalignment team. The Superalignment team, which Leike co-led, had the ambitious goal of solving the core technical challenges of controlling superintelligent AI in the next four years, but often found itself hamstrung by OpenAI’s leadership.

Anthropic has often attempted to position itself as more safety-focused than OpenAI.

Anthropic’s CEO, Dario Amodei, was once the VP of research at OpenAI and reportedly split with OpenAI after a disagreement over the company’s direction — namely OpenAI’s growing commercial focus. Amodei brought with him a number of ex-OpenAI employees to launch Anthropic, including OpenAI’s former policy lead Jack Clark.

Source link

Music Industry Unites to Protect the Rights of Musicians Amid Risks of Generative AI Technology

Samsung finally reveals Unpacked date and how to watch

The big weaknesses Biden and Trump will confront on the debate stage: From the Politics Desk

WikiLeaks’ Julian Assange lands in Saipan for US plea-deal court hearing | WikiLeaks News

Researchers upend AI status quo by eliminating matrix multiplication in LLMs

Moto G85 quietly unveiled in Europe

This smiling robot face made of living skin is absolute nightmare fuel

TimTheTatman & NICKMERCS “can’t support” Dr Disrespect after DMs to minor

Is the risk of direct conflict between Russia and NATO increasing? | TV Shows

VW to invest up to $5B into Rivian in software deal

Anthropic hires former OpenAI safety lead to head up new team

Real Hacker Staff

B&C Group Promotes Patricia Adeyemi to Distribution Sales Director for B&C Speakers

Moondrop MIAD01 review: A unique phone that’s aimed at audiophiles

The 2025 Polestar 3 is a torque-vectoring SUV that’s fun to drive

Here’s everything Apple announced at the WWDC 2024 keynote, including Apple Intelligence, Siri makeover

Black Ops 6 will take up roughly half the console storage per preorder listings

Music Industry Unites to Protect the Rights of Musicians Amid Risks of Generative AI Technology

US military launches robot spaceplane on seventh mission | Space News

Home improvement upgrade: Apple Pay is now available at Lowe’s retail store

Switch Online's 'Missions & Rewards' Adds Pokémon Scarlet & Violet DLC Icons

Diablo 4 dev will “embrace” OP aspects: “let them have their fun”

‘Filled with fear’: Former Hong Kong student leader seeks UK asylum | Politics News

Robert De Niro bashes Trump for being a violent coward: "You think Trump ever threw a punch himself?" (video)

WWDC 2024 schedule revealed with June 10 keynote for AI, iOS 18, and more

Related Articles

Music Industry Unites to Protect the Rights of Musicians Amid Risks of Generative AI Technology

US military launches robot spaceplane on seventh mission | Space News

Home improvement upgrade: Apple Pay is now available at Lowe’s retail store

Switch Online's 'Missions & Rewards' Adds Pokémon Scarlet & Violet DLC Icons

Diablo 4 dev will “embrace” OP aspects: “let them have their fun”

‘Filled with fear’: Former Hong Kong student leader seeks UK asylum | Politics News