Anthropic Thinks Its Own Success Is Key to Making AI Safe


Anthropic has spent the past five years warning the world about how advanced artificial intelligence could enable mass destruction, destabilize society, and cause a litany of other grave harms. But simultaneously, it has become one of the most powerful forces pushing AI capabilities forward. The company is now among the top developers and distributors of cutting-edge AI models and courts customers like the US military. It was recently valued at about $1 trillion.

At first glance, Anthropic’s stark messaging and its actions seem fundamentally at odds.

But inside the company, many people don’t see a contradiction. To understand why, you first have to understand that Anthropic operates based on two core beliefs. The first is that artificial intelligence is the most transformative technology in human history, and its arrival is inevitable. The only real question is whether it leads to catastrophe or extraordinary prosperity.

The second is that Anthropic believes the world will be better off if it remains at the frontier of the AI race, according to several former employees who spoke to WIRED on the condition of anonymity. Internally, leaders and employees at the company often refer to themselves as the “good guys,” meaning the ones being responsible stewards of AI technology, two of the sources said. The company sees accumulating power (whether in the form of capital, compute, research talent, or political influence) not as an end in itself, but as the price of fulfilling its mission: “to ensure the world safely makes the transition through transformative AI.”

Helen Toner, executive director of Georgetown’s Center for Security and Emerging Technology and a former OpenAI board member, uses an analogy to describe Anthropic’s worldview. She compares powerful AI to a forest filled with both magical treasures and dangerous monsters. All the villagers nearby are rushing in, lured by the treasure. In her telling, Anthropic wants to venture farther into the forest than anyone else while investing heavily in taming the monsters: that is, capturing AI’s benefits while containing its catastrophic risks.

“What’s distinctive about Anthropic is they’re like, ‘People are going in the forest anyway, we have to do it first.’ This is very explicitly their strategy: build cutting-edge AI in order to be a serious player at the table who can talk about what cutting-edge AI systems should look like, what risks they pose, and pushing for reasonable safeguards,” Toner tells me. “They’re very straightforward about this. It’s just a weird enough strategy that people have a hard time hearing it.”

Anthropic CEO Dario Amodei outlined this approach plainly in a conversation with his cofounders posted on the company’s careers page: “You have to find a way to actually be competitive, to actually lead the industry in some cases, and yet manage to do things safely,” he says. “If you can do that, the gravitational pull you exert is so great.”

Anthropic was founded in 2021 by a group of former OpenAI employees who defected after losing faith in the ability of the company’s leadership, particularly CEO Sam Altman, to safely bring transformational AI into the world. That sentiment still shapes the company today. Two of the former employees I spoke with say that, in internal discussions, Anthropic executives often describe Altman and OpenAI (and, to a lesser extent, Meta and Elon Musk’s xAI) as cautionary examples that help define Anthropic’s own sense of responsibility.

In many regards, Anthropic is just like any other Silicon Valley company. Many startups market themselves as David fighting the outdated, entrenched Goliaths of the industries they want to disrupt. Google, Facebook, and Apple were all founded upon idealistic principles, which later became muddied or were abandoned altogether as they became richer, larger, and more influential.
