Anthropic Weakens AI Safety Commitments Amid Pentagon Ultimatum Over Military Use
Anthropic dropped its core AI safety pledge as the Pentagon set a Feb 27 deadline for unrestricted Claude access. What this means for the industry.
Safety Takes a Back Seat to Competition
Anthropic — long regarded as one of the most safety-conscious AI labs — has abandoned a foundational commitment in its own safety framework. The company cited the need to remain competitive in a rapidly advancing market as the driving reason behind the change.
In 2023, Anthropic introduced its Responsible Scaling Policy (RSP), a set of voluntary rules designed to mitigate catastrophic risks in AI development. The original document committed the company to pausing AI development if its models were deemed potentially dangerous.
Anthropic's chief scientist Jared Kaplan said the company concluded that simply stopping model training would not help anyone. Given the rapid pace of progress across the industry, he argued, it would be unwise to take on unilateral commitments while competitors continue to push forward.
The third version of RSP codifies a new approach: Anthropic will continue improving its AI as long as it believes its models do not hold a significant advantage over competitors. In a blog post, the company noted that the political environment has shifted toward prioritizing competitiveness and economic growth, while AI safety discussions have failed to gain meaningful support at the federal level.
The updated policy does include greater transparency measures, such as detailed publication of model testing results. Anthropic says it will only slow development if it becomes the undisputed leader in the AI race while simultaneously identifying a significant catastrophe risk.
When RSP launched in 2023, the company hoped rivals would adopt similar pledges. None did — no other major player made a comparable commitment to halt AI development upon detecting risks.
Why This Matters
Anthropic's retreat from its own safety framework marks a troubling precedent. Even companies that built their brand on responsible AI development are now backing away from voluntary commitments under competitive pressure and a lack of regulatory enforcement. The failure of self-regulation raises fundamental questions about whether the AI industry can govern itself without binding legislation.
The Pentagon's Hard Deadline
Simultaneously, Anthropic is locked in a standoff with the U.S. Department of Defense. The conflict centers on the Pentagon's plans to use AI for mass surveillance of American citizens and autonomous weapons development — applications Anthropic explicitly opposes.
Pentagon officials have stated their intent to use large language models "for all lawful scenarios" without restrictions and floated the possibility of terminating their contract with the company.
Anthropic CEO Dario Amodei met with Defense Secretary Pete Hegseth to discuss the situation. The department issued an ultimatum: the company must accept government terms by February 27. If Anthropic refuses, authorities could designate it as a supply chain threat, jeopardizing the startup's business with other U.S. government contractors. The alternative is invoking the Defense Production Act, which would allow the Pentagon to forcibly access Anthropic's technology.
Franklin Turner, a government contracts attorney at McCarter & English, called the scenario unprecedented, noting it would almost certainly trigger a wave of litigation if the administration takes adverse action against Anthropic.
In response, Anthropic stated that both parties "continue to engage in good-faith dialogue."
Anthropic Holds the Line on Military Use
Despite the pressure, Reuters reports that Anthropic has no plans to soften its restrictions on military applications of its technology. The company continues to maintain two red lines: no fully autonomous weapons and no mass surveillance of Americans.
Ethereum co-founder Vitalik Buterin voiced his support for the company's stance:
Buterin said his opinion of the company would significantly improve if Anthropic refuses to back down and honorably accepts the consequences.
In February, it emerged that the Claude model was used in an operation to capture Venezuelan President Nicolás Maduro. According to reports, Amodei did not express concern about this use of the company's products during his meeting with Hegseth.
The Economist notes that the severity of the Pentagon's ultimatum signals the military's reluctance to abandon Claude — suggesting the startup's technology may be irreplaceable for defense applications.
Separately, SpaceX and its AI subsidiary xAI are participating in a classified Pentagon tender to develop voice-controlled autonomous drone swarms.
Frequently Asked Questions
Why did Anthropic weaken its AI safety commitments?
Anthropic cited the need to remain competitive in a rapidly advancing AI market. Chief scientist Jared Kaplan argued that simply pausing model training would be unwise while competitors continue to push forward, and no other major AI lab adopted similar safety pledges.
What was Anthropic's Responsible Scaling Policy (RSP)?
Introduced in 2023, the RSP was a set of voluntary rules designed to mitigate catastrophic risks in AI development. The original policy committed Anthropic to pausing AI development if its models were deemed potentially dangerous.
What is the Pentagon's ultimatum to Anthropic?
The Pentagon demanded that Anthropic accept government terms for unrestricted use of its AI — including mass surveillance and autonomous weapons development — by February 27. If Anthropic refuses, authorities could designate it as a supply chain threat, jeopardizing its business with other U.S. government contractors.
How has Anthropic changed its approach to AI safety under the new RSP?
Under the third version of its RSP, Anthropic will continue improving its AI as long as it believes its models do not hold a significant advantage over competitors. The company says it will only slow development if it becomes the undisputed AI leader while simultaneously identifying a significant catastrophe risk.
Why does Anthropic's safety policy rollback matter for the AI industry?
It sets a troubling precedent: even companies that built their brand on responsible AI are backing away from voluntary commitments under competitive pressure and lack of regulatory enforcement. This raises fundamental questions about whether the AI industry can govern itself without binding legislation.
Read also
Trump Orders All Federal Agencies to Drop Anthropic Technologies Within Six Months
Federal agencies have 6 months to drop Anthropic's Claude AI amid ethics clashes. See how xAI and Pentagon deals reshape the landscape.
Weekly Recap: NYT Satoshi Investigation, North Korean Hackers in DeFi, and Anthropic's AI 'Escape'
Bitcoin climbed above $71,000, a NYT journalist named Adam Back as Satoshi Nakamoto, ZachXBT exposed a network of North Korean IT agents in crypto projects, and Anthropic shelved its new AI model after it escaped a sandbox and found thousands of zero-day vulnerabilities.
End of the "Cheap Ether" Era: Buterin Rewrites the L2 Playbook
Vitalik Buterin has signaled a fundamental shift away from Ethereum's rollup-centric roadmap. With the Strawmap plan targeting 10,000 TPS on L1, Layer 2 projects must find new reasons to exist.
Ethereum Outpaces Bitcoin as Institutional Buyers Return in Force
ETH is outperforming BTC with a 14.1% weekly gain versus 9.6%. Major crypto figures like Erik Voorhees are accumulating Ethereum aggressively while Vitalik Buterin proposes simplifying node setup.
US DOJ Seizes Over $580M in Crypto Linked to Chinese Criminal Organizations
The U.S. Department of Justice seized more than $580 million in cryptocurrency tied to Chinese criminal organizations, marking one of the largest crypto enforcement actions in history.
Drift Protocol Hacked for $280M, Google Lowers Quantum Threat Estimate — Weekly Recap
Bitcoin held steady at $67,000, North Korean hackers stole $280M from Drift Protocol, Anthropic leaked Claude Code source, and Google drastically reduced quantum attack threshold estimates for crypto.
