Skip to content
Unauthorized Users Gained Access to Anthropic's Restricted Mythos AI Model
4

Unauthorized Users Gained Access to Anthropic's Restricted Mythos AI Model

AnthropicAnthropicSTARTUP

A group of forum members breached Anthropic's Mythos — an AI model designed to find vulnerabilities in operating systems and browsers. The company is investigating the incident.

📝
CoinJP Editorial
0
CoinJP Editorial · 0 articles

A small group of unauthorized individuals managed to gain access to Anthropic's restricted AI model Mythos, according to a Bloomberg report citing internal company documents.

How the Breach Occurred

Bloomberg reports that several members of a private online forum began using the neural network on the day of its launch and have continued to access it regularly since. Anthropic positions Mythos as a system capable of discovering and exploiting vulnerabilities across all major operating systems and web browsers. Because of this, the company restricted access to a select group of software vendors only.

The intruders employed a combination of tactics to penetrate the system:

  • Using credentials belonging to an employee of a third-party Anthropic contractor;
  • Guessing the model's URL based on addressing patterns of other Anthropic systems;
  • Extracting additional information from a data leak at startup Mercor.

Bloomberg's source claims the group intends only to experiment with the model and has no plans to cause harm. Beyond Mythos, the participants also have access to several other unreleased Anthropic neural networks.

An Anthropic spokesperson stated: "We are investigating a report of unauthorized access to Claude Mythos Preview through one of our third-party vendor environments."

Why This Matters

The incident exposes a fundamental challenge facing the AI industry — the difficulty of controlling the spread of potentially dangerous technologies. If a model designed to identify vulnerabilities in critical software became accessible through compromised contractor credentials and startup data leaks, a pressing question remains: who else may have gained access, and with what intentions?

The situation is compounded by Mythos's unique cybersecurity capabilities, making it a potentially dangerous tool in the wrong hands.

Mythos Capabilities: 271 Vulnerabilities Found in Firefox

The model's power has been validated through Mozilla's internal testing. The company revealed on its blog that an early version of Mythos helped identify 271 vulnerabilities in Firefox — all of which have since been patched. By comparison, a previous Anthropic model that Mozilla tested earlier detected only 22 vulnerabilities in an older Firefox version.

The results demonstrated how effectively advanced AI systems can analyze large codebases and uncover weaknesses that previously demanded extensive manual review by cybersecurity professionals.

Mozilla nevertheless acknowledged that achieving absolute security remains an "unrealistic goal." The company emphasized that all discovered bugs could have been found by a highly skilled human researcher as well.

"Some commentators believe that future AI models will discover entirely new forms of vulnerabilities that go beyond our current understanding. We don't think so," Mozilla stated.

Context: Mythos Continues to Make Headlines

Mythos has remained at the center of the news cycle throughout recent weeks. Earlier in April, reports emerged that the U.S. National Security Agency is using Mythos despite Anthropic's ongoing conflict with the Pentagon. In March, the company confirmed a leak of part of the source code for its AI programming tool Claude Code. An attempt to rectify the situation backfired when Anthropic accidentally deleted thousands of GitHub repositories.

This string of incidents raises serious questions about the ability of even leading AI companies to maintain reliable control over their own creations — especially when those systems possess potentially destructive capabilities.

ai-securityanthropicartificial-intelligencecybersecuritydata-breachmozillamythos

Frequently Asked Questions

What is Anthropic's Mythos AI model?

Mythos is a restricted AI model developed by Anthropic that can detect and exploit vulnerabilities across all major operating systems and web browsers. Access is limited to a select group of software vendors due to its potentially dangerous capabilities.

How did unauthorized users access Mythos?

The group used multiple tactics: compromised credentials of a third-party Anthropic contractor employee, guessed the model's URL based on addressing patterns of other Anthropic systems, and extracted information from a data leak at startup Mercor.

How many vulnerabilities did Mythos find in Firefox?

An early version of Mythos identified 271 vulnerabilities in Firefox during Mozilla's internal testing, all of which were subsequently fixed. A previous Anthropic model had found only 22 vulnerabilities in an earlier Firefox version.

Is the NSA using Anthropic's Mythos?

According to media reports from April, the U.S. National Security Agency is using Mythos despite Anthropic's ongoing conflict with the Pentagon. The details of this arrangement were revealed earlier in the month.

What are the security risks of the Mythos breach?

The breach raises significant concerns because Mythos can find and exploit vulnerabilities in critical software. While Bloomberg's sources claim the group has purely experimental intentions, the incident highlights the challenge of controlling access to potentially dangerous AI technologies.

Read also

AI

AI Audit Uncovers Critical Liveness Bug in Ethereum's Nethermind Client

Octane Security's AI discovered a high-severity vulnerability in the Nethermind execution client that could have halted block production for 38% of Ethereum mainnet validators. The Ethereum Foundation awarded a maximum $50,000 bounty.

3 min·🔥 1
Security

GPU Memory Attacks, $21B in Cybercrime Losses, and Chrome's Chip-Level Protection: Cybersecurity Roundup

The FBI reported record $21 billion in cybercrime losses for 2025, Google introduced hardware-bound session protection in Chrome, and researchers demonstrated three new attack methods targeting Nvidia GPU memory.

5 min·🔥 0
AI

Trump Orders All Federal Agencies to Drop Anthropic Technologies Within Six Months

Federal agencies have 6 months to drop Anthropic's Claude AI amid ethics clashes. See how xAI and Pentagon deals reshape the landscape.

3 min·🔥 1
Analytics

April 2026 Sets All-Time Record for Number of Crypto Hacks

April 2026 saw a record-breaking 24 crypto hacks resulting in approximately $651 million in total losses. Kelp and Drift Protocol suffered the largest exploits.

3 min·🔥 0
Analytics

Weekly Recap: NYT Satoshi Investigation, North Korean Hackers in DeFi, and Anthropic's AI 'Escape'

Bitcoin climbed above $71,000, a NYT journalist named Adam Back as Satoshi Nakamoto, ZachXBT exposed a network of North Korean IT agents in crypto projects, and Anthropic shelved its new AI model after it escaped a sandbox and found thousands of zero-day vulnerabilities.

5 min·🔥 0
Security

Drift Protocol on Solana Hacked for $280M in Sophisticated Durable Nonce Attack

Solana-based DeFi platform Drift Protocol lost at least $280 million in a hack on April 1. The DRIFT token dropped 37% while Circle faces criticism for failing to freeze stolen USDC.

4 min·🔥 0