Skip to content
Anthropic's Claude AI Discovered 22 Vulnerabilities in Firefox in Just Two Weeks
AI3 min
12

Anthropic's Claude AI Discovered 22 Vulnerabilities in Firefox in Just Two Weeks

AnthropicAnthropicSTARTUP

Anthropic tested Claude Opus 4.5's ability to find security flaws in Firefox. Out of 22 discovered vulnerabilities, 14 were classified as high-severity — accounting for a fifth of all critical bugs Mozilla fixed throughout 2025.

📝
CoinJP Editorial
0
CoinJP Editorial · 0 articles

Claude Scanned Nearly 6,000 Firefox Code Files

Anthropic partnered with Mozilla to evaluate the capabilities of its Claude Opus 4.5 language model in identifying security vulnerabilities. The target was Firefox — widely regarded as one of the most rigorously tested and secure open-source projects with a highly complex codebase.

Over a two-week period, the AI model uncovered 22 vulnerabilities. Mozilla classified 14 of these as high-severity, a figure representing roughly one-fifth of all high-severity bugs the browser maker remediated throughout 2025.

"We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025." — Anthropic (@AnthropicAI), original post

Why This Matters

The experiment's results highlight the growing potential of large language models in cybersecurity. A single AI system discovering one-fifth of a year's worth of critical vulnerabilities in one of the most well-defended browsers within just two weeks fundamentally shifts the landscape of software security auditing. At the same time, it raises concerns about the same technology potentially being weaponized by malicious actors.

How the Experiment Unfolded

Anthropic's researchers initially focused Claude on Firefox's JavaScript engine, which could be analyzed in isolation from the rest of the system. They subsequently expanded the model's scope to cover additional parts of the codebase.

Within just 20 minutes of starting, Claude flagged a Use After Free vulnerability — a type of flaw that allows attackers to replace data with arbitrary content. In total, the model analyzed nearly 6,000 C++ files and generated 112 reports on potential issues.

Mozilla's team addressed the majority of discovered vulnerabilities in Firefox 148, released in February. Patches for the remaining issues are scheduled for upcoming releases.

AI Proves Better at Finding Vulnerabilities Than Exploiting Them

Anthropic acknowledged that Claude demonstrated far greater proficiency in detecting security flaws than in actually exploiting them. To test this dimension, the team asked the model to demonstrate a real-world attack using the Use After Free vector.

The test was run several hundred times with varying starting conditions, consuming approximately $4,000 in API credits. Despite this extensive testing, Opus 4.6 managed to convert a vulnerability into a functioning exploit in only two instances.

According to Anthropic, this asymmetry — where AI finds vulnerabilities more effectively than it exploits them — provides a near-term advantage for cybersecurity defenders. However, the company noted that the fact the language model was able to produce even a primitive exploit at all is "cause for concern."

Mozilla Continues Working with Claude

Following the collaboration, Mozilla's researchers began independently experimenting with Claude for security purposes. This suggests the partnership's outcomes were compelling enough for the open-source project to consider integrating AI tools into its code audit workflows.

Earlier in February, so-called vibe coding using Claude Opus 4.6 led to a $1.78 million exploit of the DeFi protocol Moonwell — an incident that underscored the dual-use nature of AI in the security domain.

ai-securityanthropicclaudecybersecurityfirefoxmozillavulnerability

Frequently Asked Questions

How many vulnerabilities did Claude find in Firefox?

Claude Opus 4.5 discovered 22 vulnerabilities in Firefox over a two-week period. Mozilla rated 14 of them as high-severity, which accounts for roughly one-fifth of all high-severity bugs the company fixed throughout 2025.

Can Claude AI create exploits from vulnerabilities?

During testing, Claude was asked to develop a working exploit based on a Use After Free vulnerability. After several hundred attempts costing around $4,000 in API credits, the model succeeded in creating a functional exploit in only two cases. Anthropic noted that Claude is far better at finding flaws than exploiting them.

Were the Firefox vulnerabilities discovered by Claude patched?

Mozilla addressed the majority of the discovered vulnerabilities in Firefox 148, released in February. Remaining fixes are scheduled for inclusion in upcoming browser releases.

Why was Firefox chosen for AI security testing?

Anthropic selected Firefox because it is one of the most thoroughly tested and secure open-source projects with a highly complex codebase. This made it an ideal benchmark for evaluating AI capabilities in real-world cybersecurity scenarios.

What is the Moonwell exploit related to Claude?

In February, vibe coding using Claude Opus 4.6 led to a $1.78 million exploit of the DeFi protocol Moonwell. This incident highlighted the dual-use risks of AI tools in security contexts.

Read also

Security

GPU Memory Attacks, $21B in Cybercrime Losses, and Chrome's Chip-Level Protection: Cybersecurity Roundup

The FBI reported record $21 billion in cybercrime losses for 2025, Google introduced hardware-bound session protection in Chrome, and researchers demonstrated three new attack methods targeting Nvidia GPU memory.

5 min·🔥 0
AI

Anthropic Weakens AI Safety Commitments Amid Pentagon Ultimatum Over Military Use

Anthropic dropped its core AI safety pledge as the Pentagon set a Feb 27 deadline for unrestricted Claude access. What this means for the industry.

5 min·🔥 1
AI

Trump Orders All Federal Agencies to Drop Anthropic Technologies Within Six Months

Federal agencies have 6 months to drop Anthropic's Claude AI amid ethics clashes. See how xAI and Pentagon deals reshape the landscape.

3 min·🔥 1
AI

AI Audit Uncovers Critical Liveness Bug in Ethereum's Nethermind Client

Octane Security's AI discovered a high-severity vulnerability in the Nethermind execution client that could have halted block production for 38% of Ethereum mainnet validators. The Ethereum Foundation awarded a maximum $50,000 bounty.

3 min·🔥 1
Analytics

April 2026 Sets All-Time Record for Number of Crypto Hacks

April 2026 saw a record-breaking 24 crypto hacks resulting in approximately $651 million in total losses. Kelp and Drift Protocol suffered the largest exploits.

3 min·🔥 0
Analytics

Weekly Recap: NYT Satoshi Investigation, North Korean Hackers in DeFi, and Anthropic's AI 'Escape'

Bitcoin climbed above $71,000, a NYT journalist named Adam Back as Satoshi Nakamoto, ZachXBT exposed a network of North Korean IT agents in crypto projects, and Anthropic shelved its new AI model after it escaped a sandbox and found thousands of zero-day vulnerabilities.

5 min·🔥 0