Anthropic's Project Glasswing Update: Mythos AI Uncovers Over 10,000 Critical Vulnerabilities in One Month, Highlighting Urgent Cybersecurity Risks

Anthropic has released an initial progress report on Project Glasswing, its collaborative initiative to secure the world's most critical software infrastructure before advanced AI models can be weaponized against it. The update, published on May 22, 2026, details the early results of deploying Claude Mythos Preview — an unreleased frontier model optimized for vulnerability discovery — to approximately 50 trusted partners.

Most partners reported discovering hundreds of such issues in their own software, with several noting that their bug-detection rate increased by more than a factor of ten. The primary challenge has shifted from finding vulnerabilities to verifying, disclosing, and patching them.
Cloudflare provided a striking example: the company scanned its critical-path systems and uncovered 2,000 vulnerabilities, of which 400 were high- or critical-severity. Cloudflare's security team noted that the false-positive rate was lower than what they typically see from human testers.

Skeptics have questioned whether Mythos generates excessive false positives or “noise.” Anthropic directly addressed this by turning the model loose on open source. It scanned more than 1,000 major repositories that form the backbone of the modern internet. The scan surfaced 23,019 vulnerabilities in total (including medium- and low-severity issues), with 6,202 estimated to be high- or critical-severity.
To ensure reliability, 1,752 high- or critical-rated findings were subjected to rigorous independent review by six cybersecurity research firms or by Anthropic. Results showed a 90.6% true-positive confirmation rate (1,587 validated issues), and 62.4% of the assessed subset (1,094 vulnerabilities) were confirmed as high or critical severity.

The exploit allows an attacker to forge certificates, enabling the creation of fake bank or email-provider websites that appear completely legitimate to users and browsers alike (no warnings shown). The flaw received CVE-2026-5194 and has since been patched.

To date, only a fraction of the reported high/critical issues have been fully patched and publicly disclosed, partly because the standard 90-day coordinated vulnerability disclosure window is still in progress.
These results come with a broader warning. Anthropic states unequivocally that no company — including itself —has yet developed safeguards reliable enough to prevent the malicious use of models with Mythos-level capabilities. That is why public access to the model remains closed.

The initial update makes one thing clear: the age of AI-assisted vulnerability discovery has arrived. The bottleneck is no longer finding bugs — it is fixing them fast enough.
As Mythos and future models continue to evolve, the software ecosystem will need to adapt rapidly to this new reality. Project Glasswing represents a proactive first step, but the clock is ticking for the rest of the industry to catch up.
Also read:
- Cloudflare Just Made Email a First-Class Citizen for AI Agents — And Traditional Email Services Are Feeling It
- Claude Mythos Just Gave One Company Government-Level Cyber Weapons — And It’s Only April 2026
- Anthropic Launches Claude for Small Business: AI Agents That Work Inside the Tools You Already Use
- SEO Is Dying – But Google Is the Murderer
Thank you!