TL;DR: Anthropic’s System Card shows Claude Mythos Preview autonomously finding zero-days, building exploits, and completing corporate attack simulations with minimal guidance, while CyberGym and Cybench scores climbed sharply over prior models. That capability shift turns identity telemetry, patch velocity, and response timing into the decisive controls, not optional hardening.
NHIMG editorial — based on content published by AuthMind: Ahead of the Breach, Part 1 of 3, The Capability Threshold
By the numbers:
- Claude Mythos Preview achieved a perfect pass@1 score of 1.00 on Cybench, a public benchmark drawing from 40 CTF challenges across four major competitions.
- Claude Mythos Preview scored 0.83 on CyberGym versus Claude Opus 4.6's 0.67 across 1,507 real open-source software tasks.
- The Firefox 147 JavaScript shell exploitation evaluation showed Claude Mythos Preview succeeding at 84% from given crash categories.
Questions worth separating out
A: Security teams should assume the attacker can move from discovery to exploitation faster than human review cycles allow.
Q: Why do identity controls matter more when exploit development is automated?
A: Identity controls matter because post-exploit movement usually relies on legitimate credentials, sessions, and service accounts.
Q: What breaks when remediation still assumes human-paced attackers?
A: What breaks is the timing model.
Practitioner guidance
- Compress remediation priority around exploitable paths Re-rank vulnerabilities by reachable exploitability, exposed identity paths, and likely post-login impact rather than by severity alone.
- Instrument identity telemetry for post-auth abuse Correlate authentication events, service account activity, and session anomalies so that legitimate credential use can be distinguished from rapid lateral movement.
- Test response at machine pace Run exercises that force detection, triage, and containment to complete before the attack chain finishes.
What's in the full article
AuthMind's full article covers the operational detail this post intentionally leaves for the source:
- Benchmark-by-benchmark comparison of Mythos Preview against prior Claude models across Cybench, CyberGym, and Firefox shell exploitation
- The article's defence testing notes, including refusal-rate improvements and indirect prompt injection results in browser environments
- Project Glasswing context and the practical constraints around restricted rollout for vetted defensive partners
- The author's specific argument for why response speed, not just detection coverage, now determines survivability
👉 Read AuthMind’s analysis of AI exploit capability and identity-layer defence →
AI exploit capability and identity controls: are you ready?
Explore further