2026-06-02

Expanding Project Glasswing

Overall view of Project Glasswing / Mythos rollout

Many see the limited-access expansion as a classic scarcity/urgency marketing move targeting large enterprises and governments, cementing Anthropic as “essential infrastructure.”
Others argue the gated rollout looks like a genuine attempt at responsible deployment, given cyber misuse risks and time needed for remediation.
Some think the “too powerful to release” framing echoes past AI-safety marketing (e.g., prior GPT releases).

Capabilities vs hype

Supportive comments cite third‑party writeups (e.g., Mozilla, Cloudflare, wolfSSL, government benchmarks) claiming Mythos finds many real, high‑severity bugs and can chain vulnerabilities into working exploits.
Skeptical commenters emphasize:
- Dependence on elaborate “harnesses” and workflows rather than unique model magic.
- Irreproducible or cherry‑picked benchmarks.
- Other strong models (e.g., GPT‑5.5‑Cyber, open‑weights ensembles) being close in capability.
Some note that Mythos appears to mainly amplify skilled humans rather than autonomously outclassing top experts, contradicting some marketing language.

Practical experiences and false positives

Reports from organizations with access describe:
- Large volumes of findings, many minor, inapplicable, or false positives.
- Executives overreacting to every flagged issue, creating chaos and busywork.
- Value mainly when used in multi‑stage pipelines with deduping, PoC generation, and human triage.
Others report good results from regular Claude/Opus for security and performance auditing, but still with substantial noise.

Compute constraints and business incentives

One camp claims Anthropic is compute‑constrained and using safety as cover, especially given Mythos’ high token cost and looming IPO.
Counterpoints:
- Anthropic has recently added large new compute contracts.
- They could, in principle, ration access via higher prices.
- It’s unclear whether safety, capacity, pricing optics, or IPO signaling is the dominant reason for the slow rollout.

Broader security, memory safety, and OSS impacts

Discussion branches into:
- Social‑engineering risks and a future where strong authentication / FIDO keys become mandatory, potentially at the cost of human‑centric support.
- Using AI (and Rust rewrites) to improve memory safety; others worry such rewrites are unmaintainable, disrespect OSS communities, and could introduce new logic bugs.
- Concern that LLM‑driven scanning will flood teams with alerts, shift liability expectations, and turn “AI said it’s a vuln” into management pressure, without necessarily improving real‑world security.

Related topics