Expanding Project Glasswing

Overall view of Project Glasswing / Mythos rollout

  • Many see the limited-access expansion as a classic scarcity/urgency marketing move targeting large enterprises and governments, cementing Anthropic as “essential infrastructure.”
  • Others argue the gated rollout looks like a genuine attempt at responsible deployment, given cyber misuse risks and time needed for remediation.
  • Some think the “too powerful to release” framing echoes past AI-safety marketing (e.g., prior GPT releases).

Capabilities vs hype

  • Supportive comments cite third‑party writeups (e.g., Mozilla, Cloudflare, wolfSSL, government benchmarks) claiming Mythos finds many real, high‑severity bugs and can chain vulnerabilities into working exploits.
  • Skeptical commenters emphasize:
    • Dependence on elaborate “harnesses” and workflows rather than unique model magic.
    • Irreproducible or cherry‑picked benchmarks.
    • Other strong models (e.g., GPT‑5.5‑Cyber, open‑weights ensembles) being close in capability.
  • Some note that Mythos appears to mainly amplify skilled humans rather than autonomously outclassing top experts, contradicting some marketing language.

Practical experiences and false positives

  • Reports from organizations with access describe:
    • Large volumes of findings, many minor, inapplicable, or false positives.
    • Executives overreacting to every flagged issue, creating chaos and busywork.
    • Value mainly when used in multi‑stage pipelines with deduping, PoC generation, and human triage.
  • Others report good results from regular Claude/Opus for security and performance auditing, but still with substantial noise.

Compute constraints and business incentives

  • One camp claims Anthropic is compute‑constrained and using safety as cover, especially given Mythos’ high token cost and looming IPO.
  • Counterpoints:
    • Anthropic has recently added large new compute contracts.
    • They could, in principle, ration access via higher prices.
    • It’s unclear whether safety, capacity, pricing optics, or IPO signaling is the dominant reason for the slow rollout.

Broader security, memory safety, and OSS impacts

  • Discussion branches into:
    • Social‑engineering risks and a future where strong authentication / FIDO keys become mandatory, potentially at the cost of human‑centric support.
    • Using AI (and Rust rewrites) to improve memory safety; others worry such rewrites are unmaintainable, disrespect OSS communities, and could introduce new logic bugs.
    • Concern that LLM‑driven scanning will flood teams with alerts, shift liability expectations, and turn “AI said it’s a vuln” into management pressure, without necessarily improving real‑world security.