2026-06-04

Anthropic's open-source framework for AI-powered vulnerability discovery

Project status and purpose

The repo is explicitly “not maintained” and positioned more as a reference harness than a supported tool.
Several commenters see it mainly as marketing for a commercial “Claude Security” / managed scanning offering.
Others frame it as a pattern library: useful to study, then recreate or adapt internally rather than adopt directly.

Costs and scalability

Token use per agent is high; full-codebase scans, especially with top-tier models (Opus/Mythos), are perceived as expensive.
Some argue you’d only run full scans periodically and diffs in CI, but others note many orgs ship every sprint, making recurring cost large.
Cost comparisons vary: some say a dedicated security hire might be cheaper; others cite reports claiming AI-based scanning can match many engineers, making it “pennies on the dollar” versus traditional audits.
External calculators are referenced showing multi-million–dollar annual token spends for 100+ dev teams if used heavily.

Effectiveness, false positives, and workflow

Without a well-designed harness, people report poor results and many false positives (“vibe auditing”).
Even with a harness, findings still need expert review and triage, otherwise developers drown in noise, similar to today’s linters and SAST.

Impact on attackers, defenders, and existing tools

Some see this as a potential existential threat or eventual feature for traditional SAST vendors.
Others note attackers can use the same models, turning vulnerability discovery into a “proof-of-work” arms race, but emphasize the bugs already existed.
Concerns include a flood of high-severity reports overwhelming maintainers and bug-bounty programs.

AI-generated code and security lifecycle

Many note it now takes far more tokens to secure code than to generate it.
There’s skepticism about AI vendors effectively charging first to generate “sloppy” code and then to scan/fix it.
Some argue models should be trained to emit secure code, but others reply that serious bugs often span large, dependency-heavy codebases beyond what can be fully reasoned about on each edit.

Business model and ecosystem debates

Debate over why vendors sell raw tokens vs vertical SaaS:
- One side claims if tokens were truly magical, vendors would hoard them and dominate industries directly.
- Others counter that selling infrastructure (like fabs or tractors) can still be optimal, and building end-user SaaS is a different, distraction-heavy business.
Some see security harnesses as part of a broader shift: AI companies turning domain-specific harnesses (design, security, etc.) into packaged products.

Open source harnesses and “shop jig” tools

Multiple alternative or similar tools are mentioned; some trip antivirus and are mainly for practitioners comfortable with that.
A recurring analogy likens these frameworks to “shop jigs”: custom tooling tuned to a team or individual’s workflow, often better built in-house than used off-the-shelf.
Commenters discuss making such harnesses portable across jobs, or shared within organizations to raise the team-wide productivity floor.
There’s a broader theme that AI makes bespoke tooling so cheap that generalized libraries/harnesses are increasingly used as inspiration rather than as direct dependencies.

Trust, naming, and practical concerns

Confusion around the GitHub account name (“Anthropics” vs “Anthropic”) recurs.
Some dismiss the project as “open-source glue to an LLM blob” or criticize that it’s unmaintained and closed to contributions.
A few express distrust of sending source code to remote LLMs or being gated by specific browsers/search engines.

Related topics