Anthropic's open-source framework for AI-powered vulnerability discovery
Project status and purpose
- The repo is explicitly “not maintained” and positioned more as a reference harness than a supported tool.
- Several commenters see it mainly as marketing for a commercial “Claude Security” / managed scanning offering.
- Others frame it as a pattern library: useful to study, then recreate or adapt internally rather than adopt directly.
Costs and scalability
- Token use per agent is high; full-codebase scans, especially with top-tier models (Opus/Mythos), are perceived as expensive.
- Some argue you’d only run full scans periodically and diffs in CI, but others note many orgs ship every sprint, making recurring cost large.
- Cost comparisons vary: some say a dedicated security hire might be cheaper; others cite reports claiming AI-based scanning can match many engineers, making it “pennies on the dollar” versus traditional audits.
- External calculators are referenced showing multi-million–dollar annual token spends for 100+ dev teams if used heavily.
Effectiveness, false positives, and workflow
- Without a well-designed harness, people report poor results and many false positives (“vibe auditing”).
- Even with a harness, findings still need expert review and triage, otherwise developers drown in noise, similar to today’s linters and SAST.
Impact on attackers, defenders, and existing tools
- Some see this as a potential existential threat or eventual feature for traditional SAST vendors.
- Others note attackers can use the same models, turning vulnerability discovery into a “proof-of-work” arms race, but emphasize the bugs already existed.
- Concerns include a flood of high-severity reports overwhelming maintainers and bug-bounty programs.
AI-generated code and security lifecycle
- Many note it now takes far more tokens to secure code than to generate it.
- There’s skepticism about AI vendors effectively charging first to generate “sloppy” code and then to scan/fix it.
- Some argue models should be trained to emit secure code, but others reply that serious bugs often span large, dependency-heavy codebases beyond what can be fully reasoned about on each edit.
Business model and ecosystem debates
- Debate over why vendors sell raw tokens vs vertical SaaS:
- One side claims if tokens were truly magical, vendors would hoard them and dominate industries directly.
- Others counter that selling infrastructure (like fabs or tractors) can still be optimal, and building end-user SaaS is a different, distraction-heavy business.
- Some see security harnesses as part of a broader shift: AI companies turning domain-specific harnesses (design, security, etc.) into packaged products.
Open source harnesses and “shop jig” tools
- Multiple alternative or similar tools are mentioned; some trip antivirus and are mainly for practitioners comfortable with that.
- A recurring analogy likens these frameworks to “shop jigs”: custom tooling tuned to a team or individual’s workflow, often better built in-house than used off-the-shelf.
- Commenters discuss making such harnesses portable across jobs, or shared within organizations to raise the team-wide productivity floor.
- There’s a broader theme that AI makes bespoke tooling so cheap that generalized libraries/harnesses are increasingly used as inspiration rather than as direct dependencies.
Trust, naming, and practical concerns
- Confusion around the GitHub account name (“Anthropics” vs “Anthropic”) recurs.
- Some dismiss the project as “open-source glue to an LLM blob” or criticize that it’s unmaintained and closed to contributions.
- A few express distrust of sending source code to remote LLMs or being gated by specific browsers/search engines.