2026-05-16

Frontier AI has broken the open CTF format

Impact of LLMs on CTFs

Many agree frontier LLMs now solve a large fraction of Jeopardy-style CTF challenges quickly, turning open online events into “who has the most/best agents and tokens.”
This shifts CTFs from human-skill contests to AI-orchestration or spending contests, similar to earlier shifts toward large, tool-heavy “mega-teams.”
Some see this as strongly confirming the “bitter lesson”: general models beat narrow security tools.

Fairness, Cheating, and Rules

Historically, heavy tooling and automation were culturally accepted; “attacking the infra” vs “play as intended” already split the community.
Banning AI is seen as nearly unenforceable in remote CTFs; easy to hide AI usage compared to, say, engine cheating in chess.
Some events now have dual leaderboards (AI-assisted vs “human”), or explicitly forbid LLMs in onsite finals with modest prizes to reduce incentive to cheat.

Learning, Skill, and Education Parallels

A core loss identified: the “ladder” for beginners. If top of the board is AI-driven, novices are pushed to outsource instead of struggle and learn.
Organizers report AI users often can’t explain their solves (“no idea what it did, but here’s the flag”), undermining learning and shared writeups.
Strong parallels drawn to universities and programming education: students using AI for assignments, then failing later because fundamentals never formed.

Proposed Adaptations

Move to offline/in‑person CTFs: organizer-provided machines, network isolation, possibly Faraday cages; but this is logistically and financially hard.
Design AI-hostile challenges: temporal/real‑time, multimodal, game-engine embedded, counterfactual puzzles, physical/lockpicking tasks, or real‑world interaction loops.
Others warn that “just make it harder” or more obfuscated often degrades educational value into guesswork and further excludes newcomers.

Comparisons and Broader Reflections

Chess/Go analogies: those games banned engines and built strong anti-cheat; some argue CTFs should similarly have human-only, AI-assisted, and AI-only tracks.
Others argue CTFs were always artificial training games; if pentesting tasks are automatable by AI, the field itself is changing, not just the competitions.
Some are optimistic: AI as powerful tutor and productivity tool; others worry about an emerging class of practitioners who can “ship” with AI but can’t understand or debug what they deploy.

Meta: Article, Terminology, and UX

Multiple complaints about the article/site: hard-to-read styling, lack of defining “CTF,” ambiguous use of “frontier AI,” and a title that confused non‑insiders.
Several note acronyms and insider language make the discussion opaque to people outside the CTF/security subculture.

Related topics