2026-06-10

Rich Sutton on AI creativity and discovery

Interpretations of the Core Argument

Many readers parse the argument as: creativity = variation + evaluation + selective retention, and claim that bare generative models mostly do variation.
Some think the critique only applies to pretraining-era LLMs, not to modern systems with reinforcement learning and tools.
Others say the talk is less “anti‑LLM” and more a call to embed evaluation and retention directly into AI workflows.

Novelty vs Quality (“Good and Novel”)

The claim that LLM outputs are either “novel or good, but never both” is widely challenged.
Commenters argue random exploration can still land on high‑quality ideas; the premise that randomness and goodness are mutually exclusive is disputed.
Several note that recombining learned “pieces” (styles, abstractions, procedures) can yield genuinely new, useful compositions.

Role of Evaluation, Feedback, and Harnesses

Strong agreement that closed‑loop systems (generation + test + refinement) underpin real discovery.
Coding agents, math systems with proof checkers, and systems using compilers or theorem provers are cited as examples where evaluation is “neurosymbolically closed” and AI makes novel, validated advances.
Many emphasize that the “agentic harness” around LLMs—tools, tests, self‑play, RL with verifiable rewards—is crucial, and the talk underplays this ecosystem.

Limits, Training Dynamics, and RLVR

Debate over whether reinforcement learning with verifiable rewards truly expands beyond the base model’s distribution or just “mode‑seeks.”
Some highlight continual-learning ideas (e.g., periodically reinitializing underused neurons) as a way to maintain variation and plasticity.
Others argue current planners and search procedures remain “dumb” and are the true bottleneck, not the models.

Human vs Machine Creativity and Evaluation

Several reject any special metaphysical barrier preventing AI from evaluating or discovering; humans also rely on tools and real‑world experiments.
Others stress that humans have built‑in goals, embodiment, and long evolutionary history shaping evaluation, which AI lacks.
Art and emotion: some say AI can create art if art is defined by evoking experiences in a viewer; others insist human intent and lived experience are essential.

Broader Reflections and Skepticism

Some appreciate the framing of creativity as variation–evaluation–retention but find the conclusions unsurprising or underspecified.
Concerns arise about over‑reliance on authority and dismissal of new empirical evidence from modern LLMs.
Several see today’s models as already highly useful “discovery assistants,” even if they never reach paradigm‑shifting scientific genius.

Related topics