Rich Sutton on AI creativity and discovery
Interpretations of the Core Argument
- Many readers parse the argument as: creativity = variation + evaluation + selective retention, and claim that bare generative models mostly do variation.
- Some think the critique only applies to pretraining-era LLMs, not to modern systems with reinforcement learning and tools.
- Others say the talk is less “anti‑LLM” and more a call to embed evaluation and retention directly into AI workflows.
Novelty vs Quality (“Good and Novel”)
- The claim that LLM outputs are either “novel or good, but never both” is widely challenged.
- Commenters argue random exploration can still land on high‑quality ideas; the premise that randomness and goodness are mutually exclusive is disputed.
- Several note that recombining learned “pieces” (styles, abstractions, procedures) can yield genuinely new, useful compositions.
Role of Evaluation, Feedback, and Harnesses
- Strong agreement that closed‑loop systems (generation + test + refinement) underpin real discovery.
- Coding agents, math systems with proof checkers, and systems using compilers or theorem provers are cited as examples where evaluation is “neurosymbolically closed” and AI makes novel, validated advances.
- Many emphasize that the “agentic harness” around LLMs—tools, tests, self‑play, RL with verifiable rewards—is crucial, and the talk underplays this ecosystem.
Limits, Training Dynamics, and RLVR
- Debate over whether reinforcement learning with verifiable rewards truly expands beyond the base model’s distribution or just “mode‑seeks.”
- Some highlight continual-learning ideas (e.g., periodically reinitializing underused neurons) as a way to maintain variation and plasticity.
- Others argue current planners and search procedures remain “dumb” and are the true bottleneck, not the models.
Human vs Machine Creativity and Evaluation
- Several reject any special metaphysical barrier preventing AI from evaluating or discovering; humans also rely on tools and real‑world experiments.
- Others stress that humans have built‑in goals, embodiment, and long evolutionary history shaping evaluation, which AI lacks.
- Art and emotion: some say AI can create art if art is defined by evoking experiences in a viewer; others insist human intent and lived experience are essential.
Broader Reflections and Skepticism
- Some appreciate the framing of creativity as variation–evaluation–retention but find the conclusions unsurprising or underspecified.
- Concerns arise about over‑reliance on authority and dismissal of new empirical evidence from modern LLMs.
- Several see today’s models as already highly useful “discovery assistants,” even if they never reach paradigm‑shifting scientific genius.