Richard Sutton and Andrew Barto Win 2024 Turing Award
Overall reaction to the award
- Broad agreement it’s “well deserved,” especially for foundational work in reinforcement learning (RL).
- Several commenters highlight their RL textbook as a classic: free, influential, and beautifully written, though opinions split on how accessible it is.
- Some note this recognition is “a long time coming,” given how much modern RL and agents build on their work.
The Bitter Lesson and black-box AI
- Many link and revisit “The Bitter Lesson”: scaling compute and general methods outperform hand-crafted knowledge.
- Some lament the move from understandable symbolic systems to opaque “big black boxes,” expressing distrust for using them in high‑risk domains.
- Others argue this is the reality of progress and point to AI‑assisted theorem proving and formal verification as a way to get mathematical guarantees even if the model’s internals are opaque.
Formal verification, safety, and “provably safe” AI
- Practitioners describe real-world use of formal methods in embedded systems, money handling, healthcare, aviation, and hardware design.
- Discussion of “provably safe AGI” notes a core difficulty: formally specifying human values or well‑being.
- Some suggest splitting “logical” systems (for safety‑critical tasks) from “feeling” systems for other domains; others point out that even formal logic rests on unprovable axioms.
Deep learning vs human-understanding goals
- Commenters contrast two AI research goals: building systems that perform tasks vs understanding human cognition.
- Some argue the Bitter Lesson is great if the goal is performance, but less helpful for explaining human intelligence.
- Debate over whether Go/Chess systems are “brute force”: consensus that they use heavy search plus learned evaluation, with human-crafted heuristics now largely replaced by learned ones.
Ethics, “successionism,” and public views
- A substantial subthread debates one awardee’s publicly stated “successionist” view (AI eventually replacing humanity’s role).
- Critics see this as dangerous or disqualifying; defenders say the award is for technical work and characterize his view as utopian/inevitable rather than genocidal.
- Several argue that technical achievement and views on AI governance should be evaluated separately, but both are relevant to policy discussions.
RL, education, and future impact
- Multiple commenters expect RL to grow in importance, particularly due to test-time compute and flexibility.
- Experiences with the RL textbook vary: some find it highly accessible; others find it too formal and recommend more hands-on “grokking” style books first.
- There’s nostalgia from early CV and classical NLP practitioners about being overtaken by deep learning, and a meta-lesson about not resisting new approaches that clearly work.