2025-03-05

Richard Sutton and Andrew Barto Win 2024 Turing Award

Overall reaction to the award

Broad agreement it’s “well deserved,” especially for foundational work in reinforcement learning (RL).
Several commenters highlight their RL textbook as a classic: free, influential, and beautifully written, though opinions split on how accessible it is.
Some note this recognition is “a long time coming,” given how much modern RL and agents build on their work.

The Bitter Lesson and black-box AI

Many link and revisit “The Bitter Lesson”: scaling compute and general methods outperform hand-crafted knowledge.
Some lament the move from understandable symbolic systems to opaque “big black boxes,” expressing distrust for using them in high‑risk domains.
Others argue this is the reality of progress and point to AI‑assisted theorem proving and formal verification as a way to get mathematical guarantees even if the model’s internals are opaque.

Formal verification, safety, and “provably safe” AI

Practitioners describe real-world use of formal methods in embedded systems, money handling, healthcare, aviation, and hardware design.
Discussion of “provably safe AGI” notes a core difficulty: formally specifying human values or well‑being.
Some suggest splitting “logical” systems (for safety‑critical tasks) from “feeling” systems for other domains; others point out that even formal logic rests on unprovable axioms.

Deep learning vs human-understanding goals

Commenters contrast two AI research goals: building systems that perform tasks vs understanding human cognition.
Some argue the Bitter Lesson is great if the goal is performance, but less helpful for explaining human intelligence.
Debate over whether Go/Chess systems are “brute force”: consensus that they use heavy search plus learned evaluation, with human-crafted heuristics now largely replaced by learned ones.

Ethics, “successionism,” and public views

A substantial subthread debates one awardee’s publicly stated “successionist” view (AI eventually replacing humanity’s role).
Critics see this as dangerous or disqualifying; defenders say the award is for technical work and characterize his view as utopian/inevitable rather than genocidal.
Several argue that technical achievement and views on AI governance should be evaluated separately, but both are relevant to policy discussions.

RL, education, and future impact

Multiple commenters expect RL to grow in importance, particularly due to test-time compute and flexibility.
Experiences with the RL textbook vary: some find it highly accessible; others find it too formal and recommend more hands-on “grokking” style books first.
There’s nostalgia from early CV and classical NLP practitioners about being overtaken by deep learning, and a meta-lesson about not resisting new approaches that clearly work.

Related topics