2024-06-11

Terence Tao on proof checkers and AI programs

Role of AI in Mathematics

Many see AI as a promising assistant for formal proofs, lemma search, and tedious case work, analogous to neural nets in chess engines rather than full automation of mathematics.
Some expect AI will soon autocomplete or significantly assist in current research, though “math being solved in 2–3 years” is widely viewed as unrealistic.
Others stress that even ugly, AI-found proofs are valuable starting points that humans can later simplify and understand.

Proof Assistants and Lean

Proof assistants (Lean, Coq, Isabelle, HOL, Metamath, etc.) are central to the discussion; Lean is described as currently dominant in pure math, with a large library and strong community support.
Tools like Isabelle’s Sledgehammer and Coq’s analogs already combine ATP/SMT solvers with interactive proof, though resulting proofs can be unreadable or brittle.
Formalization is viewed as hard but increasingly feasible; some fields (logic, algebra) fit better than highly geometric or pictorial areas, though there are projects formalizing deep topology (e.g., sphere eversion).

Correctness, Complexity, and NP

Multiple comments stress the distinction between finding a proof and checking one.
Verification in typical proof systems is said to be linear in proof size: check axioms and each inference rule.
There’s debate over complexity-theoretic framing (NP, k-SAT), but consensus that checking is much easier than search.

LLMs, Reliability, and “Hallucinations”

Some worry that using LLMs to translate informal math to Lean could misrepresent the intended theorem; others reply that humans only need to verify the formal statement and definitions, since the checker guarantees the proof.
A counterpoint notes that once AI also generates definitions, verifying they match intended concepts becomes nontrivial.
Concrete anecdotes describe GPT-4 producing correct, useful lemmas and slick derivations, but also examples where it outputs superficially plausible yet incoherent or subtly wrong arguments.

Adoption and Career Dynamics

One camp sees proof assistants as still niche and too costly for early-career researchers under publish-or-perish pressure; another reports active use, community growth, and even grants specifically for formalization.
There’s discussion of emerging roles like “mathematical programmers” who formalize others’ proofs, reflecting a division of labor.

Broader Reflections

Some compare the coming “project-managed” math ecosystem to the industrialization of software: less romantic individual genius, more coordinated large-scale projects.
Opinions diverge on whether AI and mechanization herald a revolution in what math can tackle or a threat to the traditional culture of mathematics.

Related topics