2025-07-30

The Math Is Haunted

Lean tactics, readability, and UX

Several commenters find Lean tactics (e.g. rfl, rw) overloaded, opaque, and dependent on distant context, making proofs feel “fuzzy” compared to low-level languages.
Others note you can inspect tactic definitions, use pattern-matching / term-style proofs instead of tactics, and rely on indentation conventions to show proof structure, but acknowledge a learning cliff.
There’s active work on a more explicit rw interface (position parameters, GUI widgets) and better error messages, plus external visualization tools like Paperproof and Lean→LaTeX exports.
Some worry that reading proofs without interactive tooling is hard; others say they scan for structure and lemmas rather than line-by-line states.

Comparisons: Lean, Coq/Rocq, Agda, Metamath, Mizar

Agda is preferred by some for direct proof terms and strong dependent pattern matching; Lean is seen as a middle ground, Coq/Rocq as more annotation-heavy.
Metamath and Metamath Zero are praised for minimal cores but require memorizing many lemmas; Lean’s tactic layer trades simplicity for usability.
Mizar is mentioned as an older system with a large library; Lean’s mathlib is perceived as having more momentum currently.

Kernel, tactics, “sorry”, and verification

Tactics are user-level; only the small kernel must be trusted. This can cause proofs to break across versions when tactic behavior changes (e.g. simp), so people pin down theorem lists (simp only [...]).
Lean produces binary “olean” objects containing fully elaborated terms that can be independently rechecked. Commands like #print axioms and extra tools (e.g. SafeVerify) help ensure no hidden axioms or sorry are used.

Formal statement vs intended meaning

Multiple comments stress that proof assistants only guarantee “if this formal statement is what you meant, the proof is correct”; they don’t ensure the statement matches human intent.
This is likened to the spec/implementation gap in software; there’s skepticism that any purely technical solution exists, though LLMs and computational semantics are floated as partial aids.

Applications beyond pure math

People speculate about Lean‑like systems for fact-checking news, mapping argument trees, or Bayesian reasoning; others respond that natural language ambiguity, rhetoric, and evidential reasoning differ fundamentally from mathematical proof.
Various argument‑mapping and debate tools (Kialo, Argdown, Claimify, older systems) are cited as lighter-weight attempts in this direction.

Foundations, axioms, and creativity

Discussion touches on Euclid, ZFC, controversial axioms (e.g. choice), and the fact that axioms need not be “cosmically true,” just assumed within a theory.
Some fear formal systems might constrain mathematical creativity; others counter that Lean lets you add arbitrary axioms and explore consequences, including intentionally “haunted” inconsistent ones.
There’s brief mention of localized inconsistency and paraconsistent logics as a research topic.

Gamification and learning

Several proof‑puzzle games and teaching resources (Natural Number Game, other proof games) are shared; some enjoy the “logic game” feel, others find early Lean proofs opaque or mis-termed (e.g. “goals”).
Opinions differ on whether Lean is currently good for learning advanced math; some say pencil-and-paper is still better, though formalization projects (e.g. analysis companions) are used for study.

AI and the future of formal methods

Commenters see strong synergy between LLMs and proof assistants, but note current LLM performance in Lean/Coq is mediocre and their scripts are painful to debug.
Research efforts (e.g. DeepSeek-Prover, SMT integration, tools from major labs, ongoing formalization of Fermat’s Last Theorem) are cited as promising, with a sense that AI may significantly reshape the landscape.
For now, Lean and Coq/Rocq are viewed as the safest bets in terms of community, libraries, and real-world use; Idris and Agda are seen as smaller, more niche.

Related topics