An OpenAI model has disproved a central conjecture in discrete geometry
Model capabilities and comparative performance
- Many commenters see this as strong evidence that frontier LLMs now match or exceed typical PhD‑level performance on some narrow math tasks, especially when combined with good scaffolding.
- Others report divergent day‑to‑day experiences: some find OpenAI better for research “getting things done,” Google’s Gemini better for pedagogy and web retrieval, Claude best for general interaction but weaker for deep research.
- Several note recent similar math/physics successes (other Erdős problems, theoretical physics results, “deep research” agents), viewing this as part of a trend rather than an isolated miracle.
Methodology, scaffolding, and transparency
- OpenAI says the proof came from a general‑purpose internal model, not a special math system; critics point out this does not exclude undisclosed scaffolding (parallel sampling, verifiers, Lean‑style tools).
- There are repeated calls for details: prompts, number of attempts, total tokens/compute, and whether any specialized training data or auto‑generated theorem‑proving corpora were used.
- Some suspect significant cherry‑picking and marketing spin, or even that human‑generated insights were fed into training; others consider that unlikely given independent mathematical validation.
Nature and significance of the mathematical result
- The conjecture was disproved by showing configurations with > n¹⁺δ unit distances for infinitely many n, where humans had long believed only “essentially linear” growth was possible.
- Several mathematicians (in the linked remarks PDF) view the techniques as drawing on known algebraic number theory tools applied in a new combination, not a radically new theory.
- Commenters stress it’s a disproof via existence, not a constructive picture of the configuration; no visual example is provided, which many find frustrating.
- Some argue finding a counterexample is algorithmically more “search‑like” and less conceptually deep than proving the conjecture true.
Erdős problems and benchmarking AI in math
- Erdős problems are seen as a de facto benchmark: numerous, curated, spanning difficulty, and often easy to state but nontrivial to solve.
- They are also attractive because many are important but not so central that decades of focused expert work have already exhausted all low‑hanging fruit.
Implications for research, work, and creativity
- Working scientists and students report LLMs already “supercharge” literature review, explanation, and navigating citation graphs, while warning about missed subtleties and hallucinations.
- Philosophical debate recurs: are LLMs merely “interpolating” existing knowledge, and is that fundamentally different from human mathematical discovery?
- Some foresee AI doing Fields‑Medal‑level work before it can run a McDonald’s; others argue managing messy real‑world systems is a harder, different kind of intelligence.