2026-05-20

An OpenAI model has disproved a central conjecture in discrete geometry

Model capabilities and comparative performance

Many commenters see this as strong evidence that frontier LLMs now match or exceed typical PhD‑level performance on some narrow math tasks, especially when combined with good scaffolding.
Others report divergent day‑to‑day experiences: some find OpenAI better for research “getting things done,” Google’s Gemini better for pedagogy and web retrieval, Claude best for general interaction but weaker for deep research.
Several note recent similar math/physics successes (other Erdős problems, theoretical physics results, “deep research” agents), viewing this as part of a trend rather than an isolated miracle.

Methodology, scaffolding, and transparency

OpenAI says the proof came from a general‑purpose internal model, not a special math system; critics point out this does not exclude undisclosed scaffolding (parallel sampling, verifiers, Lean‑style tools).
There are repeated calls for details: prompts, number of attempts, total tokens/compute, and whether any specialized training data or auto‑generated theorem‑proving corpora were used.
Some suspect significant cherry‑picking and marketing spin, or even that human‑generated insights were fed into training; others consider that unlikely given independent mathematical validation.

Nature and significance of the mathematical result

The conjecture was disproved by showing configurations with > n¹⁺δ unit distances for infinitely many n, where humans had long believed only “essentially linear” growth was possible.
Several mathematicians (in the linked remarks PDF) view the techniques as drawing on known algebraic number theory tools applied in a new combination, not a radically new theory.
Commenters stress it’s a disproof via existence, not a constructive picture of the configuration; no visual example is provided, which many find frustrating.
Some argue finding a counterexample is algorithmically more “search‑like” and less conceptually deep than proving the conjecture true.

Erdős problems and benchmarking AI in math

Erdős problems are seen as a de facto benchmark: numerous, curated, spanning difficulty, and often easy to state but nontrivial to solve.
They are also attractive because many are important but not so central that decades of focused expert work have already exhausted all low‑hanging fruit.

Implications for research, work, and creativity

Working scientists and students report LLMs already “supercharge” literature review, explanation, and navigating citation graphs, while warning about missed subtleties and hallucinations.
Philosophical debate recurs: are LLMs merely “interpolating” existing knowledge, and is that fundamentally different from human mathematical discovery?
Some foresee AI doing Fields‑Medal‑level work before it can run a McDonald’s; others argue managing messy real‑world systems is a harder, different kind of intelligence.

Related topics