Back to articles
May 25, 2026

An OpenAI model disproved an 80-year-old math conjecture — a new kind of milestone

OpenAI says one of its reasoning models overturned the planar unit-distance conjecture, open since Erdős posed it in 1946, finding an infinite family of constructions that beat the assumed answer — externally verified. AI just crossed from solving known problems to disproving believed ones.

a blackboard with a lot of writing on itPhoto: Thomas T / Unsplash

An OpenAI reasoning model has disproved a longstanding conjecture in discrete geometry — the planar unit-distance problem, open since Paul Erdős posed it in 1946. For nearly eight decades, mathematicians assumed the best arrangements of points looked like tidy square grids. OpenAI says its model found a new, infinite family of constructions that beat the grid, yielding a polynomial improvement, and that the result was checked by outside mathematicians. This is not an AI solving a homework problem. It's an AI overturning something the field believed.

What actually happened

The distinction that matters is disproof. Solving a known problem — even a hard one — is, at some level, search: find the path to an answer that exists. Disproving a longstanding conjecture means producing a genuinely new mathematical object that nobody had constructed, one that contradicts the prevailing intuition. The model didn't retrieve a proof; it generated counterexamples — an infinite family of them — that do measurably better than the structure everyone assumed was optimal.

Two guardrails keep this from being hype. First, it's narrow: one problem, in one subfield, not a general theorem-proving machine. Second, it was externally verified — human mathematicians confirmed the construction holds. In a domain where a confident wrong answer is worthless, that verification is the whole story.

Our read

This is the most concrete sign yet that frontier models are crossing from assistant to collaborator in research. The line we've held — AI is great at reproducing known reasoning, useless at original insight — just got a real counterexample, in the most unforgiving discipline there is. Math doesn't grade on vibes; a result is right or it isn't, and this one is right.

It rhymes with the shift we've traced across software: the human's job migrates from producing the work to specifying, steering, and verifying it. A model that can propose novel mathematical structures is only useful if someone can check them — which is exactly why the scarce skill becomes verification, the discipline we argued is the real job AI leaves behind. The Erdős result is a preview of that world: the machine generates the candidate; humans (and simulations, and proof checkers) decide whether it's true.

Here's the catch. One disproved conjecture is a milestone, not a revolution. Mathematics is enormous, and a single polynomial improvement in one corner of discrete geometry doesn't mean models are about to clear the field's open problems. The honest read is narrower and more interesting: we now have proof of capability, not proof of generality. The open questions are whether this approach reproduces across other problems, how much human scaffolding it actually required, and whether the model can do the harder thing — not just find a better construction, but explain why it works in a way that advances human understanding rather than just the leaderboard.

Still, the direction is set. For decades, "AI can't do real math" was a comfortable line for anyone who wanted to believe original thought was safely human. It's no longer true in the absolute. The frontier now isn't whether machines can contribute new mathematics — it's how fast, how broadly, and who's qualified to check the work. That last question is the one that will define the next few years of research, in math and well beyond it.


Reporting from OpenAI and TechCrunch.

Support our workIndependent, reader-funded tech journalism. If a piece helped you, chip in.Chip in →