






















POLAND - 2023/01/19: In this photo illustration an Open AI logo is displayed on a smartphone with stock market percentages on the background. (Photo Illustration by Omar Marques/SOPA Images/LightRocket via Getty Images)
SOPA Images/LightRocket via Getty Images
OpenAI announced this week that one of its general-purpose reasoning models made a breakthrough in a complex field of mathematics, and this has grabbed the attention of elite mathematicians. Specifically, OpenAI said in an announcement published on its website that its models disproved a central conjecture in an area of discrete geometry tied to Paul Erdős’s planar unit distance problem, a famously stubborn question first posed in 1946. The company said its model found an infinite family of point arrangements that beat the long-favored square-grid intuition. Outside mathematicians checked the results and with one noted researcher calling the result, “a milestone in AI mathematics.”
The problem begins with a question Erdős posed in 1946: place n points on a plane, then count how many pairs sit exactly one unit apart. For nearly 80 years, mathematicians suspected that the best answers would look roughly like square grids. OpenAI now says one of its internal general-purpose reasoning models has disproved that long-held conjecture by producing an infinite family of examples that have a substantial improvement over the grid-based constructions.
After external mathematicians checked the proof, according to OpenAI, they validated the approach. The Guardian reported that mathematicians described the result as significant, but stressed that the larger entire planar unit distance problem remains open.
While it’s no surprise that computers can indeed compute, the bigger insight is that increasingly powerful frontier models may now produce strange, useful solutions to even well-tested problems that experts can test, repair and turn into greater discoveries. In this case, for decades, the planar unit distance problem rewarded a certain kind of mathematical instinct. Points need to be one unit apart, so intuitively grids seem right.
OpenAI’s model didn’t have any of those instincts or pre-conceived notions, and so simply looked at different areas of mathematics for combinations humans didn’t think to test. The model found a construction that broke a central belief in discrete geometry, which humans then refined, simplified and explained the proof. The companion paper, written by Noga Alon, Thomas F. Bloom, W. T. Gowers, Daniel Litt, Will Sawin, Arul Shankar, Jacob Tsimerman, Victor Wang and Melanie Matchett Wood, calls it a “short, digested, human-verified version” of an OpenAI-generated counterexample.
MORE FOR YOU
Specifically, the model did not publish mathematics by itself. It produced a strange object worth the time of elite mathematicians. They then did what their field demands. They checked it, made it more concise in the language of mathematicians, connected it to prior theory and asked what it really means.
AI labs have spent years turning math into a scoreboard as a way of showing how sophisticated and capable their models are. Models compete on olympiad problems, coding tests and formal proof benchmarks. Those contests matter, but the problems are curated. The answers are known and the target is fixed.
Google DeepMind and OpenAI had already shown that frontier models can perform at high levels on olympiad-style reasoning. Reuters reported in July 2025 that models from both companies reached gold-medal performance at the International Mathematical Olympiad, with the systems solving five of six problems. An olympiad medal says a model can solve hard problems under contest rules.
In this case, OpenAI’s same general model approach was used, but in an area where the answers weren’t known and the solution target was open to reconsideration. OpenAI’s announcement says the model was not built only for this problem, not scaffolded for a special proof search and not trained as a single-purpose mathematics engine. OpenAI saw the problem as a way of seeing whether advanced models can contribute to frontier research.
The most important question behind the announcement goes beyond the proof. Did OpenAI’s model find a genuinely new solution, or did it pull together existing research and papers whose implications humans had not realized they could put together?
The honest answer appears to be both. The companion paper says the argument relies on ideas that, “at least in retrospect,” trace back to work by Ellenberg-Venkatesh, Golod-Shafarevich and Hajir-Maire-Ramakrishna. That means the proof did not arrive from nowhere. Its ingredients were already in mathematics. Notably, that makes the result more believable, not less.
Rather than coming up with an approach out of the blue, AI served more as a master curator, assembling work that people had already put together in bits and pieces. The history of mathematics is full of that sort of approach. A technique from one field of mathematics gets uniquely applied in another, and turns a famous obstacle into a solution.
In the case of the Erdős problem, the old grid-based intuition was geometric. OpenAI says its model found examples that beat the grid-like constructions mathematicians had treated as basically optimal. The Times described the result as drawing from unexpected areas, including algebraic number theory.
Mathematicians who by personality are often the first to be skeptical of big claims are supporting this recent announcement. Tim Gowers, a Fields Medalist, called the result a milestone in AI mathematics, and positioned it as one of the first clear cases of AI independently solving a famous open mathematical problem.
Scientific American reported that mathematician Daniel Litt called it the most “unique interesting result produced autonomously by AI so far.” The same article described the result as the first AI proof of a kind that would likely be publishable in a top math journal if humans had produced it alone. Gil Kalai, a prominent combinatorialist, wrote that the disproof was “amazing” and in an online post pointed readers to the technical paper and framed the event as noteworthy inside combinatorics, not merely inside the AI industry.
Still, the expert reaction is not a coronation. Gowers and others have drawn a boundary between finding a counterexample and developing a broad theory. A counterexample can come from search, stubbornness and a lucky collision of known tools. A sweeping upper-bound proof may require a different level of conceptual control.
The result points toward a new division of labor. Rather than treat AI as some alien technology that is more threatening than useful, mathematicians are now thinking that AI models may be a useful companion to propose unusual-looking constructions. They can then decide whether those constructions are real, whether they matter and whether they reveal a principle. This is similar to the role that AI is playing in pharmaceutical development and medical research, where AI is used to provide options and possibilities that humans put to the test.
AI systems will generate many dead ends, rediscoveries and ugly half-proofs. Human experts will need to sort signals from the noise. In that world, AI acts more as a collaborator than a solitary genius. This approach will allow the best mathematicians to spend less time producing every candidate idea and more time judging which idea candidate deserves greater scrutiny.
With this announcement, OpenAI is arguing that AI can do more than just write, code, summarize and tutor. It sees its role as a genuinely helpful collaborator on some of the biggest problems facing mankind. That market is the one every frontier lab wants. Math is a good test case because new ideas and proofs can be checked. In biology, chemistry or materials science, a model’s idea may need months of real world testing and lab work that could take years to validate. In mathematics, a proposed proof can be attacked line by line. That makes the field a critical proving ground for claims about reasoning.
This is why the originality question matters so much. If the model merely surfaces hidden, but existing answers, then the limits to what AI can provide will become clear. If AI models assemble even partially known ideas into something more substantial, then they can be more useful. Coming up with completely new ideas that have never been written or considered before might be the extreme of possibilities, at least for now.
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。