惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
Cybersecurity and Infrastructure Security Agency CISA
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
D
Darknet – Hacking Tools, Hacker News & Cyber Security
Know Your Adversary
Know Your Adversary
Malwarebytes
Malwarebytes
K
Kaspersky official blog
The Register - Security
The Register - Security
N
News and Events Feed by Topic
H
Hacker News: Front Page
T
The Exploit Database - CXSecurity.com
T
Tor Project blog
S
Secure Thoughts
Stack Overflow Blog
Stack Overflow Blog
Stack Overflow Blog
Stack Overflow Blog
Recent Announcements
Recent Announcements
Vercel News
Vercel News
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 热门话题
T
ThreatConnect
量子位
Apple Machine Learning Research
Apple Machine Learning Research
Application and Cybersecurity Blog
Application and Cybersecurity Blog
S
Security Archives - TechRepublic
Recent Commits to openclaw:main
Recent Commits to openclaw:main
雷峰网
雷峰网
F
Fortinet All Blogs
Y
Y Combinator Blog
Last Week in AI
Last Week in AI
月光博客
月光博客
P
Proofpoint News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
AWS News Blog
AWS News Blog
T
Tailwind CSS Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
罗磊的独立博客
P
Privacy & Cybersecurity Law Blog
U
Unit 42
L
LINUX DO - 最新话题
M
MIT News - Artificial intelligence
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Cyberwarzone
Cyberwarzone
V
Vulnerabilities – Threatpost
F
Fox-IT International blog
MongoDB | Blog
MongoDB | Blog
Google Online Security Blog
Google Online Security Blog
博客园 - 司徒正美
C
CXSECURITY Database RSS Feed - CXSecurity.com
Engineering at Meta
Engineering at Meta
C
Check Point Blog
李成银的技术随笔

Forbes - Innovation

SpaceX Faces A Crucial Launch Test Ahead Of Its IPO What To Watch This Weekend: New Shows And Movies To Stream On Netflix, Hulu, Prime Video, Apple TV And More ‘Zero Parades’ Review: The Opera Never Tires 10 Best Star Wars Games To Play In 2026 The Rise Of The Multimodal LLM Apple Loop: iPhone 18 Pro Upgrades, Fortnite Returns To The App Store, iPhone Fold Delays Android Headlines: Galaxy Z Fold 8 Feature, RedMagic 11S Pro Debuts, Anker’s AI Earbuds Immigration Service May Significantly Restrict Green Cards In The U.S. Starbucks Drops AI As Meta And Intuit Cut 11,000 Jobs Why Hybrid AI Is No Longer Optional In Banking And Finance Did AI Really Beat ER Doctors At Diagnosis? Here’s What The Study Showed Did AI Really Beat ER Doctors At Diagnosis? No, Here’s What Study Really Showed Quordle Hints Today: Saturday, May 23 Clues And Answers Samsung Unveils Immersive Moomin Takeover And Pop-Up Experience At London King's Cross Manziel Vs. Menery Full Card, Ring Walk Times For Brand Risk 14 The AI Video Race Is Moving Beyond Pretty Clips Google Signals AI Video’s Shift From Clip Generation To Production Crafoord Prize Winner Ramanathan: Climate Action Enters Its “How” Phase Marriage Benefits Men's Life Expectancy More Than Women's 3 Steps Not To Ignore In Nature Plans The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good 2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word 'Where' 2026 America Innovates | Fracking Pioneer Harold Hamm Calls Oil And Gas The Most Reliable Energy For AI Why Tom Hardy Was Reportedly Just Fired From ‘Mobland’ Season 3 How Small Studios Outrun Bigger Teams Sony Launches Reon Pocket Pro Plus Wearable Air Conditioner In Time For Summer Heatwaves Industry 5.0 Is Changing The Meaning Of Automation Garmin Watches, Coros And More Now Pair Better With Strava NYT Connections Hints Today: Saturday, May 23 Groups And Answers (#1077) The Architectural Difference Between Legal Productivity AI And EDiscovery AI ‘The Mandalorian And Grogu’ Sets A Rotten Tomatoes Audience Score Record How AI Tools Are Redefining The Role Of Technical Founders Apple Spotlights Student Entrepreneurs In Great Ideas Start Here Campaign The Growing Cybersecurity Risks To The Supply Chain In The AI Era Your Website Is Decaying Consumer Intent Faster Than You Think With ‘Destiny 2’ Gone, No ‘Destiny 3’ Is Coming ​How Operational Access Can Ensure Readiness For The Next Storm Why Russians Are In Despair Over Truck-Busting ’Martian’ Drones New ‘Crimson Desert’ Patch Adds Another Long-Time Player Request The Architecture Behind Cost-Effective AI Agents How To Think About High-Stakes Dispute Resolution Why Do Our Fingers Get Wrinkly In Water? An Evolutionary Biologist Explains You Can Build A CRM In A Day. You Still Can't Run A Company In One. 6 Teachable Moments From An Atlanta Rush Hour Downpour Why Your AI-Generated Marketing Content Sounds Generic ​The Accountability Crisis In The Creator Economy Scaling Across Borders: What It Takes To Succeed Globally Apple Rolls Out Two Crucial Health Features For Apple Watch And AirPods In India Competitive Advantage In Logistics Isn't AI ​Why AI Can Write Code, But It Can't Teach Engineers Critical Thinking The Importance Of Red Teaming For Scaling Enterprise AI Agents Why The Next AI Moat Won’t Be Productivity, But Emotional Value Banking’s AI Problem Isn’t The Model. It’s The Plumbing The Case For Structural Reform Through Tokenization SpaceX Scrubs Starship Launch As $2 Trillion IPO Nears LEGO F1 Ferrari Helmet Review (43014): Rough Build, Spectacular Finish Oleksandr Usyk Vs. Rico Verhoeven: Date, Time And How To Watch If Majoring In Computer Science Is Doomed Due To AI, The Latest Claim Is That Majoring In Philosophy Is The Next Best Choice MVP's Nakisa Bidarian On Rousey-Carano Viewership, Shields' Ban And PFL Co-Promotion See A ‘Planet Parade’ As Three Worlds Shine After Sunset This Weekend Soundcore’s Liberty 5 Are First Earbuds To Use Anker’s Thus AI Chip Code Ninjas: The AI-In-Education Problem Isn’t Cheating. It’s Passivity. Today’s Wordle #1798 Hints And Answer For Friday, May 22 NYT ‘Pips’ Hints, Answers And Walkthrough For Friday, May 22 Apple Teases iOS 27 AI Upgrades With Major Accessibility Overhaul To iPhone Samsung Releases Free One UI 8.5 Upgrade To Millions Of Galaxy Phones How Instagram Became A Venture Capital Deal Engine ‘Star Wars: The Mandalorian And Grogu’: Which Movie Is Best? New Study: A Quarter Of College Students Using AI Daily Cheat With It NYT Connections Hints Today: Friday, May 22 Clues And Answers (#1,076) NYT Connections Answers Explained Friday May 22 NYT Strands Hint Today: Friday, May 22 Clues And Answers (Put Down Your Ruler) Quordle Hints Today: Friday, May 22 Clues And Answers Webb Telescope Detects Cloudy Mornings And Clear Nights On Alien World AI Flattening Organizations Is The Latest Chapter In A Continuing Story AI Was Supposed To Reduce Your Workload. Here’s Why It Hasn’t, And Here’s How It Can. DevOps Practices Tech Teams Must Strengthen In The AI Era The End Of ‘Destiny 2’: All Expansions Canceled, Maintenance Mode Incoming ‘The Mandalorian And Grogu’ Recap Before You See The Movie, Post-Credits Scene And More Fidelity Collective Buys Up Westone Audio And Etymotic Brands Why AI Profitability Belongs To Enterprise, Not Consumer Scale OpenAI And Anthropic Are Testing Two Very Different AI Business Models Kordata Launches To Advance Neurotech-Powered Clinical Trials Solving The Identity Crisis: Putting Today’s Fragmented Consumer Back Together These Are The Most- And Least-Expensive New Cars To Run At Today’s Fuel Prices New Reports And New Paradigms Show Drive In AI Smart Glasses Market Samsung Galaxy Z Fold 8: Price Rise, Bad Crease News Anthropic And Microsoft Team Up Why Nvidia Needs More Than GPUs To Win The AI Infrastructure Race Nvidia Is Expanding Infra Partnerships. Will A Big Deal Happen? Drug Overdose Deaths Fell in 2024. Why Experts Remain Cautious Microsoft Is Scrapping SMS 2FA Codes—What You Need To Do ‘Wax Heads’ Review: Somehow The Vital Connection Is Made Securing The Internet’s Humanity Netflix’s Best New Show Lands A Perfect Rotten Tomatoes Score As A Final Duffer Bros. Effort AI Might Not Bring On A Job Crisis, But A Workforce ‘Mismatch’ Could Why Post-Quantum Compliance For Banks Starts In Containers Do Your AI Agents Have Governance? Most Don’t, And They’re Live Why Complexity Is The Insider Threat Hiding In Plain Sight ‘Supergirl’ Is Starting To Feel Like It May Be A Big DCU Miss
The AI Breakthrough That Has Mathematicians Paying Attention
Ron Schmelze · 2026-05-23 · via Forbes - Innovation
In this photo illustration an Open AI logo is displayed on a

POLAND - 2023/01/19: In this photo illustration an Open AI logo is displayed on a smartphone with stock market percentages on the background. (Photo Illustration by Omar Marques/SOPA Images/LightRocket via Getty Images)

SOPA Images/LightRocket via Getty Images

OpenAI announced this week that one of its general-purpose reasoning models made a breakthrough in a complex field of mathematics, and this has grabbed the attention of elite mathematicians. Specifically, OpenAI said in an announcement published on its website that its models disproved a central conjecture in an area of discrete geometry tied to Paul Erdős’s planar unit distance problem, a famously stubborn question first posed in 1946. The company said its model found an infinite family of point arrangements that beat the long-favored square-grid intuition. Outside mathematicians checked the results and with one noted researcher calling the result, “a milestone in AI mathematics.”

A Novel Approach To Solving A Problem

The problem begins with a question Erdős posed in 1946: place n points on a plane, then count how many pairs sit exactly one unit apart. For nearly 80 years, mathematicians suspected that the best answers would look roughly like square grids. OpenAI now says one of its internal general-purpose reasoning models has disproved that long-held conjecture by producing an infinite family of examples that have a substantial improvement over the grid-based constructions.

After external mathematicians checked the proof, according to OpenAI, they validated the approach. The Guardian reported that mathematicians described the result as significant, but stressed that the larger entire planar unit distance problem remains open.

While it’s no surprise that computers can indeed compute, the bigger insight is that increasingly powerful frontier models may now produce strange, useful solutions to even well-tested problems that experts can test, repair and turn into greater discoveries. In this case, for decades, the planar unit distance problem rewarded a certain kind of mathematical instinct. Points need to be one unit apart, so intuitively grids seem right.

OpenAI’s model didn’t have any of those instincts or pre-conceived notions, and so simply looked at different areas of mathematics for combinations humans didn’t think to test. The model found a construction that broke a central belief in discrete geometry, which humans then refined, simplified and explained the proof. The companion paper, written by Noga Alon, Thomas F. Bloom, W. T. Gowers, Daniel Litt, Will Sawin, Arul Shankar, Jacob Tsimerman, Victor Wang and Melanie Matchett Wood, calls it a “short, digested, human-verified version” of an OpenAI-generated counterexample.

MORE FOR YOU

Specifically, the model did not publish mathematics by itself. It produced a strange object worth the time of elite mathematicians. They then did what their field demands. They checked it, made it more concise in the language of mathematicians, connected it to prior theory and asked what it really means.

From Benchmarks To Open Problems

AI labs have spent years turning math into a scoreboard as a way of showing how sophisticated and capable their models are. Models compete on olympiad problems, coding tests and formal proof benchmarks. Those contests matter, but the problems are curated. The answers are known and the target is fixed.

Google DeepMind and OpenAI had already shown that frontier models can perform at high levels on olympiad-style reasoning. Reuters reported in July 2025 that models from both companies reached gold-medal performance at the International Mathematical Olympiad, with the systems solving five of six problems. An olympiad medal says a model can solve hard problems under contest rules.

In this case, OpenAI’s same general model approach was used, but in an area where the answers weren’t known and the solution target was open to reconsideration. OpenAI’s announcement says the model was not built only for this problem, not scaffolded for a special proof search and not trained as a single-purpose mathematics engine. OpenAI saw the problem as a way of seeing whether advanced models can contribute to frontier research.

Did The Model Discover Or Assemble?

The most important question behind the announcement goes beyond the proof. Did OpenAI’s model find a genuinely new solution, or did it pull together existing research and papers whose implications humans had not realized they could put together?

The honest answer appears to be both. The companion paper says the argument relies on ideas that, “at least in retrospect,” trace back to work by Ellenberg-Venkatesh, Golod-Shafarevich and Hajir-Maire-Ramakrishna. That means the proof did not arrive from nowhere. Its ingredients were already in mathematics. Notably, that makes the result more believable, not less.

Rather than coming up with an approach out of the blue, AI served more as a master curator, assembling work that people had already put together in bits and pieces. The history of mathematics is full of that sort of approach. A technique from one field of mathematics gets uniquely applied in another, and turns a famous obstacle into a solution.

In the case of the Erdős problem, the old grid-based intuition was geometric. OpenAI says its model found examples that beat the grid-like constructions mathematicians had treated as basically optimal. The Times described the result as drawing from unexpected areas, including algebraic number theory.

Mathematicians Are Not Dismissing It

Mathematicians who by personality are often the first to be skeptical of big claims are supporting this recent announcement. Tim Gowers, a Fields Medalist, called the result a milestone in AI mathematics, and positioned it as one of the first clear cases of AI independently solving a famous open mathematical problem.

Scientific American reported that mathematician Daniel Litt called it the most “unique interesting result produced autonomously by AI so far.” The same article described the result as the first AI proof of a kind that would likely be publishable in a top math journal if humans had produced it alone. Gil Kalai, a prominent combinatorialist, wrote that the disproof was “amazing” and in an online post pointed readers to the technical paper and framed the event as noteworthy inside combinatorics, not merely inside the AI industry.

Still, the expert reaction is not a coronation. Gowers and others have drawn a boundary between finding a counterexample and developing a broad theory. A counterexample can come from search, stubbornness and a lucky collision of known tools. A sweeping upper-bound proof may require a different level of conceptual control.

The result points toward a new division of labor. Rather than treat AI as some alien technology that is more threatening than useful, mathematicians are now thinking that AI models may be a useful companion to propose unusual-looking constructions. They can then decide whether those constructions are real, whether they matter and whether they reveal a principle. This is similar to the role that AI is playing in pharmaceutical development and medical research, where AI is used to provide options and possibilities that humans put to the test.

AI systems will generate many dead ends, rediscoveries and ugly half-proofs. Human experts will need to sort signals from the noise. In that world, AI acts more as a collaborator than a solitary genius. This approach will allow the best mathematicians to spend less time producing every candidate idea and more time judging which idea candidate deserves greater scrutiny.

Why This Matters Beyond Mathematics

With this announcement, OpenAI is arguing that AI can do more than just write, code, summarize and tutor. It sees its role as a genuinely helpful collaborator on some of the biggest problems facing mankind. That market is the one every frontier lab wants. Math is a good test case because new ideas and proofs can be checked. In biology, chemistry or materials science, a model’s idea may need months of real world testing and lab work that could take years to validate. In mathematics, a proposed proof can be attacked line by line. That makes the field a critical proving ground for claims about reasoning.

This is why the originality question matters so much. If the model merely surfaces hidden, but existing answers, then the limits to what AI can provide will become clear. If AI models assemble even partially known ideas into something more substantial, then they can be more useful. Coming up with completely new ideas that have never been written or considered before might be the extreme of possibilities, at least for now.