Paul Erdős famously said that mathematicians are machines that turn coffee into theorems. A couple of recent articles in the Wall Street Journal revealed the current state-of-the-art for AI to do the same.
In July, Ben Cohen published the article The High-Schoolers Who Just Beat the World’s Smartest AI Models. The focus of the article was the 2025 International Mathematical Olympiad, the pinnacle of the calendar for high school mathematics competitions. In the United States, the pathway to the IMO is first excellng at a sequence of increasing difficult exams: the AMC->12 (or possibly AMC->10), then the American Invitational Mathematics Exam (AIME), and then USA Mathematical Olympiad (USAMO) or USA Junior Mathematical Olympiad (USJAMO). The top USAMO and USJAMO participants then get invited to a special camp from which the participants in that year’s IMO are selected.
My personal story: back in high school, my score on the AMC->12 (then called the AHSME) qualified me for the AIME my sophomore and junior years, where my run in the competition ended with a resounding thud. My senior year, I caught lightning in a bottle and somehow qualified for the USAMO; I’m not sure what the cut-off is these days, but back then only 150 or so high school students qualified for the USAMO each year. My excitement at qualifying for the USAMO gave way to utter humiliation after I actually attempted the exam (to say that I “took” the exam is probably a misuse of the work “took”.) All this to say: I never came close to sniffing the IMO. From the Wall Street Journal article:
The famously grueling IMO exam is held over two days and gives students three increasingly difficult problems a day and more than four hours to solve them. The questions span algebra, geometry, number theory and combinatorics—and you can forget about answering them if you’re not a math whiz. You’ll give your brain a workout just trying to understand them.
Because those problems are both complex and unconventional, the annual math test has become a useful benchmark for measuring AI progress from one year to the next. In this age of rapid development, the leading research labs dreamed of a day their systems would be powerful enough to meet the standard for an IMO gold medal, which became the AI equivalent of a four-minute mile.
But nobody knew when they would reach that milestone or if they ever would—until now.
The unthinkable occurred earlier this month when an AI model from Google DeepMind earned a gold-medal score at IMO by perfectly solving five of the six problems. In another dramatic twist, OpenAI also claimed gold despite not participating in the official event. The companies described their feats as giant leaps toward the future—even if they’re not quite there yet.
In fact, the most remarkable part of this memorable event is that 26 students got higher scores on the IMO exam than the AI systems.
A second article by Ben Cohen, The Math Legend Who Just Left Academic — for an AI Startup Run by a 24-Year-Old, might be a precursor of things to come. One of the two starts of the article is number theorist Dr. Ken Ono. From the article:
In recent years, Ono began tracking AI’s remarkable progress as it rapidly improved. He was intrigued, though not intimidated. AI was astonishing at cognitive tasks and solving problems it had already seen, but it struggled with the creative elements of his field, which require intuition and abstract thinking.
That creativity is so fundamental to pure mathematics that Ono figured his job would be safe for decades.
But last spring, he was one of 30 mathematicians invited to curate research-level problems as a test of the AI models. He left the symposium profoundly shaken by what he’d seen.
“The lead I had on the models was shrinking,” he said. “And in areas of mathematics that were not in my wheelhouse, I felt like the models were already blowing me away.”
For months afterward, Ono felt like he was grieving his identity. He didn’t know what to do next, knowing that AI models would only get smarter.
“Then I had an epiphany,” he said. “I realized what the models were offering was a different way of doing math.”
Dr. Ono is now taking an extended leave from the University of Virginia to join a new AI startup company called Axiom. From Tech Funding News:
Led by Carina Hong, Axiom Math is developing an AI system that not only solves complex math problems but also generates new mathematical knowledge by proposing conjectures: mathematical statements that have yet to be proven.
The model produces rigorous, step-by-step proofs that can be independently verified using proof assistants such as Lean and Coq. This approach aims to transform English-language math from textbooks and research papers into code, enabling the AI to create and validate new problems that push the boundaries of existing knowledge…
Currently, Axiom is working on models that can discover and solve new math problems. The researchers also hope to apply their work in areas like finance, aircraft design, chip design, and quantitative trading.
Beyond pure mathematics, Axiom’s AI tool is being tested for practical applications in fields requiring rigorous computational precision, including finance, aircraft and chip design, and quantitative trading.
Time will tell if the intersection of AI with mathematics can generate a profitable company. What I don’t doubt is that the previously unthinkable — original mathematical work by AI — will eventually happen, given enough time.








