AI Solves Unsolved Mathematical Problems

Published on March 17, 2026 | Translated from Spanish

A new milestone challenges the boundary between assistant artificial intelligence and autonomous researcher. HorizonMath, a benchmark with over 100 largely unsolved mathematical problems, has served to demonstrate that models like GPT 5.4 Pro can propose solutions that surpass the best published results. This advance, pending expert review, suggests that AI is beginning to generate novel knowledge, not just recombine the existing, in domains that require true insight.

An AI model writes on a blackboard full of complex mathematical equations, pointing to a novel solution.

HorizonMath: A Testing Ground for Authentic Discovery 🔬

The key to HorizonMath lies in its design to avoid data contamination. By focusing on computational and applied problems without known solutions, where verification is simple but discovery is arduous, it ensures that models cannot memorize answers. Most state-of-the-art models have performance close to 0%, highlighting the difficulty. The platform automates evaluation, scaling a process that previously depended on costly formal verifications or manual reviews, and presents itself as an open community resource where a correct solution can equate to a legitimate mathematical contribution.

Implications for the Digital Creator and Knowledge Authorship 💡

This achievement transcends mathematics and raises profound questions for any creator using AI. If a model can contribute novelty in such a structured field, how do we redefine authorship and originality in the digital era? For the Foro3D community, accustomed to pushing the limits of technology, this reinforces the vision of AI as a tool for collaborative discovery, but also underscores the need for expert human criteria to validate and contextualize its findings, paving the way for a deeper creative symbiosis.

Can AI's autonomous resolution of unsolved mathematical problems redefine the role of the human researcher and the very process of scientific discovery?

(P.S.: tech nicknames are like children: you name them, but the community decides what to call them)