AI Models Achieve Gold Medals At International Math Olympiad

Recent advances in artificial intelligence (AI) have marked a historic milestone. Recently, AI models developed by OpenAI and Google DeepMind achieved gold medal-level scores at the International Mathematical Olympiad (IMO). This prestigious contest challenges high school students worldwide with complex math problems. For the first time, AI systems have matched the performance of top human competitors under official exam conditions.
The International Mathematical Olympiad Explained
The IMO is an annual global competition since 1959. Participants face two sessions, each lasting 4.5 hours. Each session requires solving three difficult problems from algebra, combinatorics, geometry, and number theory. Each problem carries seven points, with a maximum total of 42 points. Gold medals are awarded to those scoring above a set threshold, which was 35 points in 2025.
AI’s Performance and Techniques
The two AI models solved five out of six problems, scoring 35 points each. Unlike typical large language models, these AI systems use advanced reasoning techniques. They work through problems step-by-step before reaching answers. Google’s model, Gemini Deep Think, uses parallel thinking to explore multiple solutions simultaneously. OpenAI’s model, verified by former IMO medallists, also displayed strong reasoning but has not yet been officially certified by the IMO.
Human Competitors and AI Comparison
India won three gold medals, two silver, and one bronze at IMO 2025. One Indian gold medallist outscored the AI models by two points. Indian participants noted that AI excels at pattern recognition and memorising problem types. However, AI lacks the creativity and emotional experience that human participants bring. Certain novel problems requiring new ideas remain challenging for AI.
Significance of AI Achievements
This breakthrough shows rapid AI progress in mathematical reasoning. Previous AI systems required human assistance to translate problems into formal languages and took longer to compute answers. Now AI can generate rigorous proofs directly from natural language within exam time limits. These advances could impact research areas like cryptography and space exploration by solving unsolved mathematical problems.
Limitations and Future Prospects
Despite successes, AI models still show jagged intelligence, struggling with simple questions and inconsistent reasoning. Researchers caution against overestimating AI’s current abilities. Human creativity and intuition remain crucial in mathematics. Experts foresee AI as a tool for checking proofs and brainstorming rather than replacing mathematicians. AI may also assist in Olympiad training similar to how it aids chess players.