FrontierMath — benchmarking AI against advanced mathematical research

A benchmark of several hundred unpublished, expert-level mathematics problems that take specialists hours to days to solve. Difficulty Tiers 1-3 cover undergraduate through early graduate level problems, while Tier 4 is research-level mathematics. This project is supported by OpenAI.

Learn more Sample problems

See all benchmarks.

Publications & Commentary

Papers & Reports
Newsletter
Podcast

Data & Resources

Data on AI
AI Trends & Statistics
Data Insights

Projects

FrontierMath
GATE Playground
Distributed Training
Model Counts

Company

About Us
Our Team
Careers
Consultations
Our Funding
Donate
Latest
Contact

Privacy Notice Cookie Policy