FrontierMath Tiers 1-4

A benchmark of several hundred unpublished, highly challenging mathematics problems. Difficulty Tiers 1-3 cover undergraduate through early postdoc level problems, while Tier 4 is research-level mathematics.

This project is supported by OpenAI.