AI Benchmarking
Our database of benchmark results, featuring the performance of leading AI models on challenging tasks. It includes results from benchmarks evaluated internally by Epoch AI as well as data collected from external sources. Explore trends in AI capabilities across time, by benchmark, or by model.
Benchmarking updates
December 12, 2025
We used ECI to predict METR time horizons for recently-released models.
Check out the predictions
December 11, 2025
We added two benchmarks to the suite we run ourselves: SimpleQA Verified and a new benchmark of chess puzzles.
Read an overview