AlgoTune

AlgoTune evaluates whether language models can write code that runs faster than established reference implementations while still producing correct outputs. Tasks span mathematics, physics, computer science, and machine learning, including problems such as gzip compression, AES encryption, singular value decomposition, and convex optimization.

Methodology

We source results from the public AlgoTune leaderboard.

AlgoTune contains 154 coding tasks contributed by domain experts. Through the AlgoTuner harness, a model runs a budgeted loop, editing code, running and profiling it, and verifying correctness against held-out tests, keeping the fastest valid version under a fixed budget per task. Each solver is timed against a reference implementation from popular libraries such as SciPy, scikit-learn, and CVXPY. The headline score is the harmonic mean of speedups across all tasks, where 1.0x means no improvement over the reference and higher is better.

For full details, see the AlgoTune paper and code.

Featured

Publications

Data explorers

Benchmarks by Epoch AI

AI Progress

Industry

Infrastructure

Impacts

Papers & Reports

Data Insights

Newsletter

Podcast

Capabilities

Models

Data Centers

Chip Owners

Companies

Polling on AI Use

MirrorCode

Epoch Capabilities Index

FrontierMath: Open Problems

FrontierMath: Tiers 1-4

AlgoTune

AlgoTune

Methodology

AlgoTune

Featured

Publications

Data explorers

Benchmarks by Epoch AI

AI Progress

Industry

Infrastructure

Impacts

Publications

Papers & Reports

Data Insights

Newsletter

Podcast

Data explorers

Capabilities

Models

Data Centers

Chip Owners

Companies

Polling on AI Use

Benchmarks by Epoch AI

MirrorCode

Epoch Capabilities Index

FrontierMath: Open Problems

FrontierMath: Tiers 1-4

Scaling

Software progress

Open models

Capabilities

Math

Leading companies

Finances

Geopolitics

Chips

Data centers

Energy

Adoption and use

Economic impact

Future of AI

About Epoch AI

Donate

Team

Careers

Consultations

For press

Transparency

AlgoTune

AlgoTune

Methodology