ALE-Bench

ALE-Bench, created by Sakana AI in collaboration with AtCoder, evaluates AI systems on long-horizon, objective-driven algorithm engineering. Its problems are hard combinatorial optimization tasks such as routing, scheduling, planning, and multi-agent control. Like real heuristic-programming contests, they reward iterative refinement over long sessions rather than single-shot answers.

Methodology

We source results from the public ALE-Bench leaderboard.

ALE-Bench is built from 40 past AtCoder Heuristic Contest (AHC) problems. A Python library replicates the AHC contest environment with a code sandbox, and models iteratively submit solutions and receive public-evaluation feedback over a fixed time budget. Each submission is scored and converted into a “Performance” value using an Elo-rating-like method based on where it would rank against human contestants. Our chart defaults to average Performance (higher is better, on a roughly 0–3500 scale) and also exposes the contest rank (lower is better).
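The submit-and-refine loop and the averaging step described above can be illustrated with a small, self-contained sketch. It does not use the ALE-Bench library. The names `refine`, `propose`, `evaluate`, and `average_performance` are hypothetical. A submission count stands in for the fixed time budget, a toy scoring function stands in for public-evaluation feedback, and the Performance values passed to the averaging helper are made-up placeholders, not leaderboard results.

```python
from statistics import mean
from typing import Callable, Dict, Tuple


def refine(
    initial: int,
    propose: Callable[[int, int], int],
    evaluate: Callable[[int], float],
    max_submissions: int,
) -> Tuple[int, float]:
    """Submit candidates one at a time and keep the best-scoring one.

    `max_submissions` is an assumption standing in for a fixed time budget.
    """
    best, best_score = initial, evaluate(initial)
    for step in range(max_submissions):
        candidate = propose(best, step)
        score = evaluate(candidate)  # stand-in for public-evaluation feedback
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


def average_performance(per_problem: Dict[str, float]) -> float:
    """Average per-problem Performance values (higher is better)."""
    return mean(per_problem.values())


def toy_score(x: int) -> float:
    """Toy objective: the closer x is to 7, the higher the score."""
    return -float((x - 7) ** 2)


def toy_propose(x: int, step: int) -> int:
    """Toy proposal: alternate between stepping right and stepping left."""
    return x + (1 if step % 2 == 0 else -1)


best_x, best_val = refine(0, toy_propose, toy_score, max_submissions=20)

# Placeholder values for two hypothetical problems, not real results.
avg = average_performance({"problem_a": 1000.0, "problem_b": 2000.0})
```

In the benchmark itself, the per-problem Performance comes from ranking a submission against human contestants. The sketch only shows how the best submission is kept across iterations and how per-problem values combine into the averaged figure.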

For full details, see the ALE-Bench paper and code.
