ForecastBench

ForecastBench, created by the Forecasting Research Institute, is a dynamic, continuously updated benchmark of AI forecasting ability. Because it asks models to assign probabilities to events whose outcomes are not yet known, there is no risk of training contamination, and it directly compares AI forecasts against human baselines, including professional superforecasters and the general public.

Methodology

We source results from the public ForecastBench leaderboard.

ForecastBench draws on two question types: “market” questions sourced from prediction markets (such as Manifold, Metaculus, and Polymarket) and “dataset” questions auto-generated from databases (such as ACLED, FRED, and Yahoo! Finance). The system runs nightly to ingest questions and resolutions, and samples new rounds of questions on a rolling basis. Forecasts are scored with a difficulty-adjusted Brier Index on an interpretable 0–100% scale, where higher is better (100% is perfect, 50% corresponds to an uninformed guess, and 0% is maximally wrong). Our chart defaults to the overall score and also exposes the separate dataset and market scores.

For full details, see the ForecastBench paper and the methodology documentation.

Featured

Publications

Data explorers

Benchmarks by Epoch AI

AI Progress

Industry

Infrastructure

Impacts

Papers & Reports

Data Insights

Newsletter

Podcast

Capabilities

Models

Data Centers

Chip Owners

Companies

Polling on AI Use

MirrorCode

Epoch Capabilities Index

FrontierMath: Open Problems

FrontierMath: Tiers 1-4

ForecastBench

ForecastBench

Methodology

ForecastBench

Featured

Publications

Data explorers

Benchmarks by Epoch AI

AI Progress

Industry

Infrastructure

Impacts

Publications

Papers & Reports

Data Insights

Newsletter

Podcast

Data explorers

Capabilities

Models

Data Centers

Chip Owners

Companies

Polling on AI Use

Benchmarks by Epoch AI

MirrorCode

Epoch Capabilities Index

FrontierMath: Open Problems

FrontierMath: Tiers 1-4

Scaling

Software progress

Open models

Capabilities

Math

Leading companies

Finances

Geopolitics

Chips

Data centers

Energy

Adoption and use

Economic impact

Future of AI

About Epoch AI

Donate

Team

Careers

Consultations

For press

Transparency

ForecastBench

ForecastBench

Methodology