Lech Mazur Writing

This benchmark aggregates writing prompts and scores from the lechmazur/writing project, reporting an average rating (typically on a 1–10 scale) across diverse tasks. Prompts span expository, persuasive, and creative writing, pushing models to maintain structure, develop ideas, and adopt consistent tone.

Because scoring emphasizes overall quality rather than narrow correctness, strong performance suggests better controllability, content planning, and fluency over longer outputs. Model cards often cite these results as a proxy for real-world writing assistance quality.

Featured

Publications

Data explorers

Benchmarks by Epoch AI

AI Progress

Industry

Infrastructure

Impacts

Papers & Reports

Data Insights

Newsletter

Podcast

Capabilities

Models

Data Centers

Chip Owners

Companies

Polling on AI Use

MirrorCode

Epoch Capabilities Index

FrontierMath: Open Problems

FrontierMath: Tiers 1-4

Lech Mazur Writing