Claude 3.7 Sonnet (16k thinking)
A previous flagship reasoning model from Anthropic, benchmarked with a budget of up to 16,000 reasoning tokens.
Epoch Capabilities Index (ECI): 138
ECI is a composite metric based on over 42 benchmarks.