models/anthropic/claude-sonnet-5
A
Anthropic·active

Anthropic: Claude Sonnet 5

Anthropic's flagship model. Long-context specialist with 1M window.

Overall score
4.77
/5.00 · ranked #4
Input
$2.00
per 1M tokens
Output
$10.00
per 1M tokens
Context
1M
tokens
Blended
$8.00
3:1 out:in ratio

Price drops, new benchmarks, model updates. Stay current on Anthropic: Claude Sonnet 5.

One email per change. Unsubscribe anytime.

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →
Structured Output
5.0
Strategic Analysis
5.0
Constrained Rewriting
4.0
Creative Problem Solving
5.0
Tool Calling
5.0
Faithfulness
5.0
Classification
4.0
Long Context
4.0
Safety Calibration
5.0
Persona Consistency
5.0
Agentic Planning
5.0
Multilingual
5.0
Tabular Data
5.0

What you need to know

Claude Sonnet 5 is a high-tier generalist model optimized for complex reasoning and agentic workflows. It ranks 5th out of 112 evaluated models, driven by perfect scores in strategic analysis, tool calling, and agentic planning. Its ability to maintain faithfulness and persona consistency makes it a reliable choice for autonomous systems that require strict adherence to logic and identity.

The model offers a massive 1M token context window, though its internal performance in long-context tasks is slightly lower than its peak capabilities in other areas. While it excels at structured output and tabular data, it shows marginal relative weakness in classification and constrained rewriting.

At a blended cost of $8.00 per million tokens, this is a premium-priced model. The cost is justified for developers building complex agents or strategic analysis tools where accuracy and structured data are non-negotiable, but it may be inefficient for simple classification or high-volume rewriting tasks.

Use this model if you are building autonomous agents, complex strategic tools, or applications requiring high-fidelity structured outputs. Skip this model if your primary use case is basic text classification or if your budget requires a lower-cost alternative for simple rewriting tasks.

Strengths — Top 3

Structured Output5.0/5.0
Strategic Analysis5.0/5.0
Creative Problem Solving5.0/5.0

Relative weaknesses — Bottom 3

Constrained Rewriting4.0/5.0
Classification4.0/5.0
Long Context4.0/5.0

Similar models

ZZ.ai: GLM 5.2$2.484.85QQwen: Qwen3.6 Max Preview$4.944.85MKimi K2.7 Code$2.814.62AAnthropic: Claude Opus 4.8 (Fast)$40.004.77