Claude Sonnet 4.6 vs Devstral Medium

In our testing, Claude Sonnet 4.6 is the winner for most professional and agentic workflows: it wins 9 of 12 benchmarks, including safety calibration, tool calling, and long context. Devstral Medium wins none of the 12 internal benchmarks but is a clear cost-saving choice (Sonnet $3/$15 per MTok input/output vs. Devstral $0.40/$2 per MTok).

Anthropic

Claude Sonnet 4.6

Overall
4.67/5 (Strong)

Benchmark Scores

Faithfulness
5/5
Long Context
5/5
Multilingual
5/5
Tool Calling
5/5
Classification
4/5
Agentic Planning
5/5
Structured Output
4/5
Safety Calibration
5/5
Strategic Analysis
5/5
Persona Consistency
5/5
Constrained Rewriting
3/5
Creative Problem Solving
5/5

External Benchmarks

SWE-bench Verified
75.2%
MATH Level 5
N/A
AIME 2025
85.8%

Pricing

Input

$3.00/MTok

Output

$15.00/MTok

Context Window: 1,000K

modelpicker.net

Mistral

Devstral Medium

Overall
3.17/5 (Usable)

Benchmark Scores

Faithfulness
4/5
Long Context
4/5
Multilingual
4/5
Tool Calling
3/5
Classification
4/5
Agentic Planning
4/5
Structured Output
4/5
Safety Calibration
1/5
Strategic Analysis
2/5
Persona Consistency
3/5
Constrained Rewriting
3/5
Creative Problem Solving
2/5

External Benchmarks

SWE-bench Verified
N/A
MATH Level 5
N/A
AIME 2025
N/A

Pricing

Input

$0.400/MTok

Output

$2.00/MTok

Context Window: 131K


Benchmark Analysis

We compare results from our 12-test suite (each test scored 1–5). Claude Sonnet 4.6 wins 9 tests, Devstral Medium wins none, and 3 tests are ties. Wins for Claude Sonnet 4.6, with scores and rank context:

  • Safety calibration: 5/5, tied for 1st of 55 models (with 4 others). Sonnet reliably refuses harmful requests and permits legitimate ones in our tests.
  • Tool calling: 5/5, tied for 1st of 54 (with 16 others). Sonnet selects functions, arguments, and call sequencing accurately in our tool-calling scenarios.
  • Long context: 5/5, tied for 1st of 55 (with 36 others). Retrieval and coherence at 30K+ tokens are top-tier in our tests.
  • Agentic planning: 5/5, tied for 1st of 54 (with 14 others). Sonnet decomposes goals and plans recoveries better in our agent workflows.
  • Faithfulness: 5/5, tied for 1st of 55 (with 32 others). Outputs stick to source material with little hallucination.
  • Persona consistency, multilingual, creative problem solving, strategic analysis: all 5/5 with top ranks (persona consistency tied for 1st of 53; multilingual tied for 1st of 55; creative problem solving tied for 1st of 54; strategic analysis tied for 1st of 54). These indicate strong behavioral consistency, multi-language parity, and high-quality ideation and tradeoff analysis.

Ties (both models): structured output (both 4/5, rank 26/54), constrained rewriting (both 3/5, rank 31/53), and classification (both 4/5, tied for 1st of 53). For tasks needing strict JSON/schema compliance or classification, the two models perform equivalently in our suite.

Devstral Medium's highest marks are 4/5 in classification, faithfulness, structured output, long context, and agentic planning (classification is tied for 1st), but it scores lower elsewhere: safety calibration 1/5 (rank 32/55) and creative problem solving 2/5 (rank 47/54) indicate weaknesses on safety-sensitive or inventive tasks in our tests.

External benchmarks: Beyond our internal suite, Claude Sonnet 4.6 scores 75.2% on SWE-bench Verified, ranking 4 of 12 on that external coding benchmark, and 85.8% on AIME 2025, ranking 10 of 23. Devstral Medium has no external SWE-bench or AIME scores available. We present the external numbers as reported by Epoch AI.
Benchmark                  Claude Sonnet 4.6   Devstral Medium
Faithfulness               5/5                 4/5
Long Context               5/5                 4/5
Multilingual               5/5                 4/5
Tool Calling               5/5                 3/5
Classification             4/5                 4/5
Agentic Planning           5/5                 4/5
Structured Output          4/5                 4/5
Safety Calibration         5/5                 1/5
Strategic Analysis         5/5                 2/5
Persona Consistency        5/5                 3/5
Constrained Rewriting      3/5                 3/5
Creative Problem Solving   5/5                 2/5
Summary                    9 wins              0 wins
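The win/tie tally in the summary row follows directly from the scores. A minimal sketch (variable names are ours, not part of the benchmark suite):

```python
# (Sonnet score, Devstral score) per benchmark, from the comparison table.
scores = {
    "Faithfulness": (5, 4), "Long Context": (5, 4), "Multilingual": (5, 4),
    "Tool Calling": (5, 3), "Classification": (4, 4), "Agentic Planning": (5, 4),
    "Structured Output": (4, 4), "Safety Calibration": (5, 1),
    "Strategic Analysis": (5, 2), "Persona Consistency": (5, 3),
    "Constrained Rewriting": (3, 3), "Creative Problem Solving": (5, 2),
}

sonnet_wins = sum(s > d for s, d in scores.values())
devstral_wins = sum(d > s for s, d in scores.values())
ties = sum(s == d for s, d in scores.values())
print(sonnet_wins, devstral_wins, ties)  # 9 0 3
```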

Pricing Analysis

Costs are steeply different: Claude Sonnet 4.6 charges $3 input and $15 output per MTok (1 MTok = 1 million tokens); Devstral Medium charges $0.40 input and $2 output per MTok, a 7.5× price ratio on both input and output. At typical token volumes:

  • 1M tokens: Sonnet = $3 all-input / $15 all-output (50/50 split = $9); Devstral = $0.40 / $2 (50/50 = $1.20).
  • 10M tokens: Sonnet = $30 / $150 (50/50 = $90); Devstral = $4 / $20 (50/50 = $12).
  • 100M tokens: Sonnet = $300 / $1,500 (50/50 = $900); Devstral = $40 / $200 (50/50 = $120).

Who should care: enterprises or apps running millions of tokens per month (chatbots, coding CI, large-scale inference) will see the 7.5× gap compound into substantial absolute savings; choose Devstral under strict cost limits. Teams that need top-ranked safety, tool calling, long context, and agentic planning should budget for Sonnet despite the higher spend.
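The arithmetic above can be sketched as a small helper. Prices are the published per-MTok rates; the 50/50 input/output split is the same simplifying assumption used in the figures above, and real workloads should substitute their own ratio:

```python
def blended_cost(total_tokens: int, input_price: float, output_price: float,
                 input_frac: float = 0.5) -> float:
    """Dollar cost for a token volume at per-million-token (MTok) prices."""
    input_tokens = total_tokens * input_frac
    output_tokens = total_tokens - input_tokens
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# 1M tokens at a 50/50 input/output split
sonnet = blended_cost(1_000_000, 3.00, 15.00)    # $9.00
devstral = blended_cost(1_000_000, 0.40, 2.00)   # $1.20
print(f"Sonnet ${sonnet:.2f} vs Devstral ${devstral:.2f} ({sonnet / devstral:.1f}x)")
```

Because both prices differ by the same 7.5× factor, the blended ratio stays 7.5× regardless of the input/output split.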

Real-World Cost Comparison

Task             Claude Sonnet 4.6   Devstral Medium
Chat response    $0.0081             $0.0011
Blog post        $0.032              $0.0042
Document batch   $0.810              $0.108
Pipeline run     $8.10               $1.08
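The per-task figures above depend on assumed token counts, which the table does not publish. As an illustrative sketch: at the listed prices, a chat response of roughly 700 input and 400 output tokens (our assumption, not a published figure) happens to reproduce the first row:

```python
def task_cost(input_tokens: int, output_tokens: int,
              input_price: float, output_price: float) -> float:
    """Dollar cost of one task at per-MTok prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Illustrative assumption: ~700 input and ~400 output tokens per chat response.
print(round(task_cost(700, 400, 3.00, 15.00), 4))  # Sonnet:   0.0081
print(round(task_cost(700, 400, 0.40, 2.00), 4))   # Devstral: 0.0011
```

To budget your own workload, substitute measured token counts from your logs for the assumed values.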

Bottom Line

Choose Claude Sonnet 4.6 if you need best-in-class safety calibration, tool calling, long-context retrieval, agentic planning, or high-fidelity multilingual and strategic outputs; our testing shows Sonnet winning 9 of 12 benchmarks and posting 75.2% on SWE-bench Verified (Epoch AI). Budget for the higher cost: $3 input / $15 output per MTok. Choose Devstral Medium if your priority is cost at scale and you need competitive classification and structured-output performance at a fraction of the price ($0.40 input / $2 output per MTok), or if you run very high token volumes where the 7.5× price gap dominates the decision.

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.

Frequently Asked Questions