Claude Haiku 4.5 vs Devstral Medium for Constrained Rewriting
Winner: Claude Haiku 4.5. In our testing, both Claude Haiku 4.5 and Devstral Medium score 3/5 on Constrained Rewriting (compression within hard character limits). Claude Haiku 4.5 is the better choice because it outperforms Devstral Medium on the supporting capabilities that matter for reliable compression: long_context (5 vs 4), faithfulness (5 vs 4), and tool_calling (5 vs 3). Those strengths reduce the risk of meaning loss, help preserve format under tight limits, and enable reliable function-driven character counting. Note the tradeoff: Haiku costs more ($1.00 vs $0.40/MTok input; $5.00 vs $2.00/MTok output).
anthropic — Claude Haiku 4.5
Pricing: $1.00/MTok input, $5.00/MTok output

mistral — Devstral Medium
Pricing: $0.40/MTok input, $2.00/MTok output

modelpicker.net
Task Analysis
What Constrained Rewriting demands: tight compression with zero tolerance for dropped meaning, strict adherence to character limits and output format, and the ability to handle long or multi-part sources. Key capabilities: faithfulness (preserve original meaning), structured_output (adhere to schemas and exact formats), long_context (process long inputs without losing earlier context), and tool_calling (accurate function selection and arguments for length checks and iterative trimming).

In our testing, the primary signal for this task is the constrained_rewriting score: both models score 3/5, tying them on the core test. Use supporting metrics to break the tie: Claude Haiku 4.5 scores higher on long_context (5 vs 4), faithfulness (5 vs 4), and tool_calling (5 vs 3); structured_output is equal at 4 for both. Those supporting differences explain why Haiku produces more reliable, faithful compressions on long or format-sensitive inputs even though the headline task score is identical.
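The length-check tool mentioned above can be as simple as a character counter exposed to the model. A minimal sketch follows; the function name (`check_length`), its return shape, and the schema dict are illustrative assumptions, not part of either provider's API.

```python
# Hypothetical length-check tool an agent can call while rewriting.
# The name, return fields, and schema below are illustrative only.

def check_length(text: str, limit: int) -> dict:
    """Report whether a draft fits a hard character limit, and by how much it misses."""
    n = len(text)
    return {"chars": n, "limit": limit, "over_by": max(0, n - limit), "fits": n <= limit}

# JSON-schema-style description the model would see when deciding to call the tool.
CHECK_LENGTH_TOOL = {
    "name": "check_length",
    "description": "Count characters in a draft and report whether it fits the hard limit.",
    "input_schema": {
        "type": "object",
        "properties": {
            "text": {"type": "string"},
            "limit": {"type": "integer"},
        },
        "required": ["text", "limit"],
    },
}
```

Exact character counting is exactly the kind of operation models do unreliably in-context, which is why a higher tool_calling score matters for this task.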
Practical Examples
- High-stakes long-document compression: Claude Haiku 4.5 is better for compressing long legal or academic paragraphs into strict character-limited summaries; long_context 5 (Haiku) vs 4 (Devstral) and faithfulness 5 vs 4 reduce drift and preserve nuance.
- Function-driven pipelines: Haiku's tool_calling 5 vs 3 makes it preferable when your agent iteratively calls a character-count check function to meet hard limits.
- High-volume short rewrites with tight budgets: Devstral Medium is attractive when constrained-rewriting quality requirements are moderate; it ties Haiku at 3/5 but costs less ($0.40 vs $1.00/MTok input; $2.00 vs $5.00/MTok output), so you can run many short rewrites cheaply.
- Format-sensitive outputs (JSON/strict templates): both models score 4/5 on structured_output in our tests, so either works for schema-adherent compressed outputs; prefer Haiku if the source is long or meaning preservation is critical.
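The function-driven pipeline above can be sketched as a simple loop: draft, check the hard limit, and ask for a tighter rewrite until it fits. `rewrite_shorter` here is a hypothetical stand-in for a real model call (the word-boundary truncation just keeps the sketch runnable); `compress_to_limit` and `max_rounds` are illustrative names, not anything from either provider's SDK.

```python
# Sketch of a function-driven compression loop with a hard character limit.

def rewrite_shorter(text: str, limit: int) -> str:
    """Stand-in for a model call that compresses `text` toward `limit` characters.

    A real implementation would prompt the model; here we drop trailing words
    at word boundaries so the sketch runs without any API.
    """
    words = text.split()
    while words and len(" ".join(words)) > limit:
        words.pop()
    return " ".join(words)

def compress_to_limit(text: str, limit: int, max_rounds: int = 5) -> str:
    """Iteratively shorten `text` until it fits `limit`, or fail after `max_rounds`."""
    draft = text
    for _ in range(max_rounds):
        if len(draft) <= limit:  # hard limit satisfied, stop rewriting
            return draft
        draft = rewrite_shorter(draft, limit)
    if len(draft) > limit:
        raise ValueError("could not meet the character limit")
    return draft
```

Bounding the loop with `max_rounds` matters in production: a model that never converges on the limit should fail loudly rather than burn tokens indefinitely.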
Bottom Line
For Constrained Rewriting, choose Claude Haiku 4.5 if you need the most reliable, faithful compression for long or format-sensitive sources (long_context 5 vs 4; faithfulness 5 vs 4; tool_calling 5 vs 3) and you can accept higher costs ($1.00 input / $5.00 output per MTok). Choose Devstral Medium if budget per call is the priority and your rewrites are short or lower-risk: it ties on the core task (3/5) but is cheaper ($0.40 input / $2.00 output per MTok).
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.