Claude Haiku 4.5 vs Devstral Small 1.1 for Constrained Rewriting
Winner: Claude Haiku 4.5. In our testing both Claude Haiku 4.5 and Devstral Small 1.1 score 3/5 on Constrained Rewriting (rank 31 of 52), so the task-specific tie is broken by supporting capabilities. Haiku 4.5 delivers higher faithfulness (5 vs 4), long-context handling (5 vs 4), and tool-calling (5 vs 4) in our benchmarks: capabilities that matter for reliably compressing text under hard character limits. Devstral Small 1.1 is materially cheaper (output cost $0.30/MTok vs Haiku's $5.00/MTok) and is a valid choice when budget and throughput trump absolute fidelity. We pick Claude Haiku 4.5 for quality-critical constrained rewriting and Devstral Small 1.1 when cost or scale is the primary constraint.
Pricing
Claude Haiku 4.5 (Anthropic): Input $1.00/MTok, Output $5.00/MTok
Devstral Small 1.1 (Mistral): Input $0.10/MTok, Output $0.30/MTok
Task Analysis
What Constrained Rewriting demands: per our benchmark description, the task is "compression within hard character limits." That requires precise fidelity to the source meaning, predictable structured output, robust handling of long inputs so important context isn't dropped, and sometimes programmatic tool sequencing when output must meet exact constraints.

No external benchmarks cover this task, so our verdict rests on internal scores. In our testing both models score 3/5 on Constrained Rewriting and share the same task rank (31 of 52). To break the tie, we compare supporting capabilities: Claude Haiku 4.5 scores faithfulness 5, long-context 5, tool-calling 5, and structured output 4; Devstral Small 1.1 scores faithfulness 4, long-context 4, tool-calling 4, and structured output 4. Those deltas explain why Haiku is better at preserving meaning and handling long-source compression.

Cost and runtime trade-offs also matter: Haiku has a 200k-token context window and supports up to 64k output tokens, while Devstral Small has a 131k-token context window with no reported output cap, which is worth checking when source size approaches those limits.
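The "programmatic tool sequencing" point above boils down to a validate-and-retry loop that enforces the hard character limit in code rather than trusting the model's first draft. A minimal sketch, assuming a hypothetical `call_model` function that stands in for whichever model API you use (it is stubbed here for illustration):

```python
def call_model(prompt: str) -> str:
    # Stub: a real implementation would send `prompt` to the chosen
    # model (e.g. Haiku or Devstral) and return its text response.
    return "Board approved the merger; closing expected Q3 pending review."

def rewrite_within_limit(text: str, limit: int, max_retries: int = 3) -> str:
    """Ask the model to compress `text`, re-prompting with the measured
    overshoot until the hard character limit is met or retries run out."""
    candidate = text
    prompt = f"Rewrite the following in at most {limit} characters:\n{text}"
    for _ in range(max_retries):
        candidate = call_model(prompt)
        if len(candidate) <= limit:
            return candidate
        prompt = (
            f"Your draft was {len(candidate)} characters; the hard limit "
            f"is {limit}. Shorten it further:\n{candidate}"
        )
    # Last resort: truncate rather than violate the hard limit.
    return candidate[:limit]
```

The loop feeds the measured character count back into the prompt, which in our experience is the reliable way to hit exact limits regardless of which model sits behind `call_model`.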
Practical Examples
1) High-fidelity legal or regulatory summarization that must fit strict character counts: choose Claude Haiku 4.5. In our tests Haiku's faithfulness (5 vs 4) and long-context handling (5 vs 4) reduce the risk of losing critical clauses during compression.
2) Social or marketing copy constrained to exact character counts but produced at scale and low cost: choose Devstral Small 1.1. Both models score 3/5 on the task itself, but Devstral's output cost is $0.30/MTok vs Haiku's $5.00/MTok, making bulk workflows far cheaper.
3) Tool-driven transformations where you call formatting/validation functions to enforce length limits: Claude Haiku 4.5 is preferable (tool-calling 5 vs 4) for more reliable argument selection and sequencing.
4) Long-document compression where context beyond typical windows matters: Haiku's 200,000-token context window and long-context score of 5 give it an edge over Devstral's 131,072-token window and score of 4 in our tests.
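For the tool-driven case in example 3, the validation function you hand the model can be as small as a character counter. A sketch of such a tool, using the common JSON Schema tool-definition shape; the tool name and handler are illustrative assumptions, not a fixed API:

```python
# Illustrative tool definition the orchestration layer registers with the
# model. The JSON Schema `input_schema` shape is widely used for tool
# calling; the name "check_length" is our own choice.
char_count_tool = {
    "name": "check_length",
    "description": (
        "Return the character count of a candidate rewrite and whether "
        "it fits the hard limit."
    ),
    "input_schema": {
        "type": "object",
        "properties": {
            "candidate": {"type": "string"},
            "limit": {"type": "integer"},
        },
        "required": ["candidate", "limit"],
    },
}

def check_length(candidate: str, limit: int) -> dict:
    # Local handler the orchestration layer runs when the model invokes
    # the tool; the model then revises its draft based on the result.
    return {"length": len(candidate), "fits": len(candidate) <= limit}
```

Stronger tool-calling scores matter here because the model must pass the right arguments and act correctly on the `fits` result over several rounds.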
Bottom Line
For Constrained Rewriting, choose Claude Haiku 4.5 if you need higher fidelity, stronger long-context retention, or more reliable tool-assisted formatting (Haiku: faithfulness 5, long-context 5, tool-calling 5). Choose Devstral Small 1.1 if you must compress at high volume on a tight budget (Devstral output cost $0.30/MTok vs Haiku's $5.00/MTok) and can accept a modest drop in fidelity and context handling.
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.