Claude Haiku 4.5 vs Codestral 2508 for Constrained Rewriting
Winner: Codestral 2508. In our testing both models score 3/5 on Constrained Rewriting and share the same task rank (31 of 53), but Codestral 2508 pulls ahead on the tiebreakers: it scores 5/5 on structured_output versus Claude Haiku 4.5's 4/5 while matching it on faithfulness (5/5) and long-context ability (5/5). That stronger structured_output performance, plus much lower pricing ($0.30 vs $1.00 input, $0.90 vs $5.00 output per MTok), makes Codestral the better practical choice when strict format or exact-length constraints and cost matter. Claude Haiku 4.5 remains preferable if maintaining brand voice or persona under tight compression is the primary objective (see analysis below).
Pricing
- Claude Haiku 4.5 (Anthropic): $1.00/MTok input, $5.00/MTok output
- Codestral 2508 (Mistral): $0.30/MTok input, $0.90/MTok output
Task Analysis
What Constrained Rewriting demands: the task evaluates compression within hard character limits while preserving meaning and adhering to exact formats. The key capabilities are structured_output (format and schema compliance), faithfulness (sticking to the source material), precise token/length control, and reliable long_context handling when the source is long.

In our testing, both Claude Haiku 4.5 and Codestral 2508 score 3/5 on the constrained_rewriting test and share the same rank (31 of 53). To break the tie, we examine supporting benchmarks from our suite: Codestral 2508 scores 5/5 on structured_output while Claude Haiku 4.5 scores 4/5; both score 5/5 on faithfulness and long_context. That pattern indicates Codestral is stronger at exact format compliance (critical when an output must be precisely N characters or match a JSON/CSV schema), while Claude Haiku 4.5 is stronger on persona consistency and broader reasoning (persona_consistency 5/5 vs 3/5; creative_problem_solving 4/5 vs 2/5). Because there is no external benchmark for this task, these internal scores are the primary evidence for our verdict.
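Since exact format compliance is the deciding factor, it helps to make the acceptance criteria concrete. Below is a minimal, hypothetical check in Python (the function name and keys are ours for illustration, not part of any benchmark) that flags a rewrite which breaks a hard character budget or a required JSON shape:

```python
import json

# Hypothetical acceptance check for a constrained-rewrite output:
# the rewrite must fit a hard character budget and, when a schema
# is required, parse as JSON with the expected keys.
def check_rewrite(output: str, max_chars: int, required_keys=None) -> list[str]:
    problems = []
    if len(output) > max_chars:
        problems.append(f"too long: {len(output)} > {max_chars} chars")
    if required_keys is not None:
        try:
            data = json.loads(output)
        except json.JSONDecodeError as e:
            problems.append(f"not valid JSON: {e}")
        else:
            missing = [k for k in required_keys if k not in data]
            if missing:
                problems.append(f"missing keys: {missing}")
    return problems

# Example: a UI string compressed into a 60-character bucket.
print(check_rewrite('{"label": "Save changes"}', 60, ["label"]))  # []
```

A model that scores higher on structured_output simply fails checks like these less often, which is what drives the verdict above.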
Practical Examples
Where Codestral 2508 shines (based on score gaps and costs):
- High-volume UI copy compression where outputs must match a schema or exact character buckets: structured_output 5/5 vs 4/5 (Claude), and output pricing of $0.90 vs $5.00 per MTok, make Codestral both more reliable and far cheaper at scale.
- API-driven workflows that require deterministic JSON or CSV snippets trimmed to exact lengths: Codestral's structured_output 5/5 reduces format rework (see the retry sketch after this list).

Where Claude Haiku 4.5 shines (based on supporting scores):
- Brand-voice compression tasks that require preserving persona and tone while cutting length: persona_consistency 5/5 (Claude) vs 3/5 (Codestral) means Claude Haiku 4.5 is more likely to keep the intended voice during aggressive shortening.
- Situations that need flexible, creative rewrites within constraints (e.g., a 140-character tweet that still reflects a marketing angle): creative_problem_solving 4/5 (Claude) vs 2/5 (Codestral).

Both models are equally reliable on faithfulness and long context (5/5 each in our testing), so either preserves source facts when the passage is long. Choose based on whether strict format compliance and cost (Codestral 2508) or persona retention and broader reasoning (Claude Haiku 4.5) matter more.
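When exact length matters in production, a common pattern with either model is to validate the output and re-prompt on violations. The sketch below is model-agnostic and hypothetical: `generate` stands in for whichever API you call (Claude Haiku 4.5 or Codestral 2508) and is not a real SDK signature.

```python
from typing import Callable

# A model-agnostic retry loop for hard character limits. On each
# failure, the violation is fed back so the next attempt can tighten.
def rewrite_within_limit(
    generate: Callable[[str], str],  # placeholder for a model API call
    source: str,
    max_chars: int,
    max_attempts: int = 3,
) -> str | None:
    prompt = f"Rewrite in at most {max_chars} characters:\n{source}"
    for _ in range(max_attempts):
        candidate = generate(prompt).strip()
        if len(candidate) <= max_chars:
            return candidate
        prompt = (
            f"Your last answer was {len(candidate)} characters; "
            f"the hard limit is {max_chars}. Rewrite shorter:\n{candidate}"
        )
    return None  # caller decides how to handle persistent overruns
```

A model with stronger structured_output needs fewer loop iterations, which compounds Codestral's per-token price advantage at scale.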
Bottom Line
For Constrained Rewriting, choose Codestral 2508 if you need exact format compliance and cost efficiency (structured_output 5/5; $0.30 input and $0.90 output per MTok). Choose Claude Haiku 4.5 if preserving brand voice or persona under tight character limits is the priority (persona_consistency 5/5; better creative compression behavior). Both scored 3/5 on the constrained_rewriting test in our testing and share the same task rank, so pick by which tradeoff matters more to your workflow.
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.
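For context on what "scored 1–5 by an LLM judge" can look like in practice, here is an illustrative sketch; `ask_judge` and the rubric wording are our assumptions for this page, not the actual test harness.

```python
import re

# Illustrative only: one way a 1-5 judge rubric could be wired up.
JUDGE_PROMPT = (
    "Score the candidate rewrite from 1 (fails the constraints) to 5 "
    "(meets length, format, and faithfulness exactly).\n"
    "Source: {source}\nConstraints: {constraints}\nCandidate: {candidate}\n"
    "Reply with a single integer."
)

def judge_score(ask_judge, source: str, constraints: str, candidate: str) -> int:
    reply = ask_judge(JUDGE_PROMPT.format(
        source=source, constraints=constraints, candidate=candidate))
    match = re.search(r"[1-5]", reply)  # take the first 1-5 digit in the reply
    if match is None:
        raise ValueError(f"judge returned no 1-5 score: {reply!r}")
    return int(match.group())
```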