Claude Haiku 4.5 vs Codestral 2508 for Creative Writing

Winner: Claude Haiku 4.5. In our testing on the Creative Writing suite, Claude Haiku 4.5 scores 4.00 vs Codestral 2508's 2.67 (a 1.33-point advantage). Haiku 4.5 outperforms on persona_consistency (5 vs 3) and creative_problem_solving (4 vs 2), the core skills for character voice, plot originality, and narrative coherence. Codestral 2508 is stronger at structured_output (5 vs 4) and costs less per MTok, but those strengths do not offset Haiku's lead on the writing-specific dimensions we tested.

Claude Haiku 4.5 (Anthropic)

Overall: 4.33/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 5/5
Classification: 4/5
Agentic Planning: 5/5
Structured Output: 4/5
Safety Calibration: 2/5
Strategic Analysis: 5/5
Persona Consistency: 5/5
Constrained Rewriting: 3/5
Creative Problem Solving: 4/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $1.00/MTok
Output: $5.00/MTok

Context Window: 200K


Codestral 2508 (Mistral)

Overall: 3.50/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 4/5
Tool Calling: 5/5
Classification: 3/5
Agentic Planning: 4/5
Structured Output: 5/5
Safety Calibration: 1/5
Strategic Analysis: 2/5
Persona Consistency: 3/5
Constrained Rewriting: 3/5
Creative Problem Solving: 2/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $0.30/MTok
Output: $0.90/MTok

Context Window: 256K


Task Analysis

Creative Writing requires strong persona consistency, non-obvious plot and idea generation, and the ability to compress or rewrite within constraints. Our Creative Writing task uses three tests: creative_problem_solving, persona_consistency, and constrained_rewriting. There is no external benchmark for this comparison, so our internal task scores are the primary signal: Claude Haiku 4.5 scores 4.00 while Codestral 2508 scores 2.67, ranking 28th and 49th, respectively, out of 52 models for this task in our testing.

The component scores show why. Haiku leads on persona_consistency (5 vs 3) and creative_problem_solving (4 vs 2), and the two models tie on constrained_rewriting (3 each). Both score 5 on long_context, so multi-chapter continuity is feasible with either model; both also tie at 5 for tool_calling and faithfulness. Codestral's 5 on structured_output makes it the better fit for rigid JSON or beat-sheet outputs, but Haiku's higher creative and persona scores make it the better choice for compelling fiction and character-driven scenes.
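For readers who want to see the arithmetic, the task scores above are consistent with a plain unweighted mean of the three component tests. The sketch below is an illustration of that averaging assumption, not our actual scoring pipeline; see our methodology for the exact aggregation.

```python
# Illustration only: assumes the task score is the unweighted mean of the
# three Creative Writing component tests (reproduces the published figures).
scores = {
    "Claude Haiku 4.5": {"creative_problem_solving": 4, "persona_consistency": 5, "constrained_rewriting": 3},
    "Codestral 2508":   {"creative_problem_solving": 2, "persona_consistency": 3, "constrained_rewriting": 3},
}

for model, tests in scores.items():
    task_score = sum(tests.values()) / len(tests)
    print(f"{model}: {task_score:.2f}")
# Claude Haiku 4.5: 4.00
# Codestral 2508: 2.67
```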

Practical Examples

  1. Character-driven short stories or serialized novel arcs: choose Claude Haiku 4.5. It scores persona_consistency 5 vs Codestral's 3 and creative_problem_solving 4 vs 2, so it produces more consistent voices and less obvious plot moves across scenes.
  2. Rapid templated story generation (JSON beats, episode skeletons, high volume): choose Codestral 2508. Its structured_output score is 5 vs Haiku's 4, and its prices are lower ($0.30 input / $0.90 output per MTok vs Haiku's $1.00 / $5.00), making it cheaper for high-throughput, format-constrained tasks; see the cost sketch after this list.
  3. Long-form continuity (multi-chapter drafts, worldbuilding): both models score 5 on long_context in our testing, so either can handle long contexts; prefer Haiku when you need stronger character arcs.
  4. Constraint-heavy microfiction (tight character limits): both tie on constrained_rewriting (3), so expect similar performance; use Haiku when persona fidelity matters, or Codestral when you need structured JSON output at lower cost.
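To put the pricing difference in example 2 into concrete terms, here is a rough cost sketch. The workload numbers (call volume and token counts) are hypothetical; the per-MTok prices are the list prices shown above.

```python
# Hypothetical workload: 10,000 templated story generations, each with
# ~1,500 input tokens (prompt + beat sheet) and ~2,000 output tokens.
CALLS, IN_TOK, OUT_TOK = 10_000, 1_500, 2_000

# List prices in USD per million tokens (input, output), as shown above.
PRICES = {
    "Claude Haiku 4.5": (1.00, 5.00),
    "Codestral 2508": (0.30, 0.90),
}

for model, (price_in, price_out) in PRICES.items():
    cost = CALLS * (IN_TOK * price_in + OUT_TOK * price_out) / 1_000_000
    print(f"{model}: ${cost:,.2f}")
# Claude Haiku 4.5: $115.00
# Codestral 2508: $22.50
```

At these assumed volumes Codestral comes in roughly 5x cheaper, which is why it wins the high-throughput, format-constrained scenario despite losing on the creative dimensions.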

Bottom Line

For Creative Writing, choose Claude Haiku 4.5 if you need stronger character voice, richer plot invention, and higher overall creative-writing quality (task score 4.00 vs 2.67). Choose Codestral 2508 if you prioritize rigid structured outputs (structured_output 5 vs 4) and much lower per-MTok costs ($0.30 input / $0.90 output vs Haiku's $1.00 / $5.00) for high-volume, template-driven story production.

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.
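As a worked example of how the card-level numbers relate, the Overall figures above are consistent with an unweighted mean of the twelve benchmark scores. Treat the snippet below as a sanity check under that assumption, not as the scoring pipeline itself.

```python
# Benchmark scores (1-5) from the cards above, in the order listed.
haiku     = [5, 5, 5, 5, 4, 5, 4, 2, 5, 5, 3, 4]  # Claude Haiku 4.5
codestral = [5, 5, 4, 5, 3, 4, 5, 1, 2, 3, 3, 2]  # Codestral 2508

# Assumption: Overall = unweighted mean of the 12 benchmark scores.
print(f"Claude Haiku 4.5: {sum(haiku) / len(haiku):.2f}")         # 4.33
print(f"Codestral 2508:   {sum(codestral) / len(codestral):.2f}")  # 3.50
```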

Frequently Asked Questions