Question 1

Why does Devstral 2 2512 rank so much higher for creative writing if both models tie on creative problem solving?

Accepted Answer

The task score aggregates three benchmarks: creative problem solving, persona consistency, and constrained rewriting. Both models tie on creative problem solving at 4/5, so the ranking is decided by the other two. Devstral 2 2512 scores 5/5 on constrained rewriting versus Claude Haiku 4.5's 3/5, a 2-point gap. Haiku 4.5 recovers some ground with a 5/5 persona consistency score, but not enough to close that gap, and the composite comes out at 4.33 for Devstral 2 2512 versus 4.0 for Haiku 4.5.
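The arithmetic behind those composites can be checked with a short sketch. It assumes the composite is an unweighted mean of the three benchmark scores, which matches the published 4.33 and 4.0 but is not confirmed by the source. Devstral 2 2512's persona consistency score is not stated above; the 4/5 used here is inferred from its 4.33 composite under that same assumption. The dictionary keys and the function name are illustrative.

```python
# Benchmark scores out of 5. Values marked "inferred" are not stated in the answer above.
HAIKU_4_5 = {
    "creative_problem_solving": 4,
    "persona_consistency": 5,
    "constrained_rewriting": 3,
}
DEVSTRAL_2_2512 = {
    "creative_problem_solving": 4,
    "persona_consistency": 4,  # inferred from the 4.33 composite, assuming an unweighted mean
    "constrained_rewriting": 5,
}


def composite(scores):
    """Unweighted mean of the benchmark scores, rounded to two decimals (assumed method)."""
    return round(sum(scores.values()) / len(scores), 2)


if __name__ == "__main__":
    print("Claude Haiku 4.5:", composite(HAIKU_4_5))
    print("Devstral 2 2512:", composite(DEVSTRAL_2_2512))
```

Under these assumptions the one-point net lead across three benchmarks accounts for the 0.33 difference between the two composites.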

Question 2

Is Devstral 2 2512 actually built for creative writing, or is it primarily a coding model?

Accepted Answer

Devstral 2 2512's description in our data identifies it as specializing in agentic coding. What we measure, however, is benchmark performance, and in our creative writing task tests it scores 4.33/5 and ranks 5th of 52 models. Its constrained rewriting strength likely transfers from its general instruction-following and format-adherence capabilities, regardless of its primary design focus. Judge it by its scores on the task, not its marketing category.

Question 3

Does the price difference between these models matter for creative writing use cases?

Accepted Answer

It depends on volume. Devstral 2 2512 costs $0.40 input / $2.00 output per million tokens; Claude Haiku 4.5 costs $1.00 input / $5.00 output per million tokens, which is 2.5x the price on both input and output. For a single user writing a novel or generating occasional creative content, the dollar difference is negligible. For a platform generating thousands of creative variations daily, that 2.5x cost gap becomes significant. Combined with Devstral 2 2512's higher task score, cost-conscious developers have two reasons to consider it.
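To see how the gap scales with volume, here is a minimal cost sketch using the per-million-token prices quoted above. The workload figures (10,000 requests per day, 500 input tokens and 1,000 output tokens per request) are hypothetical and chosen only for illustration, as are the function and variable names.

```python
# (input, output) price in USD per million tokens, as quoted in the answer above.
PRICES_PER_MILLION = {
    "Devstral 2 2512": (0.40, 2.00),
    "Claude Haiku 4.5": (1.00, 5.00),
}


def daily_cost(model, requests_per_day, input_tokens, output_tokens):
    """Estimated daily spend in USD for a fixed per-request token budget."""
    input_price, output_price = PRICES_PER_MILLION[model]
    per_request = input_tokens * input_price + output_tokens * output_price
    return requests_per_day * per_request / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: adjust these numbers to match your own traffic.
    for model in PRICES_PER_MILLION:
        cost = daily_cost(model, requests_per_day=10_000, input_tokens=500, output_tokens=1_000)
        print(f"{model}: ${cost:.2f} per day")
```

With this hypothetical workload the estimate is about $22 per day for Devstral 2 2512 and about $55 per day for Claude Haiku 4.5. Because both input and output prices differ by the same factor, the 2.5x ratio holds for any mix of input and output tokens.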

Question 4

Which model is better for writing in languages other than English?

Accepted Answer

Both score 5/5 on multilingual in our testing, sharing 1st place in a group of 35 models out of 55 tested. Neither has an advantage here. If your creative writing is primarily non-English, the constrained rewriting and persona consistency differences are still the deciding factors.

Question 5

Does Claude Haiku 4.5's image input support matter for creative writing?

Accepted Answer

It can. Claude Haiku 4.5 supports text+image input, meaning you can feed it a photograph, painting, or visual reference and ask it to write ekphrastic poetry, scene descriptions, or stories inspired by the image. Devstral 2 2512 is text-only (text→text modality per our data). If visual prompting is part of your creative workflow, Haiku 4.5 is the only option of the two.

Claude Haiku 4.5 vs Devstral 2 2512 for Creative Writing
