Claude Haiku 4.5 vs DeepSeek V3.1 Terminus for Creative Writing
Winner: Claude Haiku 4.5. In our Creative Writing tests, Haiku scores 4.00 vs DeepSeek V3.1 Terminus's 3.67 on the 3-task suite, a margin of 0.33 points. Haiku's advantages are clear in persona_consistency (5 vs 4), faithfulness (5 vs 3), and tool_calling (5 vs 3), which translate to more reliable character voice, fewer prompt-driven hallucinations, and safer integration of external tools or structured research. DeepSeek wins only on structured_output (5 vs 4) and is substantially cheaper ($0.79/MTok output vs Haiku's $5.00/MTok), but overall Haiku is the stronger Creative Writing model in our benchmarks.
anthropic · Claude Haiku 4.5
Pricing: Input $1.00/MTok · Output $5.00/MTok
modelpicker.net
deepseek · DeepSeek V3.1 Terminus
Pricing: Input $0.21/MTok · Output $0.79/MTok
Task Analysis
Creative Writing demands coherent long-form memory, consistent character voice, imaginative solutions, and the ability to follow formatting constraints when required. Our task uses three tests: creative_problem_solving, persona_consistency, and constrained_rewriting. No external benchmark applies to this task, so the decision rests on our internal scores: Haiku's taskScore is 4.00 vs DeepSeek's 3.67. Both models tie on creative_problem_solving (4) and constrained_rewriting (3), and both handle long contexts equally well (long_context 5). Haiku's higher persona_consistency (5 vs 4) and faithfulness (5 vs 3) matter for multi-chapter arcs and for avoiding unwanted inventions; its tool_calling (5 vs 3) supports workflows that call research or fact-checking tools. DeepSeek's structured_output advantage (5 vs 4) helps when strict output formats or schema compliance are required. Cost is also a practical consideration: Haiku's output price is $5.00/MTok vs DeepSeek's $0.79/MTok (a price ratio of ~6.33x), which affects how cheaply you can scale and iterate.
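The taskScore figures above are consistent with a simple average of the three per-test scores. A minimal sketch, assuming an unweighted mean (our reading of the published numbers, not a documented formula):

```python
# Per-test scores from this comparison (1-5 scale, LLM-judged).
haiku = {"creative_problem_solving": 4, "persona_consistency": 5, "constrained_rewriting": 3}
deepseek = {"creative_problem_solving": 4, "persona_consistency": 4, "constrained_rewriting": 3}

def task_score(scores):
    """Unweighted mean of the per-test scores (assumed aggregation)."""
    return sum(scores.values()) / len(scores)

print(round(task_score(haiku), 4))     # 4.0
print(round(task_score(deepseek), 4))  # 3.6667
```

The 0.33-point margin comes entirely from the one-point persona_consistency gap spread across three tests.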
Practical Examples
Where Claude Haiku 4.5 shines:
1. Serialization and novels: persona_consistency 5 means a more stable narrator and character voice across long arcs (Haiku taskScore 4.00).
2. Research-backed fiction: faithfulness 5 and tool_calling 5 reduce hallucination risk when integrating facts or citations.
3. Agentic writing workflows: Haiku's higher agentic_planning and tool_calling scores make multi-step story generation with retrieval or external prompts more reliable.

Where DeepSeek V3.1 Terminus shines:
1. Strict format or schema-driven creative outputs: structured_output 5 helps enforce screenplay or verse templates and machine-readable formats.
2. High-volume, iterative drafting on a budget: DeepSeek's $0.79/MTok output vs Haiku's $5.00/MTok lowers generation cost for bulk drafts.
3. Fast constrained rewrites that require exact JSON or formatting compliance (structured_output advantage).

Concrete numbers to ground the choice: persona_consistency 5 vs 4 (Haiku), faithfulness 5 vs 3 (Haiku), structured_output 4 vs 5 (DeepSeek), output cost $5.00/MTok (Haiku) vs $0.79/MTok (DeepSeek).
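To make the cost gap concrete, here is a hypothetical back-of-envelope using the output prices quoted above. The draft count and tokens-per-draft are illustrative assumptions, not benchmark data:

```python
# Output prices quoted in this comparison ($ per million output tokens).
HAIKU_OUT = 5.00      # Claude Haiku 4.5
DEEPSEEK_OUT = 0.79   # DeepSeek V3.1 Terminus

def output_cost(n_drafts, tokens_per_draft, price_per_mtok):
    """Total output-token cost in dollars for a bulk drafting run."""
    return n_drafts * tokens_per_draft / 1_000_000 * price_per_mtok

# Illustrative run: 500 drafts at ~4,000 output tokens each (2 MTok total).
print(f"Haiku:    ${output_cost(500, 4000, HAIKU_OUT):.2f}")     # $10.00
print(f"DeepSeek: ${output_cost(500, 4000, DEEPSEEK_OUT):.2f}")  # $1.58
```

At this volume the absolute dollar difference is small; the ~6.3x ratio matters most when iteration counts or draft lengths grow by orders of magnitude.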
Bottom Line
For Creative Writing, choose Claude Haiku 4.5 if you need stronger character/persona consistency, higher faithfulness to prompts and sources, or reliable tool calling in multi-step writing workflows (taskScore 4.00 vs 3.67). Choose DeepSeek V3.1 Terminus if you must enforce strict output formats or need a much lower per-token output cost ($0.79/MTok vs $5.00/MTok) for high-volume drafting.
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.