Claude Haiku 4.5 vs Gemini 2.5 Flash Lite for Creative Writing

Winner: Claude Haiku 4.5. In our testing, both models score 4/5 on the Creative Writing task overall, but Claude Haiku 4.5 edges out Gemini 2.5 Flash Lite on creative ideation (creative_problem_solving 4 vs 3) and shows stronger strategic and agentic reasoning (strategic_analysis 5 vs 3; agentic_planning 5 vs 4). The two tie on persona_consistency (5) and long_context (5), while Gemini is stronger at constrained_rewriting (4 vs 3) and far cheaper ($0.40 vs $5.00 per MTok of output). For fiction-first work that prioritizes idea quality, character depth, and complex story planning, Claude Haiku 4.5 is the better pick in our benchmarks.

anthropic

Claude Haiku 4.5

Overall: 4.33/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 5/5
Classification: 4/5
Agentic Planning: 5/5
Structured Output: 4/5
Safety Calibration: 2/5
Strategic Analysis: 5/5
Persona Consistency: 5/5
Constrained Rewriting: 3/5
Creative Problem Solving: 4/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $1.00/MTok
Output: $5.00/MTok

Context Window: 200K

modelpicker.net

google

Gemini 2.5 Flash Lite

Overall: 3.92/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 5/5
Classification: 3/5
Agentic Planning: 4/5
Structured Output: 4/5
Safety Calibration: 1/5
Strategic Analysis: 3/5
Persona Consistency: 5/5
Constrained Rewriting: 4/5
Creative Problem Solving: 3/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $0.10/MTok
Output: $0.40/MTok

Context Window: 1049K


Task Analysis

What Creative Writing demands: fiction and storytelling need strong ideation, a consistent character voice, handling of very long contexts (drafts, series), and the ability to produce tight, constrained rewrites when needed. There is no third-party external benchmark for this task, so we rely on our internal task proxies. Both models score 4/5 on our 3-test Creative Writing suite (creative_problem_solving, persona_consistency, constrained_rewriting).

Key differences in our data: Claude Haiku 4.5 scores higher on creative_problem_solving (4 vs 3) and is stronger on strategic_analysis (5 vs 3) and agentic_planning (5 vs 4), which points to better idea generation and multi-step story decomposition. Gemini 2.5 Flash Lite outperforms Claude on constrained_rewriting (4 vs 3), making it the better choice for hard character limits and compression tasks. The two tie on persona_consistency (5) and long_context (5), so both sustain voice and manage long manuscripts.

Practical considerations from the model metadata: Claude Haiku 4.5 has a 200,000-token context window and accepts text and image input, producing text output. Gemini 2.5 Flash Lite supports a larger 1,048,576-token window and additional input modalities (file, audio, video), and its output is much cheaper ($0.40 vs $5.00 per MTok).
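
As a rough check on the 4/5 task score quoted above, the sketch below averages the three proxy tests for each model. The unweighted mean is an assumption made for illustration; the text names the three tests but does not state how they are combined. The names `SCORES` and `task_score` are hypothetical.

```python
from statistics import mean

# The three proxy tests named in the Creative Writing suite above.
CREATIVE_WRITING_TESTS = (
    "creative_problem_solving",
    "persona_consistency",
    "constrained_rewriting",
)

# Per-test scores copied from the benchmark cards in this comparison.
SCORES = {
    "Claude Haiku 4.5": {
        "creative_problem_solving": 4,
        "persona_consistency": 5,
        "constrained_rewriting": 3,
    },
    "Gemini 2.5 Flash Lite": {
        "creative_problem_solving": 3,
        "persona_consistency": 5,
        "constrained_rewriting": 4,
    },
}


def task_score(model: str) -> float:
    """Unweighted mean of the three proxy tests (assumed aggregation)."""
    return mean(SCORES[model][test] for test in CREATIVE_WRITING_TESTS)


for model in SCORES:
    print(f"{model}: {task_score(model):.2f}/5")
```

Under that assumption both models land on the same task score even though their per-test profiles differ, which is why the per-test breakdown matters more than the headline number here.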

Practical Examples

Where Claude Haiku 4.5 shines (use its strengths):

  • Deep ideation and plot generation: stronger creative_problem_solving (4 vs 3) — better at producing non-obvious, feasible story beats and multi-act outlines.
  • Complex character arcs and planning: strategic_analysis 5 and agentic_planning 5 help decompose long-term story goals and recovery strategies for plot holes.
  • Collaborative drafting where high-quality first-pass prose matters, even at a higher cost (output at $5.00/MTok).

Where Gemini 2.5 Flash Lite shines:

  • Microfiction and social copy with strict length limits: constrained_rewriting 4 vs 3 makes Gemini better for precise compression and line edits.
  • Low-cost, high-volume creative runs: output costs $0.40/MTok (12.5x cheaper than Claude Haiku 4.5's $5.00/MTok), useful for iterative A/B testing of story openings.
  • Multimodal source material (audio/video/files): Gemini’s wider modality support lets you feed recorded interviews or reference files into prompts.
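
To make the cost gap concrete, here is a minimal sketch that prices a batch of story openings on both models using the per-MTok rates listed in the pricing cards. The workload (200 openings, about 500 prompt tokens and 800 output tokens each) is a hypothetical example, and the calculation ignores caching, batch discounts, and any tiered pricing. `PRICES` and `run_cost` are illustrative names, not part of any vendor API.

```python
# Prices in USD per million tokens (MTok), taken from the pricing cards above.
PRICES = {
    "Claude Haiku 4.5": {"input": 1.00, "output": 5.00},
    "Gemini 2.5 Flash Lite": {"input": 0.10, "output": 0.40},
}


def run_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a batch, using flat per-MTok list prices only."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 200 openings, ~500 prompt and ~800 output tokens each.
n_runs, prompt_tokens, output_tokens = 200, 500, 800

for model in PRICES:
    cost = run_cost(model, n_runs * prompt_tokens, n_runs * output_tokens)
    print(f"{model}: ${cost:.3f}")

# Ratio of output prices, which is where the 12.5x figure comes from.
output_ratio = (
    PRICES["Claude Haiku 4.5"]["output"] / PRICES["Gemini 2.5 Flash Lite"]["output"]
)
print(f"Output price ratio: {output_ratio:.1f}x")
```

With these assumed token counts the batch comes to about $0.90 on Claude Haiku 4.5 and about $0.07 on Gemini 2.5 Flash Lite. The absolute numbers are small either way, so the gap matters mainly at high volume.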

Shared strengths (both models): persona_consistency 5 and long_context 5 — both maintain character voice over long drafts and handle large documents reliably in our tests.

Bottom Line

For Creative Writing, choose Claude Haiku 4.5 if you prioritize higher-quality idea generation, multi-step story planning, and stronger strategic and agentic reasoning (creative_problem_solving 4 vs 3; strategic_analysis 5 vs 3). Choose Gemini 2.5 Flash Lite if you need constrained rewrites and extreme cost efficiency (constrained_rewriting 4 vs 3; output at $0.40 vs $5.00 per MTok), or if you want multimodal inputs and the larger context window.

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.
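
The overall figures on the two score cards (4.33 and 3.92) are consistent with an unweighted mean of the twelve per-benchmark scores. The sketch below reproduces them under that assumption and lists the benchmarks where the models differ. The aggregation method is not stated in this section, and the snake_case keys for benchmarks that the text only names in title case are an assumed spelling.

```python
from statistics import mean

# Benchmark order follows the score cards above; some key spellings are assumed.
BENCHMARKS = [
    "faithfulness", "long_context", "multilingual", "tool_calling",
    "classification", "agentic_planning", "structured_output",
    "safety_calibration", "strategic_analysis", "persona_consistency",
    "constrained_rewriting", "creative_problem_solving",
]
CLAUDE = [5, 5, 5, 5, 4, 5, 4, 2, 5, 5, 3, 4]
GEMINI = [5, 5, 5, 5, 3, 4, 4, 1, 3, 5, 4, 3]


def overall(scores: list[int]) -> float:
    """Unweighted mean rounded to two decimals (assumed aggregation)."""
    return round(mean(scores), 2)


# Positive delta: Claude Haiku 4.5 scores higher; negative: Gemini does.
deltas = {
    name: claude - gemini
    for name, claude, gemini in zip(BENCHMARKS, CLAUDE, GEMINI)
    if claude != gemini
}

print("Claude Haiku 4.5 overall:", overall(CLAUDE))
print("Gemini 2.5 Flash Lite overall:", overall(GEMINI))
for name, delta in deltas.items():
    print(f"{name}: {delta:+d}")
```

The delta listing shows that the models differ on six of the twelve benchmarks, with constrained_rewriting the only one where Gemini 2.5 Flash Lite leads.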

Frequently Asked Questions