Question 1

Is the winner based on an external benchmark?

Accepted Answer

No. The payload contains no external benchmark for Data Analysis. The winner is determined from our internal task composite: Claude Haiku 4.5 scores 4.3333 vs Devstral Medium 3.3333 on our 3-test suite (strategic_analysis, classification, structured_output).

Question 2

How much more does Claude Haiku 4.5 cost per mTok compared to Devstral Medium?

Accepted Answer

In the provided data Haiku 4.5 has input_cost_per_mtok = 1 and output_cost_per_mtok = 5; Devstral Medium has input_cost_per_mtok = 0.4 and output_cost_per_mtok = 2. Use these figures to estimate runtime cost differences for your workload.

Question 3

When should I pick Devstral Medium despite the lower task score?

Accepted Answer

Pick Devstral Medium for cost-sensitive, high-volume jobs that are primarily about format conversion, classification, or simple batch transformations where strategic tradeoffs and complex tool orchestration are not required. It ties with Haiku on structured_output and classification (both score 4).

Question 4

How do context windows and modalities affect this choice?

Accepted Answer

Claude Haiku 4.5 has a 200000-token context window and supports text+image->text, which helps large multi-file or multimodal analyses. Devstral Medium has a 131072-token context window and text->text modality. If you must analyze very large contexts or images + text together, Haiku’s context and modality advantage is relevant.

Question 5

Which specific benchmark dimensions drive Haiku’s Data Analysis lead?

Accepted Answer

In our testing Haiku leads on strategic_analysis (5 vs 2), tool_calling (5 vs 3), faithfulness (5 vs 4), long_context (5 vs 4), and agentic_planning (5 vs 4). Structured_output and classification are ties at 4.

Claude Haiku 4.5 vs Devstral Medium for Data Analysis

Claude Haiku 4.5

Devstral Medium

Task Analysis

Practical Examples

Bottom Line

How We Test

Frequently Asked Questions