Claude Haiku 4.5 vs Claude Opus 4.6 for Constrained Rewriting

Winner: Claude Haiku 4.5. In our testing, both Claude Haiku 4.5 and Claude Opus 4.6 score 3/5 on Constrained Rewriting (ranked 31 of 52). Because they tie on the task score and on the core proxies that matter for tight compression (Structured Output 4, Faithfulness 5, Long Context 5, Tool Calling 5), Haiku 4.5 is the practical winner for most users: its output cost is $5/MTok versus $25/MTok for Opus, and Anthropic positions Haiku as its efficiency-focused model. Choose Opus 4.6 when safety calibration (Opus 5 vs Haiku 2) or a creative problem-solving edge (Opus 5 vs Haiku 4) is required despite the higher cost.

Claude Haiku 4.5 (Anthropic)

Overall: 4.33/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 5/5
Classification: 4/5
Agentic Planning: 5/5
Structured Output: 4/5
Safety Calibration: 2/5
Strategic Analysis: 5/5
Persona Consistency: 5/5
Constrained Rewriting: 3/5
Creative Problem Solving: 4/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $1.00/MTok
Output: $5.00/MTok
Context Window: 200K


Claude Opus 4.6 (Anthropic)

Overall: 4.58/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 5/5
Classification: 3/5
Agentic Planning: 5/5
Structured Output: 4/5
Safety Calibration: 5/5
Strategic Analysis: 5/5
Persona Consistency: 5/5
Constrained Rewriting: 3/5
Creative Problem Solving: 5/5

External Benchmarks

SWE-bench Verified: 78.7%
MATH Level 5: N/A
AIME 2025: 94.4%

Pricing

Input: $5.00/MTok
Output: $25.00/MTok
Context Window: 1M


Task Analysis

What Constrained Rewriting demands: precise length control and strict adherence to a character or byte budget while preserving meaning and required content. The key capabilities are Structured Output (schema/format adherence), Faithfulness (preserving source facts and intent), Long Context (retaining the full source when it is long), Persona Consistency (when voice must be preserved), and deterministic, tool-like behavior for exact truncation or compression.

In our testing, both Claude Haiku 4.5 and Claude Opus 4.6 score 3/5 on the constrained_rewriting test (rank 31/52 for each). The supporting proxy metrics show parity on the most relevant dimensions: Structured Output 4, Faithfulness 5, Long Context 5, and Tool Calling 5 for both models. The differences that explain real-world tradeoffs: Opus 4.6 has much higher Safety Calibration (5 vs 2) and higher Creative Problem Solving (5 vs 4), while Haiku 4.5 is faster, more efficient, and far cheaper per token (input/output pricing: $1/$5 per MTok for Haiku vs $5/$25 for Opus). Because the primary task score is tied, these supporting dimensions drive the recommendation.
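Since neither model reliably hits an exact character budget on its own (both score 3/5 here), a common pattern is to enforce the limit in application code: request a rewrite, check the length, and retry with a tighter instruction. Below is a minimal sketch using the Anthropic Python SDK; the model ID string, retry count, and prompt wording are illustrative assumptions, not values from our tests.

```python
# Minimal sketch: enforce a hard character budget around a constrained rewrite.
# Assumes the Anthropic Python SDK is installed and ANTHROPIC_API_KEY is set.
import anthropic

client = anthropic.Anthropic()

def rewrite_within_budget(text: str, max_chars: int,
                          model: str = "claude-haiku-4-5",  # assumed model ID
                          max_attempts: int = 3) -> str:
    """Ask the model for a rewrite, retrying with a tighter prompt if it runs over."""
    prompt = (
        f"Rewrite the following text in at most {max_chars} characters. "
        f"Preserve all facts and the original intent.\n\n{text}"
    )
    for _ in range(max_attempts):
        response = client.messages.create(
            model=model,
            max_tokens=1024,
            messages=[{"role": "user", "content": prompt}],
        )
        candidate = response.content[0].text.strip()
        if len(candidate) <= max_chars:
            return candidate
        # Over budget: feed the overlong attempt back with a tighter instruction.
        prompt = (
            f"Your previous rewrite was {len(candidate)} characters, over the "
            f"{max_chars}-character limit. Shorten it further without dropping "
            f"facts:\n\n{candidate}"
        )
    # Last resort: hard truncation keeps the budget at the cost of fluency.
    return candidate[:max_chars]
```

This wrapper treats the model as a best-effort compressor and makes the budget a deterministic guarantee of the surrounding code, which is what the "tool-like behavior" requirement above amounts to in practice.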

Practical Examples

  1. High-volume UI copy compression (same rules for every item): use Claude Haiku 4.5. Both models score 3/5 on constrained_rewriting and share Structured Output 4 and Faithfulness 5, but Haiku costs $5 per output MTok vs $25 for Opus, which is materially cheaper for batch jobs (see the cost sketch after this list).
  2. Safety-critical legal redaction into a strict character limit: use Claude Opus 4.6. Both score 3/5, but Opus's Safety Calibration is 5 vs Haiku's 2 in our testing, reducing risk when content must be refused or strictly sanitized.
  3. Creative newsletter compression, where tone must survive heavy shortening: prefer Opus 4.6. Its Creative Problem Solving score of 5 vs Haiku's 4 gives it an edge for inventive rephrasing within tight limits.
  4. Low-latency, cost-sensitive in-product shortening: prefer Haiku 4.5 for its efficiency and lower output cost while retaining equal Faithfulness and Structured Output compliance in our tests.
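To make the cost gap in example 1 concrete, here is the back-of-envelope arithmetic implied by the listed prices. The job size (100,000 items, roughly 400 input and 200 output tokens each) is an illustrative assumption:

```python
# Batch cost estimate from the per-MTok prices on the cards above.
# Job size (items and tokens per item) is an illustrative assumption.
PRICES = {  # model: (input $/MTok, output $/MTok)
    "Claude Haiku 4.5": (1.00, 5.00),
    "Claude Opus 4.6": (5.00, 25.00),
}

items = 100_000
in_tokens_per_item, out_tokens_per_item = 400, 200

for model, (in_price, out_price) in PRICES.items():
    cost = (items * in_tokens_per_item / 1e6) * in_price \
         + (items * out_tokens_per_item / 1e6) * out_price
    print(f"{model}: ${cost:,.2f}")

# Haiku: 40 MTok in * $1 + 20 MTok out * $5  = $140.00
# Opus:  40 MTok in * $5 + 20 MTok out * $25 = $700.00
```

At these assumed volumes the same batch costs 5x more on Opus with no difference in the task score, which is why Haiku wins the high-volume case.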

Bottom Line

For Constrained Rewriting, choose Claude Haiku 4.5 if you need equivalent compression quality at much lower cost and higher throughput (both score 3/5; Haiku output costs $5 per MTok vs $25 for Opus). Choose Claude Opus 4.6 if safety-critical filtering or stronger creative rephrasing matters more than cost (Opus scores 5 on both Safety Calibration and Creative Problem Solving in our testing).

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.
