models/qwen/qwen3-6-max-preview
Q
Qwen·active

Qwen: Qwen3.6 Max Preview

Qwen's flagship model. Context window: 262K tokens.

Overall score
4.85
/5.00 · ranked #1
Input
$1.04
per 1M tokens
Output
$6.24
per 1M tokens
Context
262K
tokens
Blended
$4.94
3:1 out:in ratio

Price drops, new benchmarks, model updates. Stay current on Qwen: Qwen3.6 Max Preview.

One email per change. Unsubscribe anytime.

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →
Structured Output
5.0
Strategic Analysis
5.0
Constrained Rewriting
4.0
Creative Problem Solving
5.0
Tool Calling
5.0
Faithfulness
5.0
Classification
4.0
Long Context
5.0
Safety Calibration
5.0
Persona Consistency
5.0
Agentic Planning
5.0
Multilingual
5.0
Tabular Data
5.0
SWE-bench Verified
76.7
AIME 2025
91.1

What you need to know

Qwen3.6 Max Preview is currently the highest-performing model in our dataset, ranking first among 71 evaluated models. Its primary differentiator is a near-perfect internal score of 4.85/5.0, driven by maximum scores in complex logic tasks including agentic planning, strategic analysis, and tool calling. It demonstrates exceptional reliability in maintaining persona consistency and faithfulness across a wide range of prompts.

The model handles large-scale data efficiently with a 262K context window and a perfect 5/5 score for long-context processing. While it excels in structured output and tabular data, it shows a slight relative dip in classification and constrained rewriting, though these remain strong at 4/5.

At a blended cost of $4.94/MTok, this model sits in a premium price tier. However, the cost is justified by its versatility; it performs at a top-tier level across almost every technical category, from multilingual support to creative problem solving, reducing the need to chain multiple specialized models.

Use this model if you are building complex autonomous agents, requiring high-precision structured data, or processing very long documents. Skip this model if your workload consists primarily of simple classification tasks where a cheaper, smaller model would suffice.

Strengths — Top 3

Structured Output5.0/5.0
Strategic Analysis5.0/5.0
Creative Problem Solving5.0/5.0

Relative weaknesses — Bottom 3

Constrained Rewriting4.0/5.0
Classification4.0/5.0
Structured Output5.0/5.0

Similar models

AAnthropic: Claude Opus 4.8 (Fast)$40.004.77XMiMo-V2.5$0.2454.69ZGLM-4.7$1.414.69OGPT-5.2$10.944.69