models/qwen/qwen3-235b-a22b-2507

Qwen·active

Qwen: Qwen3 235B A22B Instruct 2507

Name: Qwen: Qwen3 235B A22B Instruct 2507
Brand: Qwen
Price: 0.60 USD
Availability: InStock
Rating: 4.08 (13 reviews)

Qwen's efficiency model. Context window: 262K tokens.

Overall score

4.08

/5.00 · ranked #92

Input

$0.150

per 1M tokens

Output

$0.598

per 1M tokens

Context

262K

tokens

Blended

$0.486

3:1 out:in ratio

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →

Structured Output

5.0

Strategic Analysis

5.0

Constrained Rewriting

4.0

Creative Problem Solving

4.0

Tool Calling

4.0

Faithfulness

5.0

Classification

3.0

Long Context

5.0

Safety Calibration

1.0

Persona Consistency

5.0

Agentic Planning

4.0

Multilingual

5.0

Tabular Data

3.0

What you need to know

Qwen3 235B A22B Instruct 2507 is built for high-complexity cognitive tasks and long-form processing. It excels in strategic analysis, persona consistency, and faithfulness, making it a reliable choice for applications requiring strict adherence to a specific voice or complex reasoning over large datasets. Its 262K context window is supported by a perfect 5/5 internal score for long context, ensuring it maintains coherence across extensive inputs.

From a cost perspective, the model is positioned as a budget-friendly option for its scale, with a blended cost of $0.435/MTok. While it ranks #94 out of 130 models overall, its strength is concentrated in specialized outputs rather than general utility. It delivers perfect scores in structured output and multilingual capabilities, though it performs mediocrely in basic classification and tabular data handling.

Developers should be aware of a significant weakness in safety calibration. A 1/5 score indicates the model frequently struggles to distinguish between harmful and benign requests, often resulting in the over-refusal of legitimate prompts. This makes it less suitable for customer-facing bots where friction-less interaction is a priority.

Use this model if you need an affordable, high-capacity engine for strategic planning, multilingual structured data generation, or deep analysis of long documents. Skip this model if your workflow relies heavily on data classification, tabular formatting, or requires a high degree of nuance in safety filtering to avoid false positives.

Strengths — Top 3

Structured Output5.0/5.0

Strategic Analysis5.0/5.0

Faithfulness5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration1.0/5.0

Classification3.0/5.0

Tabular Data3.0/5.0