models/xai/grok-4-3

xAI·active

Grok 4.3

xAI's efficiency model. Long-context specialist with 1M window.

Overall score

4.15

/5.00 · ranked #60

Input

$1.25

per 1M tokens

Output

$2.50

per 1M tokens

Context

tokens

Blended

$2.19

3:1 out:in ratio

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →

Structured Output

5.0

Strategic Analysis

5.0

Constrained Rewriting

4.0

Creative Problem Solving

5.0

Tool Calling

4.0

Faithfulness

4.0

Classification

3.0

Long Context

5.0

Safety Calibration

2.0

Persona Consistency

5.0

Agentic Planning

4.0

Multilingual

4.0

Tabular Data

4.0

What you need to know

Grok 4.3 distinguishes itself through high-level reasoning and structural precision. It achieves perfect scores in structured output, strategic analysis, and creative problem solving, making it a strong candidate for complex architectural planning or generating strictly formatted data. Its 1M token context window is fully leveraged, scoring 5/5 in long context performance, which allows for the processing of massive datasets without significant degradation in retrieval or coherence.

At a blended cost of $2.19/MTok, this model sits in a mid-to-high price tier. While the pricing is substantial, the value is concentrated in its agentic capabilities and persona consistency. However, the model struggles with basic classification tasks and exhibits a significant weakness in safety calibration, scoring only 2/5. This indicates a tendency to bypass safety guardrails or fail to adhere to strict content filtering.

Use this model if your workflow requires high-reasoning strategic analysis, large-scale document processing, or reliable JSON/structured outputs. Skip this model if your application requires strict safety alignment, high-accuracy classification, or if you are operating on a tight budget where lower-cost models suffice for simpler tasks.

Strengths — Top 3

Structured Output5.0/5.0

Strategic Analysis5.0/5.0

Creative Problem Solving5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration2.0/5.0

Classification3.0/5.0

Constrained Rewriting4.0/5.0