xAI·active

xAI: Grok Build 0.1

xAI's mid-tier model. Context window: 256K tokens.

Overall score: 4.31/5.00 · ranked #32
Input: $1.00 per 1M tokens
Output: $2.00 per 1M tokens
Context: 256K tokens
Blended: $1.75 per 1M tokens (3:1 out:in ratio)
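
The blended figure is consistent with a weighted average of the output and input prices at the stated 3:1 ratio. The page does not spell out the formula, so the sketch below assumes a simple token-weighted mean; `blended_price` and its parameter names are illustrative, not taken from the source.

```python
def blended_price(input_price: float, output_price: float,
                  out_weight: float = 3.0, in_weight: float = 1.0) -> float:
    """Weighted mean price per 1M tokens.

    Assumption: the blend weights output and input prices 3:1,
    matching the "3:1 out:in ratio" label on the page.
    """
    total_weight = out_weight + in_weight
    return (output_price * out_weight + input_price * in_weight) / total_weight


# Listed prices: $1.00 input, $2.00 output per 1M tokens.
print(blended_price(1.00, 2.00))  # prints 1.75
```

Under this assumption, (3 × $2.00 + 1 × $1.00) / 4 reproduces the listed $1.75.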


modelpicker.ai · powered by live benchmark data

Scores by test

Structured Output: 5.0
Strategic Analysis: 5.0
Constrained Rewriting: 4.0
Creative Problem Solving: 4.0
Tool Calling: 5.0
Faithfulness: 5.0
Classification: 4.0
Long Context: 5.0
Safety Calibration: 1.0
Persona Consistency: 5.0
Agentic Planning: 4.0
Multilingual: 4.0
Tabular Data: 5.0
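
The 13 per-test scores above average to the listed overall score. The sketch below assumes the overall score is an unweighted mean, which the page does not state explicitly; it only shows that the numbers are consistent with that reading.

```python
# Per-test scores as listed on the page (each out of 5.0).
scores = {
    "Structured Output": 5.0,
    "Strategic Analysis": 5.0,
    "Constrained Rewriting": 4.0,
    "Creative Problem Solving": 4.0,
    "Tool Calling": 5.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 5.0,
    "Safety Calibration": 1.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Multilingual": 4.0,
    "Tabular Data": 5.0,
}

# Assumption: the overall score is a plain (unweighted) mean of the tests.
overall = sum(scores.values()) / len(scores)
print(round(overall, 2))  # prints 4.31
```

The single 1.0 for Safety Calibration accounts for most of the gap between this model and a 4.6 average across the other twelve tests.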

What you need to know

Grok Build 0.1 scores highest on precision-oriented technical tasks: structured output, tool calling, and strategic analysis each earn a perfect 5/5. It also scores 5/5 on tabular data and long context, which makes it a fit for programmatic workflows and data-heavy applications that rely on its 256K-token window.

The model's main risk is safety calibration, where it scores 1/5. A score this low suggests weak built-in guardrails, which may lead to unpredictable or unfiltered responses. Developers should add their own filtering layer before using the model in user-facing applications.
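
One way to picture such a filtering layer is a wrapper that checks both the prompt and the reply before anything reaches the user. The sketch below is a toy illustration, not a production filter: `guarded_call`, `keyword_policy`, and `fake_model` are hypothetical names, the keyword check stands in for whatever moderation service an application actually uses, and no real xAI API is called.

```python
from typing import Callable

REFUSAL = "Sorry, I can't help with that."


def guarded_call(prompt: str,
                 generate: Callable[[str], str],
                 is_allowed: Callable[[str], bool]) -> str:
    """Screen the prompt, call the model, then screen the reply."""
    if not is_allowed(prompt):
        return REFUSAL
    reply = generate(prompt)
    return reply if is_allowed(reply) else REFUSAL


# Toy stand-ins so the sketch runs without network access.
BLOCKED_TERMS = {"build a bomb"}  # hypothetical policy list


def keyword_policy(text: str) -> bool:
    """Return True when none of the blocked terms appear in the text."""
    lowered = text.lower()
    return not any(term in lowered for term in BLOCKED_TERMS)


def fake_model(prompt: str) -> str:
    """Placeholder for a real model call; simply echoes the prompt."""
    return f"echo: {prompt}"


print(guarded_call("Summarize this table", fake_model, keyword_policy))
```

Checking the reply as well as the prompt matters here because a low safety-calibration score concerns what the model returns, not only what users send.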

At a blended cost of $1.75/MTok (3:1 output-to-input ratio), the model is not a budget option. Its agentic planning and classification scores are solid at 4.0/5, but it does not offer a clear cost-to-performance advantage over other top-tier models, so its value depends on its specific strengths in structured data and tool integration.

Use this model if you require a high-reliability engine for tool calling, complex strategic analysis, or processing large datasets within a 256K context. Skip this model if your application requires built-in safety alignment or if you are operating on a tight budget.
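
For budget planning, the listed per-token prices translate directly into a per-request cost. The sketch below assumes "256K" means 256,000 tokens and uses an arbitrary 1,000-token reply; `request_cost` is an illustrative helper, and real billing may differ (for example through caching or tiered pricing, which the page does not describe).

```python
# Listed prices in USD per 1M tokens.
INPUT_PRICE_PER_MTOK = 1.00
OUTPUT_PRICE_PER_MTOK = 2.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed prices."""
    total = (input_tokens * INPUT_PRICE_PER_MTOK
             + output_tokens * OUTPUT_PRICE_PER_MTOK)
    return total / 1_000_000


# Assumption: a full context of 256,000 input tokens and a 1,000-token reply.
print(f"${request_cost(256_000, 1_000):.3f}")  # prints $0.258
```

At these prices a full-context request costs roughly a quarter of a dollar, so workloads that fill the window on every call should be budgeted by volume.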

Strengths — Top 3

Structured Output: 5.0/5.0
Strategic Analysis: 5.0/5.0
Tool Calling: 5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration: 1.0/5.0
Constrained Rewriting: 4.0/5.0
Creative Problem Solving: 4.0/5.0

Similar models

Gemma 4 26B A4B · $0.263 · 4.23
GPT-5 · $7.81 · 4.54
Gemini 3 Flash Preview · $2.38 · 4.46
Qwen 3.7 Max · $6.25 · 4.62
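
A rough way to compare these listings is overall score per dollar. The sketch below assumes the prices shown for the similar models are blended USD per 1M tokens, like this model's $1.75 (the page does not label them), and `score_per_dollar` is an illustrative metric, not one the site defines.

```python
# (name, assumed blended USD per 1M tokens, overall score out of 5)
models = [
    ("Grok Build 0.1", 1.75, 4.31),
    ("Gemma 4 26B A4B", 0.263, 4.23),
    ("GPT-5", 7.81, 4.54),
    ("Gemini 3 Flash Preview", 2.38, 4.46),
    ("Qwen 3.7 Max", 6.25, 4.62),
]


def score_per_dollar(price: float, score: float) -> float:
    """Overall score divided by price; higher means more score per dollar."""
    return score / price


# Sort from most to least score per dollar.
ranked = sorted(models, key=lambda m: score_per_dollar(m[1], m[2]), reverse=True)

for name, price, score in ranked:
    print(f"{name:<24} {score_per_dollar(price, score):6.2f}")
```

A single ratio like this ignores per-test differences such as the 1.0 safety calibration score, so it works best as a first screen rather than a ranking.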