models/deepseek/deepseek-r1-0528
D
DeepSeek·active

R1 0528

DeepSeek's mid-tier model. Context window: 164K tokens.

Overall score
4.46
/5.00 · ranked #20
Input
$0.500
per 1M tokens
Output
$2.15
per 1M tokens
Context
164K
tokens
Blended
$1.74
3:1 out:in ratio

Price drops, new benchmarks, model updates. Stay current on R1 0528.

One email per change. Unsubscribe anytime.

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →
Structured Output
5.0
Strategic Analysis
5.0
Constrained Rewriting
5.0
Creative Problem Solving
4.0
Tool Calling
5.0
Faithfulness
5.0
Classification
4.0
Long Context
4.0
Safety Calibration
2.0
Persona Consistency
5.0
Agentic Planning
5.0
Multilingual
5.0
Tabular Data
4.0
MATH Level 5
96.6
AIME 2025
66.4

What you need to know

R1 0528 is a high-performance model that excels in precision-oriented tasks, specifically agentic planning, tool calling, and faithfulness. Its strongest technical differentiator is its mathematical capability, evidenced by a 96.6% score on MATH Level 5 and 66.4% on AIME 2025. These metrics indicate a model capable of handling complex logical reasoning and quantitative analysis far beyond standard general-purpose models.

The model maintains a high internal quality average of 4.46/5.0, with perfect scores in persona consistency, multilingual support, and long context handling. With a 164K context window and top-tier scores in faithfulness, it is reliable for processing large datasets without losing coherence or introducing hallucinations.

At a blended cost of $1.74/MTok, the model is priced competitively for its rank (#12 of 71). It provides elite-level reasoning and agentic capabilities at a price point that is significantly lower than many proprietary models in the same performance tier.

Use this model if your application requires rigorous mathematical accuracy, complex tool integration, or stable persona management across long conversations. Skip this model if your primary need is highly creative writing or nuanced strategic analysis, where it scores slightly lower relative to its own technical benchmarks.

Strengths — Top 3

Structured Output5.0/5.0
Strategic Analysis5.0/5.0
Constrained Rewriting5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration2.0/5.0
Creative Problem Solving4.0/5.0
Classification4.0/5.0

Similar models

GGemma 4 31B$0.3074.38QQwen: Qwen3.6 Plus$1.544.54OGPT-5$7.814.54NNVIDIA: Nemotron 3 Super$0.3604.46