deepseek
R1
R1 is DeepSeek's open-source reasoning model, built for tasks that reward extended chain-of-thought processing. It runs 671 billion parameters with 37 billion active per inference pass. In our 12-test benchmark suite, R1 ranks 28th out of 52 active models with standout scores in strategic analysis (5/5), creative problem solving (5/5), faithfulness (5/5), and persona consistency (5/5). It is a text-only model with a 64,000 token context window. At $0.70/M input and $2.50/M output, it competes with Gemini 2.5 Flash ($0.30/M input, $2.50/M output) at the same output price but with a narrower context window and no multimodal support. Its key differentiator is reasoning transparency — thinking tokens are exposed and can be examined, which matters for auditability in analytical workflows.
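That reasoning transparency takes a concrete form in the open-weights release: R1 emits its chain of thought inside `<think>…</think>` tags ahead of the final answer. A minimal sketch for separating the two (the helper name is ours; it assumes the tagged output format of the open-source release):

```python
import re

def split_reasoning(text: str) -> tuple[str, str]:
    """Split an R1 completion into (reasoning, answer).

    The open-weights release wraps chain-of-thought in <think>...</think>;
    everything after the closing tag is the final answer.
    """
    match = re.search(r"<think>(.*?)</think>", text, re.DOTALL)
    if match is None:
        return "", text.strip()          # no reasoning block present
    reasoning = match.group(1).strip()
    answer = text[match.end():].strip()  # final answer follows the tag
    return reasoning, answer

raw = "<think>17 * 3 = 51, so the answer is 51.</think>The answer is 51."
reasoning, answer = split_reasoning(raw)
print(answer)  # The answer is 51.
```

Logging the `reasoning` half alongside the `answer` is what makes audit trails possible in analytical workflows.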
Performance
R1's top three strengths in our testing: strategic analysis (5/5, tied for 1st with 25 other models out of 54 tested), creative problem solving (5/5, tied for 1st with 7 other models out of 54), and faithfulness (5/5, tied for 1st with 32 others out of 55). Persona consistency and multilingual also scored 5/5. On external benchmarks, R1 scored 93.1 on MATH Level 5 (rank 8 of 14 models tested externally) and 53.3 on AIME 2025 (rank 17 of 23). Classification is a significant weakness at 2/5 (rank 51 of 53) — near the bottom of the field. Safety calibration scored 1/5 (rank 32 of 55, median is 2/5), which is below average. Long context scored 4/5 but ranked only 38th of 55 tested, suggesting room for improvement on very long documents.
Pricing
R1 costs $0.70 per million input tokens and $2.50 per million output tokens. At 10 million output tokens per month, that is $25; at 100 million output tokens, $250. For input-heavy workloads (e.g., large document analysis), note that $0.70/M input is higher than Gemini 2.5 Flash ($0.30/M) at the same output price. Within the broader catalog, output pricing ranges from $0.10 to $25 per million tokens, so R1 sits in the mid-tier. One critical pricing consideration: R1's reasoning tokens count toward token consumption. If you enable thinking with large budgets, your actual token usage — and cost — will be significantly higher than the response length alone suggests.
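To see how reasoning tokens inflate the bill, here is a small cost sketch at R1's published rates. The 3:1 reasoning-to-answer ratio below is an illustrative assumption, not a measured figure:

```python
INPUT_PER_MTOK = 0.70    # R1 input price, $/million tokens
OUTPUT_PER_MTOK = 2.50   # R1 output price, $/million tokens

def monthly_cost(input_tokens: int, answer_tokens: int,
                 reasoning_tokens: int = 0) -> float:
    """Reasoning tokens are billed as output, so they stack onto answer tokens."""
    output_tokens = answer_tokens + reasoning_tokens
    return (input_tokens * INPUT_PER_MTOK
            + output_tokens * OUTPUT_PER_MTOK) / 1_000_000

# 10M answer tokens/month with thinking disabled: the $25 figure above
print(monthly_cost(0, 10_000_000))              # 25.0
# Same answers, but thinking emits 3 reasoning tokens per answer token
print(monthly_cost(0, 10_000_000, 30_000_000))  # 100.0
```

The second case quadruples the bill without changing what the user sees, which is why thinking budgets deserve the same scrutiny as prompt length.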
Pricing vs Performance
(Chart: output cost per million tokens, log scale, vs average score across our 12 internal benchmarks.)
Try It
from openai import OpenAI

# Route requests to R1 through OpenRouter's OpenAI-compatible endpoint
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_KEY",
)

response = client.chat.completions.create(
    model="deepseek/deepseek-r1",
    messages=[
        {"role": "user", "content": "Hello, R1!"},
    ],
)

print(response.choices[0].message.content)
Recommendation
R1 is well-suited for strategic analysis, multi-step reasoning, and creative problem solving where reasoning transparency matters. Teams that need to audit or review thinking chains will find its exposed reasoning tokens valuable. It also performs strongly on faithfulness tasks, making it a solid choice for RAG and summarization where accuracy to source material is critical. Avoid R1 for classification tasks (2/5, rank 51 of 53) or applications requiring broad safety calibration (1/5). Also consider that the 64K context window is significantly smaller than peers like Gemini 2.5 Flash (1M tokens) or R1 0528. If reasoning depth is needed but classification or safety is also required, evaluate alternatives across those dimensions.
How We Test
We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.