models/mistral/codestral-2508
M
Mistral·active

Codestral 2508

Mistral's efficiency model. Context window: 256K tokens.

Overall score
3.23
/5.00 · ranked #79
Input
$0.300
per 1M tokens
Output
$0.900
per 1M tokens
Context
256K
tokens
Blended
$0.750
3:1 out:in ratio

Price drops, new benchmarks, model updates. Stay current on Codestral 2508.

One email per change. Unsubscribe anytime.

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →
Structured Output
5.0
Strategic Analysis
2.0
Constrained Rewriting
3.0
Creative Problem Solving
2.0
Tool Calling
5.0
Faithfulness
5.0
Classification
3.0
Long Context
5.0
Safety Calibration
1.0
Persona Consistency
3.0
Agentic Planning
4.0
Multilingual
4.0
Tabular Data

What you need to know

Codestral 2508 is optimized for high-precision execution and long-context retrieval rather than cognitive reasoning. It achieves perfect scores in faithfulness, structured output, and tool calling, making it a reliable engine for programmatic tasks where strict adherence to a schema or a large codebase is required. Its 256K context window is fully leveraged, as evidenced by its top-tier long-context performance.

The model struggles significantly with high-level cognition. With low scores in strategic analysis and creative problem solving, it cannot be relied upon for architectural design or complex logic puzzles. Additionally, a critical failure in safety calibration indicates a lack of robust guardrails, which may be a risk for public-facing deployments.

At a blended cost of $0.750/MTok, the model is priced as a mid-tier utility. While it lacks the general intelligence of top-ranked models—ranking 61st out of 71 overall—the price is justified for developers who need a dependable tool for data extraction and API orchestration rather than a reasoning agent.

Use this model if you need a reliable tool for structured data generation, long-document analysis, or agentic tool calling. Skip this model if your application requires nuanced strategic reasoning, creative synthesis, or strict safety filtering.

Strengths — Top 3

Structured Output5.0/5.0
Tool Calling5.0/5.0
Faithfulness5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration1.0/5.0
Strategic Analysis2.0/5.0
Creative Problem Solving2.0/5.0

Similar models

MMinistral 3 3B 2512$0.1003.31MDevstral Small 1.1$0.2502.85QQwen: Qwen3 Coder 30B A3B Instruct$0.2203.23MMinistral 3 8B 2512$0.1503.38