mistral

Codestral 2508

Codestral 2508 is Mistral's code-specialized model, designed for low-latency, high-frequency coding tasks including fill-in-the-middle, code correction, and test generation. At $0.30 input / $0.90 output per million tokens, it is priced below most general-purpose models while targeting a focused use case: code-related workflows where faithfulness to source, reliable tool invocation, and structured output matter more than creative reasoning or strategic analysis. In our testing, it ranked 43rd out of 52 models overall — but that aggregate rank masks strong performance on the benchmarks most relevant to coding assistants.

Performance

Codestral 2508's three strongest benchmarks in our testing are tool calling (5/5, tied for 1st with 16 other models out of 54 tested), faithfulness (5/5, tied for 1st with 32 other models out of 55 tested), and structured output (5/5, tied for 1st with 24 other models out of 54 tested). Long context also scored 5/5 (tied for 1st with 36 other models out of 55 tested). These are the four dimensions most directly relevant to agentic coding — accurate function calls, reliable source adherence, schema-compliant output, and retrieval in large codebases. The model's notable weaknesses are strategic analysis (2/5, rank 44 of 54) and creative problem solving (2/5, rank 47 of 54), both of which fall well below the field median. Safety calibration scored 1/5 (rank 32 of 55). Overall rank: 43 out of 52 tested models.

Pricing

Codestral 2508 costs $0.30 per million input tokens and $0.90 per million output tokens. At 1 million output tokens/month, that is $0.90; at 10 million output tokens, $9.00. It undercuts GPT-4o ($10/MTok output), GPT-4.1 ($8/MTok output), and even Mistral Large 3 2512 ($1.50/MTok output) while outscoring all three on faithfulness and tool calling in our tests. For code-focused applications — where you run many short completions with tool calls — the economics are favorable. The 256,000-token context window accommodates large codebases without tiered pricing penalties.
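The cost arithmetic above is straightforward to sketch. A minimal helper, using only the published $0.30/$0.90 per-million-token prices (the function name is ours, not part of any API):

```python
# Cost model from the prices above: $0.30/MTok input, $0.90/MTok output.
INPUT_PER_MTOK = 0.30
OUTPUT_PER_MTOK = 0.90

def codestral_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated Codestral 2508 cost in USD for a given token volume."""
    return (input_tokens / 1e6) * INPUT_PER_MTOK + (output_tokens / 1e6) * OUTPUT_PER_MTOK

# The 10M-output-tokens/month figure from the text:
print(f"${codestral_cost(0, 10_000_000):.2f}")  # $9.00
```

Because there is no tiered pricing on the 256K context window, the same linear formula holds regardless of prompt size.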


Overall: 3.50/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 4/5
Tool Calling: 5/5
Classification: 3/5
Agentic Planning: 4/5
Structured Output: 5/5
Safety Calibration: 1/5
Strategic Analysis: 2/5
Persona Consistency: 3/5
Constrained Rewriting: 3/5
Creative Problem Solving: 2/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $0.300/MTok
Output: $0.900/MTok
Context Window: 256K

modelpicker.net

Real-World Costs

Chat response: <$0.001
Blog post: $0.0020
Document batch: $0.051
Pipeline run: $0.510

Pricing vs Performance

Output cost per million tokens (log scale) vs average score across our 12 internal benchmarks


Try It

from openai import OpenAI

# Codestral 2508 is served through OpenRouter's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_KEY",  # replace with your OpenRouter API key
)

response = client.chat.completions.create(
    model="mistralai/codestral-2508",
    messages=[
        {"role": "user", "content": "Hello, Codestral 2508!"}
    ],
)

print(response.choices[0].message.content)

Recommendation

Codestral 2508 is the right choice for developers building code-specific pipelines where tool calling accuracy, faithfulness to source, and structured output are the primary requirements. At $0.90/MTok output, it delivers 5/5 on all three of those dimensions and supports fill-in-the-middle use cases. It is not a good fit for general-purpose assistants, strategic analysis tasks, or creative work — scoring 2/5 on both strategic analysis and creative problem solving. Developers who need a single model to cover both code and reasoning should evaluate models with more balanced scores, such as Mistral Medium 3.1 (avg 4.25, $2.00/MTok output).
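For the tool-calling pipelines recommended above, the usual pattern is an OpenAI-style tool schema plus a local dispatch table that executes whatever function the model requests. A minimal sketch, assuming OpenRouter passes OpenAI-format `tools` through to the provider (check provider docs); the `run_tests` helper is hypothetical:

```python
import json

# OpenAI-style tool schema (assumed to be passed through by OpenRouter).
tools = [{
    "type": "function",
    "function": {
        "name": "run_tests",  # hypothetical helper for a coding agent
        "description": "Run the project's test suite and return results.",
        "parameters": {
            "type": "object",
            "properties": {"path": {"type": "string"}},
            "required": ["path"],
        },
    },
}]

def run_tests(path: str) -> str:
    """Stub implementation; a real agent would shell out to the test runner."""
    return json.dumps({"path": path, "passed": True})

# Map tool names the model may emit to local functions.
DISPATCH = {"run_tests": run_tests}

def handle_tool_call(name: str, arguments: str) -> str:
    """Execute the tool the model requested; arguments arrive as a JSON string."""
    args = json.loads(arguments)
    return DISPATCH[name](**args)

# The model returns calls like this in response.choices[0].message.tool_calls:
print(handle_tool_call("run_tests", '{"path": "tests/"}'))
```

The returned JSON string is appended to the conversation as a `tool`-role message so the model can continue the loop.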

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.

Frequently Asked Questions