OpenAI · active

GPT-5.4 Mini

OpenAI's efficiency model. Context window: 400K tokens.

Overall score: 4.15 / 5.00 · ranked #46
Input: $0.750 per 1M tokens
Output: $4.50 per 1M tokens
Context: 400K tokens
Blended: $3.56 per 1M tokens (3:1 out:in ratio)

modelpicker.ai · powered by live benchmark data

Scores by test

Structured Output: 5.0
Strategic Analysis: 5.0
Constrained Rewriting: 4.0
Creative Problem Solving: 4.0
Tool Calling: 4.0
Faithfulness: 5.0
Classification: 4.0
Long Context: 5.0
Safety Calibration: 2.0
Persona Consistency: 5.0
Agentic Planning: 4.0
Multilingual: 5.0
Tabular Data: 2.0
AIME 2025: 87.2%
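
The overall score of 4.15 is consistent with an unweighted mean of the thirteen 5-point test scores listed above. The page does not state how the overall score is computed, so the sketch below is only a sanity check under that assumption. It leaves out AIME 2025 because that result is reported on a percentage scale, and the dictionary name `SCORES` is illustrative.

```python
from statistics import mean

# Per-test scores as listed on the page (5-point scale only).
# AIME 2025 is excluded: it is a percentage, not a 5-point score.
SCORES = {
    "Structured Output": 5.0,
    "Strategic Analysis": 5.0,
    "Constrained Rewriting": 4.0,
    "Creative Problem Solving": 4.0,
    "Tool Calling": 4.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 5.0,
    "Safety Calibration": 2.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Multilingual": 5.0,
    "Tabular Data": 2.0,
}

# Assumption: the overall score is a plain average with equal weights.
overall = mean(SCORES.values())
print(f"{overall:.2f} / 5.00")
```

If the site weights tests differently, the result would differ; the match here only shows that equal weighting reproduces the published figure.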

What you need to know

GPT-5.4 Mini stands out for structured output, faithfulness, and long-context processing, each scoring 5/5 on the site's internal tests. It pairs a 400K-token context window with an AIME 2025 score of 87.2%, which suggests strong mathematical reasoning for a mini-tier model.

Pricing is moderate for the performance tier: $0.75 per 1M input tokens and $4.50 per 1M output tokens, for a blended cost of $3.56/MTok at a 3:1 output-to-input ratio. The model also scores 5/5 in strategic analysis and multilingual tasks, but it scores only 2/5 in both tabular data and safety calibration. These gaps suggest it may struggle with precise data extraction from tables and with strict adherence to safety guardrails.
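
The blended figure can be reproduced from the listed input and output prices. The sketch below assumes "3:1 out:in ratio" means a weighted average with three parts output to one part input; the function name `blended_price` and its parameters are illustrative, not part of any published API.

```python
def blended_price(input_per_mtok: float, output_per_mtok: float,
                  out_weight: float = 3.0, in_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given output:input mix."""
    total_weight = out_weight + in_weight
    return (out_weight * output_per_mtok + in_weight * input_per_mtok) / total_weight

# Prices listed on the page: $0.75 input, $4.50 output per 1M tokens.
blended = blended_price(0.75, 4.50)
print(f"${blended:.2f} per 1M tokens")
```

Workloads with a different mix (for example, long prompts with short answers) can pass their own weights to get a more representative figure.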

Use this model if your workflow requires high-fidelity structured data, complex reasoning over large documents, or consistent persona maintenance. Skip this model if your application relies heavily on tabular data processing or requires high-precision safety calibration.

Strengths — Top 3

Structured Output: 5.0/5.0
Strategic Analysis: 5.0/5.0
Faithfulness: 5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration: 2.0/5.0
Tabular Data: 2.0/5.0
Constrained Rewriting: 4.0/5.0
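
Several tests are tied at 5.0 and at 4.0, so the top and bottom three depend on how ties are broken. The sketch below assumes ties are resolved by the order in which the tests appear on the page; that rule is a guess, although it does reproduce both lists above. The names `SCORES`, `top3`, and `bottom3` are illustrative.

```python
# Per-test scores in the order they appear on the page.
SCORES = {
    "Structured Output": 5.0,
    "Strategic Analysis": 5.0,
    "Constrained Rewriting": 4.0,
    "Creative Problem Solving": 4.0,
    "Tool Calling": 4.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 5.0,
    "Safety Calibration": 2.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Multilingual": 5.0,
    "Tabular Data": 2.0,
}

# Python's sort is stable, so tied scores keep their page order
# (assumed tie-break rule, not documented by the site).
by_score_desc = sorted(SCORES.items(), key=lambda item: item[1], reverse=True)
by_score_asc = sorted(SCORES.items(), key=lambda item: item[1])

top3 = [name for name, _ in by_score_desc[:3]]
bottom3 = [name for name, _ in by_score_asc[:3]]

print("Top 3:", top3)
print("Bottom 3:", bottom3)
```

Because Constrained Rewriting is only one of five tests tied at 4.0, its place in the bottom three reflects the tie-break rather than a distinct weakness.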

Similar models

Qwen: Qwen3 235B A22B Instruct 2507 · $0.093 · 4.08
Grok 4.20 · $2.19 · 4.00
Mistral Medium 3.5 · $6.00 · 4.15
Gemma 4 26B A4B · $0.263 · 4.23