minimax·active

MiniMax: MiniMax M2

MiniMax's efficiency model. Context window: 205K tokens.

Overall score: 3.85 / 5.00 (ranked #80)
Input: $0.255 per 1M tokens
Output: $1.00 per 1M tokens
Context: 205K tokens
Blended: $0.814 per 1M tokens (3:1 out:in ratio)

modelpicker.ai · powered by live benchmark data

Scores by test

Structured Output: 4.0
Strategic Analysis: 5.0
Constrained Rewriting: 4.0
Creative Problem Solving: 4.0
Tool Calling: 5.0
Faithfulness: 5.0
Classification: 4.0
Long Context: 4.0
Safety Calibration: 1.0
Persona Consistency: 5.0
Agentic Planning: 4.0
Multilingual: (no score listed)
Tabular Data: 5.0
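
The page does not say how the overall score of 3.85 is derived from the per-test scores. As a purely arithmetic check, the sketch below averages the twelve listed scores two ways: over the twelve scored tests only, and over all thirteen tests with the unscored Multilingual test counted as 0. Treating a missing score as 0 is an assumption made here for illustration, not a documented part of the site's methodology.

```python
# The twelve per-test scores listed on the page, in listed order.
# Multilingual is omitted because the page shows no score for it.
listed_scores = [4.0, 5.0, 4.0, 4.0, 5.0, 5.0, 4.0, 4.0, 1.0, 5.0, 4.0, 5.0]

TOTAL_TESTS = 13  # twelve scored tests plus the unscored Multilingual test

# Average over scored tests only.
mean_scored_only = sum(listed_scores) / len(listed_scores)

# Average over all tests, counting the missing score as 0 (assumption).
mean_missing_as_zero = sum(listed_scores) / TOTAL_TESTS

print(round(mean_scored_only, 2))      # 4.17
print(round(mean_missing_as_zero, 2))  # 3.85
```

Only the second figure matches the listed overall score, so the headline number may understate the model on the tests that were actually scored.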

What you need to know

MiniMax M2 is a general-purpose model whose best results are in reasoning and operational reliability. It scores 5.0/5.0 in strategic analysis, tool calling, and faithfulness, which suggests it executes multi-step plans well and stays close to source material. These strengths, combined with a 205K-token context window, make it a strong candidate for agentic workflows and data-heavy analysis.

The pricing is competitive for its performance tier: $0.255 per 1M input tokens and $1.00 per 1M output tokens, for a blended cost of $0.814/MTok at a 3:1 output-to-input ratio. Developers get strong reasoning and tabular data processing at a price that allows scaling across larger datasets. While it ranks #80 overall, its 5.0 scores in tool calling and persona consistency suggest it outperforms its general rank in specialized automation tasks.
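
The blended figure can be reproduced as a weighted average of the listed input and output prices. The sketch below assumes the "3:1 out:in ratio" means three output tokens are weighted for every one input token; the function names `blended_price` and `estimate_cost` are illustrative and not part of any API.

```python
def blended_price(input_price: float, output_price: float,
                  out_weight: float = 3.0, in_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given out:in token mix."""
    total_weight = out_weight + in_weight
    return (out_weight * output_price + in_weight * input_price) / total_weight


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float = 0.255, output_price: float = 1.00) -> float:
    """Dollar cost of a workload, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Listed prices: $0.255 input, $1.00 output, per 1M tokens.
price = blended_price(0.255, 1.00)
print(price)  # about 0.81375, consistent with the listed $0.814

# Hypothetical workload: 2M input tokens and 6M output tokens (a 3:1 mix).
print(estimate_cost(2_000_000, 6_000_000))
```

Workloads that are heavier on input than the assumed 3:1 mix, such as long-context analysis with short answers, will come in below the blended rate.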

A critical weakness is safety calibration, where it scores 1.0/5.0, the lowest of its listed scores. This suggests weak built-in guardrails: the model may generate unfiltered content or ignore safety constraints. Developers should put an external moderation layer in front of it before deploying it in user-facing environments.
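
One way to picture an external moderation layer is a wrapper that checks every draft before it reaches the user. The sketch below is a minimal illustration under stated assumptions: `generate` stands in for whatever client actually calls the model, and the keyword check is a placeholder for a real moderation model or service. All names here are hypothetical.

```python
from typing import Callable

# Hypothetical placeholder terms; a real deployment would rely on a proper
# moderation model or service rather than a keyword list.
BLOCKED_TERMS = {"example-banned-term"}


def is_allowed(text: str) -> bool:
    """Return True when the text contains none of the blocked terms."""
    lowered = text.lower()
    return not any(term in lowered for term in BLOCKED_TERMS)


def moderated_reply(prompt: str,
                    generate: Callable[[str], str],
                    check: Callable[[str], bool] = is_allowed,
                    fallback: str = "Sorry, I can't help with that.") -> str:
    """Generate a draft, then release it only if the check passes."""
    draft = generate(prompt)
    return draft if check(draft) else fallback


def fake_generate(prompt: str) -> str:
    """Stand-in for a real model call (assumption, for illustration only)."""
    return f"echo: {prompt}"


safe = moderated_reply("hello", fake_generate)
blocked = moderated_reply("say example-banned-term", fake_generate)
print(safe)
print(blocked)
```

Keeping the check as an injected callable means the keyword filter can be swapped for a stronger classifier without touching the calling code.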

Use this model if you are building autonomous agents, complex analysis tools, or applications requiring high faithfulness and tool integration. Skip this model if your application requires strict safety alignment or if you cannot implement your own output filtering.

Strengths — Top 3

Strategic Analysis: 5.0/5.0
Tool Calling: 5.0/5.0
Faithfulness: 5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration: 1.0/5.0
Structured Output: 4.0/5.0
Constrained Rewriting: 4.0/5.0
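
Several tests tie at 5.0 and at 4.0, so the top-3 and bottom-3 lists depend on how ties are broken. The sketch below shows that a stable sort over the scores in their listed order reproduces both lists above. Breaking ties by listing order is an assumption that happens to fit; the page does not state the site's actual rule. Multilingual is left out because no score is listed for it.

```python
# Per-test scores in the order the page lists them.
scores = {
    "Structured Output": 4.0,
    "Strategic Analysis": 5.0,
    "Constrained Rewriting": 4.0,
    "Creative Problem Solving": 4.0,
    "Tool Calling": 5.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 4.0,
    "Safety Calibration": 1.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Tabular Data": 5.0,
}

# sorted() is stable, so tests with equal scores keep their listed order.
top3 = sorted(scores, key=lambda name: -scores[name])[:3]
bottom3 = sorted(scores, key=lambda name: scores[name])[:3]

print(top3)     # ['Strategic Analysis', 'Tool Calling', 'Faithfulness']
print(bottom3)  # ['Safety Calibration', 'Structured Output', 'Constrained Rewriting']
```

Under this reading, Persona Consistency and Tabular Data are equally strong at 5.0 and are missing from the top 3 only because of the tie-break, and only Safety Calibration is a genuine low score.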

Similar models

ByteDance Seed: Seed 1.6 Flash · $0.244 · 3.77
xAI: Grok Build 0.1 · $1.75 · 4.31
Ministral 3 14B 2512 · $0.200 · 3.77
MiMo-V2.5-Pro · $0.761 · 4.46