Devstral Small 1.1 vs Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is the clear winner on breadth — it outscores Devstral Small 1.1 on 9 of 12 benchmarks in our testing, including agentic planning (5 vs 2), strategic analysis (5 vs 2), and creative problem solving (5 vs 2). Devstral Small 1.1 holds one win: classification (4 vs 2), which matters for routing and categorization pipelines. The price gap is extreme — output tokens cost $0.30/M on Devstral Small 1.1 versus $12/M on Gemini 3.1 Pro Preview, a 40x difference that makes the calculus entirely about whether you need frontier-level reasoning or can scope your task narrowly enough for a cheaper model.

Mistral
Devstral Small 1.1

Overall: 3.08/5 (Usable)

Benchmark Scores

Faithfulness: 4/5
Long Context: 4/5
Multilingual: 4/5
Tool Calling: 4/5
Classification: 4/5
Agentic Planning: 2/5
Structured Output: 4/5
Safety Calibration: 2/5
Strategic Analysis: 2/5
Persona Consistency: 2/5
Constrained Rewriting: 3/5
Creative Problem Solving: 2/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: N/A

Pricing

Input: $0.100/MTok
Output: $0.300/MTok

Context Window: 131K

modelpicker.net

Google
Gemini 3.1 Pro Preview

Overall: 4.33/5 (Strong)

Benchmark Scores

Faithfulness: 5/5
Long Context: 5/5
Multilingual: 5/5
Tool Calling: 4/5
Classification: 2/5
Agentic Planning: 5/5
Structured Output: 5/5
Safety Calibration: 2/5
Strategic Analysis: 5/5
Persona Consistency: 5/5
Constrained Rewriting: 4/5
Creative Problem Solving: 5/5

External Benchmarks

SWE-bench Verified: N/A
MATH Level 5: N/A
AIME 2025: 95.6%

Pricing

Input: $2.00/MTok
Output: $12.00/MTok

Context Window: 1,049K


Benchmark Analysis

Across our 12-test suite, Gemini 3.1 Pro Preview wins 9 benchmarks, Devstral Small 1.1 wins 1, and they tie on 2.

Where Gemini 3.1 Pro Preview dominates:

  • Agentic planning: 5 vs 2. Devstral Small 1.1 ranks 53rd of 54 on this test — near the bottom of all models we've tested. Gemini 3.1 Pro Preview ties for 1st with 14 others. For multi-step agent workflows requiring goal decomposition and failure recovery, this is a critical gap.
  • Strategic analysis: 5 vs 2. Gemini 3.1 Pro Preview ties for 1st with 25 other models; Devstral Small 1.1 ranks 44th of 54. Real-world implication: nuanced tradeoff reasoning and analysis tasks are not where Devstral Small 1.1 should be deployed.
  • Creative problem solving: 5 vs 2. Gemini 3.1 Pro Preview ties for 1st with 7 others; Devstral Small 1.1 ranks 47th of 54. Generating non-obvious, feasible ideas is a clear Gemini 3.1 Pro Preview strength.
  • Persona consistency: 5 vs 2. Devstral Small 1.1 ranks 51st of 53 — one of its weakest scores. Chatbot and character applications should avoid it.
  • Faithfulness: 5 vs 4. Both score reasonably, but Gemini 3.1 Pro Preview ties for 1st with 32 others while Devstral Small 1.1 ranks 34th of 55.
  • Long context: 5 vs 4. Gemini 3.1 Pro Preview ties for 1st with 36 others and has a 1,048,576-token context window vs Devstral Small 1.1's 131,072. The context window difference alone is meaningful for document-heavy workflows.
  • Multilingual: 5 vs 4. Both are competent, but Gemini 3.1 Pro Preview ties for 1st with 34 others.
  • Constrained rewriting: 4 vs 3. A meaningful gap for compression-heavy editorial tasks.
  • Structured output: 5 vs 4. Gemini 3.1 Pro Preview ties for 1st with 24 others; Devstral Small 1.1 ranks 26th of 54. Both are capable here, but Gemini 3.1 Pro Preview has a slight edge on JSON schema compliance.
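Structured-output scores like these largely come down to whether a model's JSON reliably parses and conforms to a schema. As a minimal stdlib-only sketch of the kind of check involved (the schema and the sample outputs here are hypothetical, not drawn from our test suite):

```python
import json

# Hypothetical schema: required keys and their expected Python types.
SCHEMA = {"title": str, "priority": int, "tags": list}

def validate(raw: str) -> tuple[bool, str]:
    """Parse a model's raw output and check it against SCHEMA."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError as exc:
        return False, f"invalid JSON: {exc}"
    if not isinstance(obj, dict):
        return False, "top-level value is not an object"
    for key, typ in SCHEMA.items():
        if key not in obj:
            return False, f"missing key: {key}"
        if not isinstance(obj[key], typ):
            return False, f"wrong type for {key}"
    return True, "ok"

good = '{"title": "Fix login bug", "priority": 2, "tags": ["auth"]}'
bad = '{"title": "Fix login bug", "priority": "high"}'
print(validate(good))  # (True, 'ok')
print(validate(bad))   # fails on the priority field
```

In practice a schema-compliance benchmark runs many such prompts and counts the fraction of outputs that pass a check like this.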

Where Devstral Small 1.1 wins:

  • Classification: 4 vs 2. This is Devstral Small 1.1's clearest win and a real differentiator. It ties for 1st with 29 other models out of 53 tested; Gemini 3.1 Pro Preview ranks 51st of 53. Routing, tagging, and categorization tasks are the one domain where Devstral Small 1.1 clearly beats Gemini 3.1 Pro Preview in our testing.
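A score pattern like this (cheap model strong at classification, expensive model strong at reasoning) is what makes two-tier routing attractive: the small model labels each request, and only hard categories get escalated. A sketch under assumed names; `classify` is a stand-in for a Devstral Small 1.1 call, and the `ESCALATE` routing table is illustrative, not from our testing:

```python
# Categories that get escalated to the expensive model
# (illustrative routing policy, not a recommendation from the source).
ESCALATE = {"strategic_analysis", "agentic_planning"}

def classify(text: str) -> str:
    """Stand-in for a cheap-model classification call.

    A real pipeline would call Devstral Small 1.1 here; this keyword
    heuristic exists only so the sketch runs end to end.
    """
    if "tradeoff" in text or "roadmap" in text:
        return "strategic_analysis"
    if "plan" in text and "step" in text:
        return "agentic_planning"
    return "faq"

def route(text: str) -> str:
    """Pick a model tier based on the cheap classifier's label."""
    label = classify(text)
    return "gemini-3.1-pro-preview" if label in ESCALATE else "devstral-small-1.1"

print(route("What are the tradeoffs in our Q3 roadmap?"))  # escalated
print(route("How do I reset my password?"))                # stays on the cheap tier
```

The economics only work if the classifier is both cheap and accurate, which is exactly the combination Devstral Small 1.1's 4/5 classification score at $0.30/M output suggests.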

Ties:

  • Tool calling: both score 4, both rank 18th of 54 (tied with 28 others). Equivalent for function-calling pipelines.
  • Safety calibration: both score 2, both rank 12th of 55 (tied with 19 others). Neither model stands out here.
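Since the two models tie on tool calling, the deciding factor for function-calling pipelines is usually cost rather than capability; the dispatch side of the pipeline looks the same either way. A minimal sketch with hypothetical tools (the tool names and the `{"name": ..., "arguments": {...}}` payload shape are illustrative, not any vendor's exact wire format):

```python
import json

# Registry of callable tools (hypothetical examples).
TOOLS = {
    "get_weather": lambda city: f"Sunny in {city}",
    "add": lambda a, b: a + b,
}

def dispatch(tool_call_json: str):
    """Execute a model-emitted tool call of the form
    {"name": ..., "arguments": {...}}."""
    call = json.loads(tool_call_json)
    fn = TOOLS.get(call["name"])
    if fn is None:
        raise ValueError(f"unknown tool: {call['name']}")
    return fn(**call["arguments"])

print(dispatch('{"name": "add", "arguments": {"a": 2, "b": 3}}'))  # 5
```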

External benchmark (Epoch AI): Gemini 3.1 Pro Preview scores 95.6% on AIME 2025, ranking 2nd of 23 models tested — placing it among the top math reasoning models by that external measure. No AIME 2025 score is available for Devstral Small 1.1 in our data.

Benchmark | Devstral Small 1.1 | Gemini 3.1 Pro Preview
Faithfulness | 4/5 | 5/5
Long Context | 4/5 | 5/5
Multilingual | 4/5 | 5/5
Tool Calling | 4/5 | 4/5
Classification | 4/5 | 2/5
Agentic Planning | 2/5 | 5/5
Structured Output | 4/5 | 5/5
Safety Calibration | 2/5 | 2/5
Strategic Analysis | 2/5 | 5/5
Persona Consistency | 2/5 | 5/5
Constrained Rewriting | 3/5 | 4/5
Creative Problem Solving | 2/5 | 5/5
Summary | 1 win | 9 wins

Pricing Analysis

Devstral Small 1.1 is priced at $0.10/M input and $0.30/M output. Gemini 3.1 Pro Preview costs $2.00/M input and $12.00/M output: 20x more on input, 40x more on output. At 1B output tokens/month, that's $300 vs $12,000. At 10B tokens, $3,000 vs $120,000. At 100B tokens, $30,000 vs $1,200,000. The gap at the top of that range, roughly $1.17M per month, is a budget line, not a rounding error. Note also that Gemini 3.1 Pro Preview uses reasoning tokens (flagged in the payload), which can significantly inflate billed output in complex workflows, meaning real costs may exceed the headline rates.

Devstral Small 1.1 is priced for high-volume, narrow-task deployments. Gemini 3.1 Pro Preview is priced for tasks where the cost of a wrong answer (in a business decision, a complex agent workflow, or a multimodal pipeline) exceeds the cost of the tokens. Teams running classification pipelines, code routing, or structured extraction at volume should take the price gap seriously.
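The listed per-million rates make this arithmetic easy to reproduce. A small sketch; the 1.3x reasoning-token multiplier is an illustrative assumption, not a measured figure:

```python
def monthly_cost(input_tok, output_tok, in_rate, out_rate, reasoning_multiplier=1.0):
    """Monthly spend in dollars, given token volumes and $/M-token rates.

    reasoning_multiplier inflates billed output tokens for models that
    bill hidden reasoning tokens as output (illustrative assumption).
    """
    billed_out = output_tok * reasoning_multiplier
    return (input_tok * in_rate + billed_out * out_rate) / 1_000_000

# (input $/M, output $/M) from the pricing section.
DEVSTRAL = (0.10, 0.30)
GEMINI = (2.00, 12.00)

# 1B output tokens/month, ignoring input for a like-for-like comparison.
print(monthly_cost(0, 1_000_000_000, *DEVSTRAL))  # 300.0
print(monthly_cost(0, 1_000_000_000, *GEMINI))    # 12000.0
print(monthly_cost(0, 1_000_000_000, *GEMINI, reasoning_multiplier=1.3))
```

The last line shows why the reasoning-token caveat matters: a 30% inflation on billed output turns $12,000/month into $15,600/month at the same nominal volume.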

Real-World Cost Comparison

Task | Devstral Small 1.1 | Gemini 3.1 Pro Preview
Chat response | <$0.001 | $0.0064
Blog post | <$0.001 | $0.025
Document batch | $0.017 | $0.640
Pipeline run | $0.170 | $6.40
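These per-task figures follow directly from the per-token rates once you fix a token budget per task. The budgets below are our own illustrative assumptions chosen to reproduce the table (e.g. a chat response as roughly 200 input and 500 output tokens); they are not published by either vendor:

```python
# (input $/M, output $/M) from the pricing section.
RATES = {"devstral": (0.10, 0.30), "gemini": (2.00, 12.00)}

# Illustrative (input, output) token budgets per task, chosen to
# match the cost table; not vendor-published figures.
TASKS = {
    "chat_response": (200, 500),
    "blog_post": (500, 2_000),
    "document_batch": (20_000, 50_000),
    "pipeline_run": (200_000, 500_000),
}

def task_cost(model: str, task: str) -> float:
    """Dollar cost of one task run for the given model."""
    in_rate, out_rate = RATES[model]
    in_tok, out_tok = TASKS[task]
    return (in_tok * in_rate + out_tok * out_rate) / 1_000_000

print(round(task_cost("gemini", "chat_response"), 4))   # 0.0064
print(round(task_cost("gemini", "document_batch"), 3))  # 0.64
print(round(task_cost("devstral", "pipeline_run"), 3))  # 0.17
```

Because output tokens dominate every budget here, the per-task ratio stays close to the 40x output-price gap rather than the 20x input gap.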

Bottom Line

Choose Devstral Small 1.1 if: you are running high-volume classification, routing, or tagging pipelines where cost efficiency is critical and the task is narrow; your budget cannot absorb $12/M output tokens at scale; you need a capable structured-output or tool-calling model for well-defined workflows where agentic reasoning and creative analysis are not required; or you need text-to-text at volume with predictable costs.

Choose Gemini 3.1 Pro Preview if: you need a capable agentic system that can decompose complex goals and recover from failures (scored 5 vs 2 in our testing); your workflows involve strategic analysis, creative problem solving, or long-document reasoning; you need multimodal input (image, audio, video, file) — Devstral Small 1.1 is text-only; you require a 1M-token context window for large codebases or document sets; or you're building a system where the cost of failure is high enough to justify 40x higher output token pricing.

How We Test

We test every model against our 12-benchmark suite covering tool calling, agentic planning, creative problem solving, safety calibration, and more. Each test is scored 1–5 by an LLM judge. Read our full methodology.

Frequently Asked Questions