models/google/gemini-3-flash-preview
G
Google·active

Gemini 3 Flash Preview

Google's mid-tier model. Long-context specialist with 1.0M window.

Overall score
4.46
/5.00 · ranked #19
Input
$0.500
per 1M tokens
Output
$3.00
per 1M tokens
Context
1.0M
tokens
Blended
$2.38
3:1 out:in ratio

Price drops, new benchmarks, model updates. Stay current on Gemini 3 Flash Preview.

One email per change. Unsubscribe anytime.

modelpicker.aipowered by live benchmark data

Scores by test

Methodology →
Structured Output
5.0
Strategic Analysis
5.0
Constrained Rewriting
4.0
Creative Problem Solving
5.0
Tool Calling
5.0
Faithfulness
5.0
Classification
4.0
Long Context
5.0
Safety Calibration
1.0
Persona Consistency
5.0
Agentic Planning
5.0
Multilingual
5.0
Tabular Data
4.0
SWE-bench Verified
75.4
AIME 2025
92.8

What you need to know

Gemini 3 Flash Preview is defined by its massive 1.0M token context window and high-tier performance across agentic planning, tool calling, and structured output. With a 75.4% score on SWE-bench Verified and 92.8% on AIME 2025, the model demonstrates technical reasoning and coding capabilities that exceed typical lightweight model expectations.

At a blended cost of $2.38 per million tokens, this model is positioned as a high-value option for complex automation. It maintains maximum internal scores (5/5) in strategic analysis, multilingual support, and faithfulness, making it a bargain for developers who need frontier-level reasoning without the cost of a flagship model.

The primary technical risk is a critical failure in safety calibration, which scored 1/5. This indicates a propensity for hallucinations or unsafe outputs that will require rigorous external guardrails or a robust system prompt to manage.

Use this model for long-context retrieval, autonomous agent workflows, and complex coding tasks. Skip this model if your application requires strict, out-of-the-box safety compliance or high-precision tabular data classification.

Strengths — Top 3

Structured Output5.0/5.0
Strategic Analysis5.0/5.0
Creative Problem Solving5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration1.0/5.0
Constrained Rewriting4.0/5.0
Classification4.0/5.0

Similar models

QQwen 3.7 Max$6.254.62GGemma 4 31B$0.3074.38OGPT-5$7.814.54GGemini 2.5 Pro$7.814.23