models/google/gemma-4-26b-a4b-it
Google · active · free tier available

Gemma 4 26B A4B

Google's mid-tier model. Context window: 262K tokens.

Overall score: 4.23 / 5.00 · ranked #36
Input: $0.060 per 1M tokens
Output: $0.330 per 1M tokens
Context: 262K tokens
Blended: $0.263 per 1M tokens (3:1 out:in ratio)

modelpicker.ai · powered by live benchmark data

Scores by test

Structured Output: 5.0
Strategic Analysis: 5.0
Constrained Rewriting: 3.0
Creative Problem Solving: 4.0
Tool Calling: 5.0
Faithfulness: 5.0
Classification: 4.0
Long Context: 5.0
Safety Calibration: 1.0
Persona Consistency: 5.0
Agentic Planning: 4.0
Multilingual: 5.0
Tabular Data: 4.0
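
The overall score of 4.23 is consistent with a plain average of the 13 per-test scores. A minimal sketch, assuming an unweighted mean; the site's actual aggregation is not stated on this page and may weight tests differently, and the names `SCORES` and `overall_score` are illustrative only:

```python
from statistics import mean

# Per-test scores copied from the "Scores by test" list.
SCORES = {
    "Structured Output": 5.0,
    "Strategic Analysis": 5.0,
    "Constrained Rewriting": 3.0,
    "Creative Problem Solving": 4.0,
    "Tool Calling": 5.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 5.0,
    "Safety Calibration": 1.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Multilingual": 5.0,
    "Tabular Data": 4.0,
}


def overall_score(scores):
    """Unweighted mean of all test scores, rounded to two decimals.

    Assumption: equal weight per test. This is not confirmed by the page.
    """
    return round(mean(scores.values()), 2)


print(overall_score(SCORES))  # 4.23
```

Under this assumption the single 1.0 in Safety Calibration pulls the average down by roughly 0.3 points, which is why the overall score sits well below the model's many 5.0 results.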

What you need to know

Gemma 4 26B A4B stands out for reliability in structured data tasks and long-context processing. It scores 5.0/5.0 in structured output, faithfulness, and tool calling, which points to strong adherence to technical schemas. Its 262K-token context window is backed by a 5.0/5.0 long-context score, making it a viable option for analyzing large datasets or extensive codebases.

The model is priced competitively for its performance tier. At a blended cost of $0.263/MTok (3:1 output-to-input ratio), it delivers 5.0/5.0 scores in strategic analysis and multilingual support without the premium pricing of the top-ranked frontier models. It currently ranks 30th out of 71 models, placing it in the upper-middle tier of general utility.
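
The blended figure can be reproduced from the listed input and output prices. A small sketch, assuming "3:1 out:in" means three output tokens are weighted for every input token; the function names are illustrative, not part of any API:

```python
INPUT_PRICE = 0.060   # USD per 1M input tokens, as listed
OUTPUT_PRICE = 0.330  # USD per 1M output tokens, as listed


def blended_price(input_price, output_price, out_parts=3, in_parts=1):
    """Weighted average price per 1M tokens for a given out:in token ratio."""
    total_parts = out_parts + in_parts
    return (out_parts * output_price + in_parts * input_price) / total_parts


def request_cost(input_tokens, output_tokens):
    """Estimated USD cost of one request at the listed per-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


price = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"${price:.4f} per 1M tokens")  # $0.2625 per 1M tokens
```

The result, $0.2625, matches the listed $0.263 after rounding to three decimals. For long-document workloads the mix is usually input-heavy, so `request_cost` with real token counts gives a better estimate than the blended rate.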

The model's critical weakness is safety calibration, where it scores 1.0/5.0. This indicates a significant lack of built-in guardrails, so developers must add their own filtering and moderation layers. It also shows a moderate weakness in constrained rewriting (3.0/5.0), suggesting it may struggle with strict character or word-count limits.
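
Both weaknesses can be handled outside the model with a wrapper that checks inputs and outputs. The following is a minimal sketch under stated assumptions: `generate` stands in for whatever client call returns the model's text, and `is_allowed` stands in for a moderation check you supply. Both are hypothetical callables, not part of any real SDK.

```python
from typing import Callable, Optional


def within_limits(text: str,
                  max_words: Optional[int] = None,
                  max_chars: Optional[int] = None) -> bool:
    """Return True if text respects the optional word and character limits."""
    if max_words is not None and len(text.split()) > max_words:
        return False
    if max_chars is not None and len(text) > max_chars:
        return False
    return True


def guarded_generate(generate: Callable[[str], str],
                     is_allowed: Callable[[str], bool],
                     prompt: str,
                     max_words: Optional[int] = None,
                     max_chars: Optional[int] = None,
                     max_attempts: int = 3) -> Optional[str]:
    """Call the model, returning a reply only if it passes every check.

    Returns None when the prompt is rejected or no attempt passes.
    """
    if not is_allowed(prompt):  # screen the input before spending tokens
        return None
    for _ in range(max_attempts):
        reply = generate(prompt)
        # Screen the output and enforce the length limits in code,
        # rather than trusting the model to follow them.
        if is_allowed(reply) and within_limits(reply, max_words, max_chars):
            return reply
    return None
```

Returning `None` instead of a best-effort reply keeps the failure explicit, so the caller decides whether to fall back to another model, truncate, or surface an error.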

Use this model for complex tool-calling pipelines, structured data extraction, and long-document analysis where precision is prioritized over safety. Skip this model if your application requires native safety alignment or highly constrained creative rewriting.

Strengths — Top 3

Structured Output: 5.0/5.0
Strategic Analysis: 5.0/5.0
Tool Calling: 5.0/5.0

Relative weaknesses — Bottom 3

Safety Calibration: 1.0/5.0
Constrained Rewriting: 3.0/5.0
Creative Problem Solving: 4.0/5.0

Similar models

xAI: Grok Build 0.1 · $1.75 · 4.31
Gemini 3 Flash Preview · $2.38 · 4.46
Qwen: Qwen3 235B A22B Instruct 2507 · $0.093 · 4.08
Gemma 4 31B · $0.307 · 4.38