Zhipu AI · active

GLM-4.7 Flash

Zhipu AI's efficiency model. Context window: 203K tokens.

Overall score: 3.85/5.00 · ranked #54
Input: $0.060 per 1M tokens
Output: $0.400 per 1M tokens
Context: 203K tokens
Blended: $0.315 per 1M tokens (3:1 out:in ratio)

Scores by test

Structured Output: 5.0
Strategic Analysis: (no score shown)
Constrained Rewriting: 5.0
Creative Problem Solving: (no score shown)
Tool Calling: 5.0
Faithfulness: 5.0
Classification: 4.0
Long Context: 5.0
Safety Calibration: 4.0
Persona Consistency: 5.0
Agentic Planning: 4.0
Multilingual: 5.0
Tabular Data: 3.0
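
The overall score of 3.85 is lower than the per-test scores alone would suggest. This page does not say how the overall score is computed, so the Python sketch below only checks the arithmetic under one assumption: that a test shown without a score counts as 0 in the average. The `scores` dictionary and that zero-fill rule are illustrative, not the site's documented methodology.

```python
# Scores as listed under "Scores by test"; None marks a test shown without a score.
scores = {
    "Structured Output": 5.0,
    "Strategic Analysis": None,
    "Constrained Rewriting": 5.0,
    "Creative Problem Solving": None,
    "Tool Calling": 5.0,
    "Faithfulness": 5.0,
    "Classification": 4.0,
    "Long Context": 5.0,
    "Safety Calibration": 4.0,
    "Persona Consistency": 5.0,
    "Agentic Planning": 4.0,
    "Multilingual": 5.0,
    "Tabular Data": 3.0,
}

# Average over only the tests that display a score.
scored = [s for s in scores.values() if s is not None]
mean_scored = sum(scored) / len(scored)

# ASSUMPTION: an unscored test contributes 0 to the overall average.
mean_all = sum(scored) / len(scores)

print(f"mean over {len(scored)} scored tests: {mean_scored:.2f}")
print(f"mean over all {len(scores)} tests, unscored as 0: {mean_all:.2f}")
```

Under that assumption the average over all 13 tests rounds to 3.85, which matches the listed overall score, while the average over the 11 scored tests is 4.55. This agreement may be coincidental; the site's methodology page is the authority.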

What you need to know

GLM-4.7 Flash is an efficiency-focused model that pairs a 203K-token context window with precise adherence to formatting constraints. It scores 5.0 on seven of the eleven tests with a listed score, including Structured Output, Constrained Rewriting, and Long Context, which makes it well suited to rewriting tasks that must hold specific structural rules across long documents.

Economically, this model is positioned as a high-value option for developers. At a blended cost of $0.315 per million tokens, it is priced like smaller flash models while posting an overall score of 3.85/5.00, ranked #54 among 76 evaluated models. The low input cost of $0.060 per million tokens keeps it affordable for the large inputs its context window allows.
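
The blended figure can be reproduced from the listed input and output prices. The sketch below assumes that "3:1 out:in ratio" means three output tokens are weighted for every input token, and that "203K" means 203,000 tokens. The function names `blended_price` and `input_cost` are illustrative and not part of any API.

```python
def blended_price(input_price: float, output_price: float, out_to_in: float = 3.0) -> float:
    """Weighted average price per 1M tokens, with `out_to_in` output tokens per input token."""
    return (out_to_in * output_price + input_price) / (out_to_in + 1.0)


def input_cost(tokens: int, input_price: float) -> float:
    """Dollar cost of sending `tokens` input tokens at `input_price` dollars per 1M tokens."""
    return tokens / 1_000_000 * input_price


# Listed prices for GLM-4.7 Flash, in dollars per 1M tokens.
INPUT_PRICE = 0.060
OUTPUT_PRICE = 0.400

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)

# ASSUMPTION: "203K" is taken as 203,000 tokens.
full_context = input_cost(203_000, INPUT_PRICE)

print(f"blended: ${blended:.3f} per 1M tokens")
print(f"one full 203K-token prompt: ${full_context:.4f}")
```

With these inputs the blended price comes out at $0.315 per 1M tokens, matching the listed value, and a single prompt that fills the context window costs roughly 1.2 cents in input tokens.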

Use this model if you need to process extensive documentation or need consistently well-formed output without sacrificing general reasoning quality. Skip this model if you require an open-weight solution for local deployment, as it is a proprietary offering from Zhipu AI.

Strengths — Top 3

Structured Output: 5.0/5.0
Constrained Rewriting: 5.0/5.0
Tool Calling: 5.0/5.0

Relative weaknesses — Bottom 3

Tabular Data: 3.0/5.0
Classification: 4.0/5.0
Safety Calibration: 4.0/5.0

Similar models

Grok 3 Mini · $0.450 · 3.77
Qwen: Qwen3 Coder 30B A3B Instruct · $0.220 · 3.23
Gemini 2.5 Flash · $1.95 · 4.15
Gemini 2.5 Flash Lite · $0.325 · 3.92