Skip to main content
Glama

glm-4.7 vs glm-4.7-flash

Pricing, Performance & Features Comparison

Price unit:
Authorzai
Context Length200K
Reasoning
-
Providers1
ReleasedDec 2025
Knowledge Cutoff-
License-

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Input$0.6
Output$2.2
Latency (p50)11.7s
Output Limit128K
Function Calling
-
JSON Mode
-
Input-
Output-
in$0.6out$2.2cache$0.11-
Latency (24h)
Success Rate (24h)
Authorzai
Context Length200K
Reasoning
-
Providers1
ReleasedJan 2026
Knowledge Cutoff-
License-

GLM-4.7-Flash is a 30B Mixture-of-Experts (MoE) reasoning model with approximately 3.6B active parameters, designed for local deployment with best-in-class performance for coding, agentic workflows, and chat. It supports a 200K context window and achieves open-source state-of-the-art scores on benchmarks like SWE-bench Verified and τ²-Bench, excelling particularly in frontend and backend development capabilities.

Input$0.07
Output$0.4
Latency (p50)8.2s
Output Limit131K
Function Calling
JSON Mode
-
InputText
OutputText
in$0.07out$0.4-write$0.01
Latency (24h)
Success Rate (24h)