Glama

kimi-k2.6 vs glm-5.1

Pricing, Performance & Features Comparison

kimi-k2.6
Author: moonshot
Context Length: 262K
Reasoning
Providers: 1
Released: Apr 2026
Knowledge Cutoff: Apr 2025
License: MIT License

Mixture-of-Experts model with 1T total parameters and 32B activated per token. Features MLA attention, MoonViT vision encoder, and agent swarm orchestration.

Input Price: $0.95
Output Price: $4
Latency (p50): 5.4s
Output Limit: 66K
Function Calling
JSON Mode
Input Modalities: Text, Image, Video
Output Modalities: Text
Pricing: in $0.95, out $4, cache $0.16, write $0.95
glm-5.1
Author: zai
Context Length: 200K
Reasoning
Providers: 1
Released: Apr 2026
Knowledge Cutoff: -
License: MIT License

Post-training upgrade to GLM-5. Mixture-of-Experts model with 744B total parameters and 40B activated per token. Trained on Huawei Ascend 910B chips with enhanced RL for agentic capabilities.

Input Price: $1.4
Output Price: $4.4
Latency (p50): 6.1s
Output Limit: 131K
Function Calling
JSON Mode
Input Modalities: Text
Output Modalities: Text
Pricing: in $1.4, out $4.4, cache $0.26, write $1.4
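
To make the price difference concrete, the following is a minimal sketch that estimates the cost of a single request from the input and output prices listed for each model. It assumes the listed prices are in USD per 1M tokens, which the page does not state. The names `Pricing`, `request_cost`, and `TOKENS_PER_UNIT` are hypothetical and exist only for this illustration. Cache and write prices are left out to keep the arithmetic simple.

```python
from dataclasses import dataclass

# Assumption: the listed prices are USD per 1M tokens (not stated on the page).
TOKENS_PER_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Input and output prices for one model (hypothetical helper type)."""
    input_usd: float
    output_usd: float


# Figures copied from the listing; cache and write prices are ignored here.
KIMI_K2_6 = Pricing(input_usd=0.95, output_usd=4.0)
GLM_5_1 = Pricing(input_usd=1.4, output_usd=4.4)


def request_cost(p: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request with no cache hits."""
    total = input_tokens * p.input_usd + output_tokens * p.output_usd
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example workload: 100K input tokens and 10K output tokens.
    for name, pricing in (("kimi-k2.6", KIMI_K2_6), ("glm-5.1", GLM_5_1)):
        print(f"{name}: ${request_cost(pricing, 100_000, 10_000):.4f}")
```

Under that per-1M-token assumption, a request with 100K input tokens and 10K output tokens comes to roughly $0.135 on kimi-k2.6 and $0.184 on glm-5.1.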