glm-5 vs kimi-k2.5

Pricing, Performance & Features Comparison

glm-5

Author: zai

Context Length: 200K

Reasoning

Providers: 1

Released: Feb 2026

Knowledge Cutoff: -

License: -

GLM-5 is a mixture-of-experts language model from Z.ai with 744 billion total parameters and 40 billion active parameters, designed for complex systems engineering and long-horizon agentic tasks. It utilizes DeepSeek Sparse Attention (DSA) to reduce deployment costs while maintaining long-context capacity, and achieves best-in-class performance among open-source models in reasoning, coding, and agentic tasks.

Input: $1

Output: $3.2

Latency (p50): 5.2s

Output Limit: 131K

Function Calling

JSON Mode

Input: Text

Output: Text

zai

in: $1 | out: $3.2 | cache: $0.2 | write: -

Latency (24h)

Success Rate (24h)

kimi-k2.5

Author: moonshot

Context Length: 262K

Reasoning

Providers: 2

Released: Jan 2026

Knowledge Cutoff: Apr 2024

License: -

Kimi K2.5 is Moonshot's most intelligent and versatile model to date, featuring a native multimodal architecture that supports both visual and text input alongside thinking and non-thinking modes. It achieves state-of-the-art performance in coding, reasoning, and agentic tasks, and uses a 256K context window to solve complex logical and mathematical problems.

Input: $0.45

Output: $2.3

Latency (p50): -

Output Limit: 96K

Function Calling

JSON Mode

Input: Text, Image, Video

Output: Text

deepinfra (cheapest)

in: $0.45 | out: $2.3 | cache: $0.07 | write: $0.07

moonshot

in: $0.6 | out: $3 | cache: $0.1 | write: -
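
To compare the listed input and output prices for a concrete workload, the arithmetic can be sketched as below. This is a rough illustration, not an official calculator: it assumes the prices are in USD per 1 million tokens (the page does not state the unit), it ignores the cache and write prices, and the names `Price`, `estimate_cost`, and `cheapest` are hypothetical helpers introduced here.

```python
from dataclasses import dataclass

# Assumption: listed prices are USD per 1,000,000 tokens (unit not stated in the source).
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Price:
    """Input/output price for one model served by one provider."""
    provider: str
    model: str
    input_usd: float
    output_usd: float


# Values copied from the provider rows of the comparison.
PRICES = [
    Price("zai", "glm-5", 1.00, 3.20),
    Price("deepinfra", "kimi-k2.5", 0.45, 2.30),
    Price("moonshot", "kimi-k2.5", 0.60, 3.00),
]


def estimate_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request, ignoring cache pricing."""
    total = input_tokens * price.input_usd + output_tokens * price.output_usd
    return total / TOKENS_PER_PRICE_UNIT


def cheapest(prices: list[Price], input_tokens: int, output_tokens: int) -> Price:
    """Return the entry with the lowest estimated cost for the given workload."""
    return min(prices, key=lambda p: estimate_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 100K input tokens and 10K output tokens per request.
    for p in PRICES:
        cost = estimate_cost(p, 100_000, 10_000)
        print(f"{p.model} via {p.provider}: ${cost:.4f}")
    best = cheapest(PRICES, 100_000, 10_000)
    print(f"lowest estimate: {best.model} via {best.provider}")
```

Because input and output tokens are priced differently, the ranking can in principle depend on the input/output mix, so it is worth running the estimate with token counts that match the actual workload.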