glm-5.1 vs gpt-5.4-mini-2026-03-17
Pricing, Performance & Features Comparison
Post-training upgrade to GLM-5. Mixture-of-Experts model with 744B total parameters and 40B activated per token. Trained on Huawei Ascend 910B chips with enhanced RL for agentic capabilities.
Input$1.4
Output$4.4
Latency (p50)6.1s
Output Limit131K
Function Calling
JSON Mode
InputText
OutputText
in$1.4out$4.4cache$0.26write$1.4
Latency (24h)
Success Rate (24h)
Input$0.75
Output$4.5
Latency (p50)2.7s
Output Limit128K
Function Calling
JSON Mode
-
InputText, Image
OutputText
in$0.75out$4.5cache$0.075-