Skip to main content
Glama

kimi-latest-32k vs grok-4-0709

Pricing, Performance & Features Comparison

Authormoonshot
Context Length33K
Reasoning
-
Providers1
ReleasedJul 2025
Knowledge Cutoff
License

Kimi-latest-32k is a multimodal large language model developed by Moonshot AI, capable of interpreting text, images, and code. It features a 32,768 token context window and supports image understanding, automatic context caching, and various functions like ToolCalls and web search.

Input$1
Output$3
Latency (p50)2.3s
Output Limit32K
Function Calling
JSON Mode
-
InputText, Image, Video
OutputText, Audio
in$1out$3cache$0.15
Latency (24h)
Success Rate (24h)
Authorxai
Context Length256K
Reasoning
Providers1
ReleasedJul 2025
Knowledge CutoffJul 2025
License

Our latest and greatest flagship model, offering unparalleled performance in natural language, math and reasoning - the perfect jack of all trades.

Input$3
Output$15
Latency (p50)3.5s
Output Limit256K
Function Calling
JSON Mode
-
InputText
OutputText
in$3out$15cache$0.75
Latency (24h)
Success Rate (24h)