Skip to main content
Glama

kimi-latest-8k vs grok-4-0709

Pricing, Performance & Features Comparison

Authormoonshot
Context Length8K
Reasoning
-
Providers1
ReleasedJul 2025
Knowledge Cutoff
License

Kimi-latest-8k is a variant of the Kimi K2 model series, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. It is designed for frontier knowledge, reasoning, and coding tasks while being optimized for agentic capabilities including tool use and autonomous problem-solving.

Input$0.2
Output$2
Latency (p50)2.2s
Output Limit8K
Function Calling
JSON Mode
-
InputText, Image, Audio, Video
OutputText
in$0.2out$2cache$0.15
Latency (24h)
Success Rate (24h)
Authorxai
Context Length256K
Reasoning
Providers1
ReleasedJul 2025
Knowledge CutoffJul 2025
License

Our latest and greatest flagship model, offering unparalleled performance in natural language, math and reasoning - the perfect jack of all trades.

Input$3
Output$15
Latency (p50)3.5s
Output Limit256K
Function Calling
JSON Mode
-
InputText
OutputText
in$3out$15cache$0.75
Latency (24h)
Success Rate (24h)