Glama

kimi-latest-32k vs kimi-latest-8k

Pricing, Performance & Features Comparison

kimi-latest-32k
Author: moonshot
Context Length: 33K
Reasoning: -
Providers: 1
Released: Jul 2025
Knowledge Cutoff: -
License: -

Kimi-latest-32k is a multimodal large language model developed by Moonshot AI that can interpret text, images, and code. It has a 32,768-token context window and supports image understanding, automatic context caching, tool calls (ToolCalls), and web search.
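The 32,768-token window can be turned into a simple budgeting check. The sketch below is illustrative only: it assumes the window is shared between prompt and completion tokens (the page does not say this), and `max_output_budget` is a hypothetical helper name, not part of any Moonshot API.

```python
# Context window stated in the description above (32,768 tokens).
CONTEXT_WINDOW_32K = 32_768


def max_output_budget(prompt_tokens: int, context_window: int = CONTEXT_WINDOW_32K) -> int:
    """Return how many tokens remain for the completion.

    Assumption: prompt and completion tokens share one context window.
    Returns 0 when the prompt alone already fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    for prompt in (1_000, 30_000, 40_000):
        print(prompt, max_output_budget(prompt))
```

Passing `context_window=8_192` would apply the same check to an 8K-token window, if that is what the 8K figure on this page denotes.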

Input Price: $1
Output Price: $3
Latency (p50): 2.2s
Output Limit: 32K
Function Calling
JSON Mode
-
Input Modalities: Text, Image, Video
Output Modalities: Text, Audio
Pricing summary: in $1, out $3, cache $0.15
kimi-latest-8k
Author: moonshot
Context Length: 8K
Reasoning: -
Providers: 1
Released: Jul 2025
Knowledge Cutoff: -
License: -

Kimi-latest-8k is a variant of the Kimi K2 model series, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. It is designed for frontier knowledge, reasoning, and coding tasks while being optimized for agentic capabilities including tool use and autonomous problem-solving.

Input Price: $0.2
Output Price: $2
Latency (p50): 2s
Output Limit: 8K
Function Calling
JSON Mode
-
Input Modalities: Text, Image, Audio, Video
Output Modalities: Text
Pricing summary: in $0.2, out $2, cache $0.15
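To make the price difference concrete, the listed input and output prices can be plugged into a small cost estimate. This is a rough sketch, not an official calculator: the page's price unit was not captured, so the code assumes prices are in USD per 1,000,000 tokens, and `Pricing`, `PRICING`, and `request_cost` are hypothetical names introduced for illustration. Cache pricing is left out because the page does not say how it is applied.

```python
from dataclasses import dataclass

# ASSUMPTION: prices are USD per 1,000,000 tokens. The page's "Price unit"
# value is missing, so adjust this constant if the real unit differs.
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    input_price: float   # listed "Input" price
    output_price: float  # listed "Output" price


# Prices copied from the comparison above.
PRICING = {
    "kimi-latest-32k": Pricing(input_price=1.0, output_price=3.0),
    "kimi-latest-8k": Pricing(input_price=0.2, output_price=2.0),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under the assumed price unit."""
    p = PRICING[model]
    total = input_tokens * p.input_price + output_tokens * p.output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Same hypothetical workload priced on both models.
    for name in PRICING:
        print(name, request_cost(name, input_tokens=6_000, output_tokens=1_000))
```

Under that assumed unit, the gap between the two models is largest for input-heavy workloads, since the listed input prices differ by a factor of five while the output prices differ by a factor of 1.5.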