Glama

kimi-k2-0711-preview vs kimi-latest-32k

Pricing, Performance & Features Comparison

kimi-k2-0711-preview
Author: moonshot
Context Length: 128K
Reasoning: -
Providers: 1
Released: Jul 2025
Knowledge Cutoff: -
License: -

Kimi-k2-0711-preview is a version of the Kimi K2 language model developed by Moonshot AI. It is a mixture-of-experts model with 32 billion activated parameters out of 1 trillion total parameters, optimized for agentic tasks: acting, executing, and reasoning through complex, tool-driven processes. The model is designed for general-purpose chat and autonomous task execution, with enhanced coding capabilities.
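To put the two parameter counts above in proportion, the short sketch below computes the share of parameters that are activated. The constant and function names are hypothetical, chosen for illustration; only the two figures come from the description.

```python
# Figures taken from the model description above.
TOTAL_PARAMS = 1_000_000_000_000  # 1 trillion total parameters
ACTIVE_PARAMS = 32_000_000_000    # 32 billion activated parameters


def active_fraction(active: int, total: int) -> float:
    """Return the share of parameters that are activated (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"{fraction:.1%} of the parameters are activated")
```

In a mixture-of-experts model only a subset of the weights takes part in each forward pass, so compute cost tends to track the activated count rather than the total.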

Input price: $0.6
Output price: $2.5
Latency (p50): 1.7s
Output Limit: 128K
Function Calling
JSON Mode
Input modalities: Text
Output modalities: Text, Image
in: $0.6, out: $2.5, cache: $0.15, -
Latency (24h)
Success Rate (24h)
kimi-latest-32k
Author: moonshot
Context Length: 33K
Reasoning: -
Providers: 1
Released: Jul 2025
Knowledge Cutoff: -
License: -

Kimi-latest-32k is a multimodal large language model developed by Moonshot AI that can interpret text, images, and code. It has a 32,768-token context window and supports image understanding, automatic context caching, and features such as tool calls (ToolCalls) and web search.
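A 32,768-token window is easy to exceed with long prompts, so a pre-flight check can be useful. The sketch below is a minimal illustration that assumes prompt tokens and generated tokens share one context window, which this page does not confirm. The function name is hypothetical, and token counts are expected to come from whatever tokenizer the caller uses.

```python
CONTEXT_WINDOW = 32_768  # tokens, from the kimi-latest-32k description above


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 context_window: int = CONTEXT_WINDOW) -> bool:
    """Check whether a request fits in the context window.

    ASSUMPTION: the prompt and the generated output share one window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= context_window


def max_output_budget(prompt_tokens: int,
                      context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the reply after the prompt (never negative)."""
    return max(0, context_window - prompt_tokens)


print(fits_context(prompt_tokens=30_000, max_output_tokens=2_000))
print(max_output_budget(prompt_tokens=30_000))
```

If a request does not fit, the usual options are to trim the prompt, lower the output cap, or switch to a model with a longer context window such as the 128K one compared here.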

Input price: $1
Output price: $3
Latency (p50): 2.2s
Output Limit: 32K
Function Calling
JSON Mode: -
Input modalities: Text, Image, Video
Output modalities: Text, Audio
in: $1, out: $3, cache: $0.15, -
Latency (24h)
Success Rate (24h)
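To compare the two price lists on a concrete workload, the sketch below estimates the cost of a single request for each model. The page does not state the price unit, so the code assumes prices are in USD per 1 million tokens and that the cache price applies to input tokens served from the context cache. Both are assumptions, and all names in the code are hypothetical.

```python
from dataclasses import dataclass

# ASSUMPTION: listed prices are USD per 1,000,000 tokens (unit not stated on the page).
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    input_usd: float   # price for uncached input tokens
    output_usd: float  # price for generated tokens
    cache_usd: float   # ASSUMPTION: price for cached input tokens


# Prices copied from the comparison above.
PRICING = {
    "kimi-k2-0711-preview": Pricing(input_usd=0.6, output_usd=2.5, cache_usd=0.15),
    "kimi-latest-32k": Pricing(input_usd=1.0, output_usd=3.0, cache_usd=0.15),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request under the assumptions above."""
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICING[model]
    uncached_tokens = input_tokens - cached_tokens
    total = (uncached_tokens * price.input_usd
             + cached_tokens * price.cache_usd
             + output_tokens * price.output_usd)
    return total / TOKENS_PER_PRICE_UNIT


for name in PRICING:
    cost = estimate_cost(name, input_tokens=10_000, output_tokens=2_000)
    print(f"{name}: ${cost:.4f}")
```

Under these assumptions kimi-k2-0711-preview is cheaper on both input and output tokens, while the two models list the same cache price.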