Skip to main content
Glama

kimi-latest-32k vs devstral-small-2507

Pricing, Performance & Features Comparison

Price unit:
Authormoonshot
Context Length33K
Reasoning
-
Providers1
ReleasedJul 2025
Knowledge Cutoff-
License-

Kimi-latest-32k is a multimodal large language model developed by Moonshot AI, capable of interpreting text, images, and code. It features a 32,768 token context window and supports image understanding, automatic context caching, and various functions like ToolCalls and web search.

Input$1
Output$3
Latency (p50)2.2s
Output Limit32K
Function Calling
JSON Mode
-
InputText, Image, Video
OutputText, Audio
in$1out$3cache$0.15-
Latency (24h)
Success Rate (24h)
Authormistral
Context Length128K
Reasoning
-
Providers1
ReleasedJul 2025
Knowledge Cutoff-
License-

Devstral-Small-2507 is an agentic Large Language Model (LLM) developed by Mistral AI and All Hands AI specifically for software engineering tasks. It excels at using tools to explore codebases, edit multiple files, and power software engineering agents. This model is notable for its remarkable performance on the SWE-bench benchmark, where it is positioned as the #1 open-source model.

Input$0.1
Output$0.3
Latency (p50)639ms
Output Limit128K
Function Calling
JSON Mode
InputText
OutputText
in$0.1out$0.3--
Latency (24h)
Success Rate (24h)