Skip to main content
Glama

gemini-3-flash vs kimi-k2.5

Pricing, Performance & Features Comparison

Authorgoogle
Context Length1M
Reasoning
Providers1
ReleasedDec 2025
Knowledge CutoffJan 2025
License-

Gemini 3 Flash combines Gemini 3 Pro's reasoning capabilities with the Flash line's levels on latency, efficiency, and cost. It not only enables everyday tasks with improved reasoning, but is designed to tackle the most complex agentic workflows.

Input$0.5
Output$3
Latency (p50)6.6s
Output Limit66K
Function Calling
JSON Mode
-
Input-
Output-
in$0.5out$3--
Latency (24h)
Success Rate (24h)
Authormoonshot
Context Length262K
Reasoning
-
Providers1
ReleasedJan 2026
Knowledge CutoffApr 2024
License-

Kimi K2.5 is Moonshot's most intelligent and versatile model to date, featuring a native multimodal architecture that supports both visual and text input alongside thinking and non-thinking modes. It achieves state-of-the-art performance in coding, reasoning, and Agent tasks, utilizing a 256K context window to solve complex logical and mathematical problems.

Input$0.6
Output$3
Latency (p50)3.7s
Output Limit96K
Function Calling
JSON Mode
-
InputText, Image, Video
OutputText
in$0.6out$3cache$0.1-
Latency (24h)
Success Rate (24h)