Glama

gemini-2.5-flash vs kimi-k2-0711-preview

Pricing, Performance & Features Comparison

Author: google
Context Length: 1M
Reasoning: -
Providers: 1
Released: Jun 2025
Knowledge Cutoff: Jan 2025
License: -

Google describes Gemini 2.5 Flash as its best model in terms of price and performance, offering well-rounded capabilities.

Input Price: $0.15
Output Price: $0.6
Latency (p50): 1.5s
Output Limit: 66K
Function Calling
JSON Mode
-
Input Modalities: -
Output Modalities: -
in $0.15, out $0.6, -, -
[Charts: Latency (24h), Success Rate (24h)]
Author: moonshot
Context Length: 128K
Reasoning: -
Providers: 1
Released: Jul 2025
Knowledge Cutoff: -
License: -

Kimi-k2-0711-preview is a version of the Kimi K2 language model developed by Moonshot AI. It is a mixture-of-experts model with 1 trillion total parameters, of which 32 billion are activated per token. It is optimized for agentic tasks: acting, executing, and reasoning through complex, tool-driven processes. The model is designed for general-purpose chat and autonomous task execution, with enhanced coding capabilities.

Input Price: $0.6
Output Price: $2.5
Latency (p50): 1.7s
Output Limit: 128K
Function Calling
JSON Mode
Input Modalities: Text
Output Modalities: Text, Image
in $0.6, out $2.5, cache $0.15, -
[Charts: Latency (24h), Success Rate (24h)]
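
To make the price gap concrete, the listed input and output prices can be turned into a rough per-request cost estimate. The sketch below is illustrative only. It assumes the prices are USD per 1,000,000 tokens (the page's price unit is not shown here), and `estimate_cost` is a hypothetical helper, not part of any provider SDK. The cached-input price listed for kimi-k2-0711-preview is not modeled.

```python
# Listed prices from the comparison above.
# ASSUMPTION: prices are USD per 1,000,000 tokens; the price unit is not
# stated in the listing, so adjust TOKENS_PER_UNIT if it differs.
PRICES = {
    "gemini-2.5-flash": {"input": 0.15, "output": 0.60},
    "kimi-k2-0711-preview": {"input": 0.60, "output": 2.50},
}
TOKENS_PER_UNIT = 1_000_000  # assumed price unit


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example workload: 10,000 prompt tokens and 2,000 completion tokens.
    for model in PRICES:
        cost = estimate_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model}: ${cost:.4f}")
```

Under that unit assumption, a request with 10,000 input tokens and 2,000 output tokens comes to roughly $0.0027 on gemini-2.5-flash and $0.0110 on kimi-k2-0711-preview, about a fourfold difference.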