gemini-2.5-flash vs kimi-latest-8k

Pricing, Performance & Features Comparison

gemini-2.5-flash

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedJun 2025

Knowledge CutoffJan 2025

License-

Gemini 2.5 Flash is our best model in terms of price and performance, and offers well-rounded capabilities.

Input$0.15

Output$0.6

Latency (p50)1.1s

Output Limit66K

Function Calling

JSON Mode

Input-

Output-

google-vertex

in$0.15out$0.6--

Latency (24h)

Success Rate (24h)

kimi-latest-8k

Authormoonshot

Context Length8K

Reasoning

Providers1

ReleasedJul 2025

Knowledge Cutoff-

License-

Kimi-latest-8k is a variant of the Kimi K2 model series, a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. It is designed for frontier knowledge, reasoning, and coding tasks while being optimized for agentic capabilities including tool use and autonomous problem-solving.

Input$0.2

Output$2

Latency (p50)1.4s

Output Limit8K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

moonshot

in$0.2out$2cache$0.15-