kimi-latest-128k vs grok-4-0709

Pricing, Performance & Features Comparison

kimi-latest-128k

Authormoonshot

Context Length128K

Reasoning

Providers1

ReleasedJul 2025

Knowledge Cutoff-

License-

Kimi-latest-128k refers to the Kimi K2 model, a state-of-the-art Mixture-of-Experts (MoE) language model with 32 billion activated and 1 trillion total parameters. It features a 128K context length and is meticulously optimized for agentic capabilities, specifically designed for tool use, reasoning, and autonomous problem-solving.

Input$2

Output$5

Latency (p50)1.5s

Output Limit128K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText, Audio

moonshot

in$2out$5cache$0.15-

Latency (24h)

Success Rate (24h)

grok-4-0709

Authorxai

Context Length256K

Reasoning

Providers1

ReleasedJul 2025

Knowledge CutoffJul 2025

License-

Our latest and greatest flagship model, offering unparalleled performance in natural language, math and reasoning - the perfect jack of all trades.

Input$3

Output$15

Latency (p50)28s

Output Limit256K

Function Calling

JSON Mode

InputText

OutputText

xai

in$3out$15cache$0.75-