gemma-2-9b-it vs devstral-medium-2507

Pricing, Performance & Features Comparison

gemma-2-9b-it

Authorgoogle

Context Length4K

Reasoning

Providers1

ReleasedJun 2024

Knowledge Cutoff-

License-

google/gemma-2-9b-it is a text-to-text, decoder-only large language model optimized for tasks such as question answering, summarization, and reasoning. It was trained on 8 trillion tokens with 9 billion parameters, aiming for efficient inference and strong performance despite its size. Its design supports flexible deployment for applications like chatbots, code generation, and research in NLP.

Input$0.00

Output$0.00

Latency (p50)-

Output Limit2K

Function Calling

JSON Mode

InputText

OutputText

deepinfra

in$0.00out$0.00--

devstral-medium-2507

Authormistral

Context Length128K

Reasoning

Providers1

ReleasedJul 2024

Knowledge Cutoff-

License-

Devstral Medium 2507 is a high-performance, code-centric large language model designed for agentic coding capabilities and enterprise use. It features a 128k token context window and achieves a 61.6% score on SWE-Bench Verified, outperforming several commercial models like Gemini 2.5 Pro and GPT-4.1. The model excels at code generation, multi-file editing, and powering software engineering agents with structured outputs and tool integration.

Input$0.4

Output$2

Latency (p50)893ms

Output Limit128K

Function Calling

JSON Mode

InputText

OutputText

mistral

in$0.4out$2--

gemma-2-9b-it vs devstral-medium-2507

Latency (24h)

Success Rate (24h)