gemini-2.0-flash-001 vs gemini-2.0-flash-thinking-exp-1219

Pricing, Performance & Features Comparison

gemini-2.0-flash-001

Authorgoogle

Context Length1M

Reasoning

Providers2

ReleasedDec 2024

Knowledge CutoffJun 2024

License-

Workhorse model for all daily tasks. Strong overall performance.

Input$0.1

Output$0.4

Latency (p50)2s

Output Limit8K

Function Calling

JSON Mode

InputText, Image

OutputText

google-ai-studio

Cheapest

in$0.1out$0.4--

google-vertex

in$0.15out$0.6--

Latency (24h)

Success Rate (24h)

gemini-2.0-flash-thinking-exp-1219

Authorgoogle

Context Length32K

Reasoning

Providers1

ReleasedDec 2024

Knowledge Cutoff-

LicenseCreative Commons Attribution 4.0 International

Superseded Bygemini-2.0-flash-thinking

Gemini 2.0 Flash Thinking Mode is an experimental multimodal LLM that provides not only standard responses but also exposes its thought process, enhancing reasoning capabilities. It supports text and image inputs and outputs text-only responses, with a larger token context than previous Gemini versions.

Input$0.00

Output$0.00

Latency (p50)-