gemini-2.0-flash-thinking-exp-01-21 vs o3-mini-high

Pricing, Performance & Features Comparison

gemini-2.0-flash-thinking-exp-01-21

Author: google

Context Length: 32K

Reasoning

Providers: 1

Released: Jan 2025

Knowledge Cutoff: -

License: Creative Commons Attribution 4.0 International

Gemini 2.0 Flash Thinking Mode is an experimental multimodal LLM that exposes its thought process alongside its standard response, which is intended to strengthen its reasoning. It accepts text and image inputs and produces text-only output, with a larger token context than previous Gemini versions.

Input: $0.15

Output: $0.6

Latency (p50): -

Output Limit: 8K

Function Calling

JSON Mode

Input: Text, Image

Output: Text

google-vertex

in $0.15, out $0.6

o3-mini-high

Author: openai

Context Length: 200K

Reasoning

Providers: 1

Released: Jan 2025

Knowledge Cutoff: -

License: -

o3-mini with reasoning effort set to high.

Input: $1

Output: $4.4

Latency (p50): 4.1s

Output Limit: 100K

Function Calling

JSON Mode

Input: Text

Output: Text

openai

in $1, out $4.4, cache $0.55
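
To make the price gap concrete, the following is a minimal sketch that estimates the cost of a single request for each model from the listed input and output prices. The listing does not state a billing unit; the code assumes prices are per 1M tokens, and the `PRICES` table, `TOKENS_PER_UNIT` constant, `estimate_cost` helper, and the sample token counts are all hypothetical names and values chosen for illustration.

```python
# Prices copied from the listings above.
# ASSUMPTION: the billing unit is USD per 1M tokens (the page does not say).
PRICES = {
    "gemini-2.0-flash-thinking-exp-01-21": {"input": 0.15, "output": 0.6},
    "o3-mini-high": {"input": 1.0, "output": 4.4},
}

TOKENS_PER_UNIT = 1_000_000  # assumed billing unit, not stated in the source


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Sample workload: 20K input tokens and 4K output tokens per request,
    # which fits inside both models' context and output limits.
    for model in PRICES:
        cost = estimate_cost(model, input_tokens=20_000, output_tokens=4_000)
        print(f"{model}: ${cost:.4f}")
```

Under these assumptions, the same request costs roughly seven times more on o3-mini-high than on gemini-2.0-flash-thinking-exp-01-21. Cached-input pricing and any tokens spent on reasoning are not modeled here.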

Charts (gemini-2.0-flash-thinking-exp-01-21 vs o3-mini-high): Latency (24h), Success Rate (24h)