Skip to main content
Glama

gemini-2.0-flash-thinking-exp-01-21 vs o3-mini-high

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length32K
Reasoning
-
Providers1
ReleasedJan 2025
Knowledge Cutoff-
LicenseCreative Commons Attribution 4.0 International

Gemini 2.0 Flash Thinking Mode is an experimental multimodal LLM that provides not only standard responses but also exposes its thought process, enhancing reasoning capabilities. It supports text and image inputs and outputs text-only responses, with a larger token context than previous Gemini versions.

Input$0.15
Output$0.6
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
-
InputText, Image
OutputText
in$0.15out$0.6--
Authoropenai
Context Length200K
Reasoning
-
Providers1
ReleasedJan 2025
Knowledge Cutoff-
License-

03-mini with reasoning effort set to high.

Input$1
Output$4.4
Latency (p50)4s
Output Limit100K
Function Calling
JSON Mode
-
InputText
OutputText
in$1out$4.4cache$0.55-
Latency (24h)
Success Rate (24h)