Skip to main content
Glama

gemini-2.0-flash-001 vs gemini-2.0-flash-exp

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length1M
Reasoning
-
Providers2
ReleasedDec 2024
Knowledge CutoffJun 2024
License-

Workhorse model for all daily tasks. Strong overall performance.

Input$0.1
Output$0.4
Latency (p50)956ms
Output Limit8K
Function Calling
-
JSON Mode
InputText, Image
OutputText
in$0.1out$0.4--
in$0.15out$0.6--
Latency (24h)
Success Rate (24h)
Authorgoogle
Context Length1M
Reasoning
-
Providers1
ReleasedDec 2024
Knowledge CutoffAug 2024
License-

Gemini 2.0 Flash-Exp is an experimental low-latency model from Google Vertex AI that supports multimodal inputs and outputs, including real-time vision, audio streaming, and text-to-speech. It provides improved performance over earlier Gemini releases, offering features such as bounding box detection, native image generation, and complex function calling. The model excels at agentic tasks and is suitable for scenarios requiring fast responses and versatile tool use.

Input$0.00
Output$0.00
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
-
InputText, Image, Audio, Video
OutputText, Image, Audio
in$0.00out$0.00--