o3-mini-high vs gemini-2.0-flash-lite-preview-02-05
Pricing, Performance & Features Comparison
o3-mini with reasoning effort set to high.
Input: $1
Output: $4.4
Latency (p50): 3.7s
Output Limit: 100K
Function Calling
JSON Mode: -
Input: Text
Output: Text
in: $1 | out: $4.4 | cache: $0.55 | –
[Charts: Latency (24h), Success Rate (24h)]
Gemini 2.0 Flash-Lite delivers better quality than Gemini 1.5 Flash at the same speed and cost. It has a 1M token context window and multimodal input.
Input: $0.075
Output: $0.3
Latency (p50): –
Output Limit: 8K
Function Calling: -
JSON Mode
Input: Text, Image
Output: Text
google-vertex (Cheapest)
in: $0.075 | out: $0.3 | cache: – | –
google-ai-studio (Cheapest)
in: $0.075 | out: $0.3 | cache: – | –
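
To make the price gap concrete, the listed rates can be turned into a rough per-request estimate. The sketch below is illustrative only: it assumes the prices are in USD per 1M tokens (the listing does not state the unit), it ignores cached-input pricing, and the `estimate_cost` helper and the token counts are hypothetical.

```python
# Prices copied from the listing above.
# Assumption: USD per 1M tokens (the listing does not state the unit).
PRICES_USD = {
    "o3-mini-high": {"input": 1.0, "output": 4.4},
    "gemini-2.0-flash-lite-preview-02-05": {"input": 0.075, "output": 0.3},
}

TOKENS_PER_UNIT = 1_000_000  # assumed billing unit


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given model.

    Hypothetical helper: cached-input discounts are not modeled.
    """
    price = PRICES_USD[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example workload (made-up numbers): 10K input tokens, 2K output tokens.
    for name in PRICES_USD:
        cost = estimate_cost(name, input_tokens=10_000, output_tokens=2_000)
        print(f"{name}: ${cost:.6f}")
```

Under these assumptions the estimate scales linearly with token counts, so the ratio between the two models depends only on the input/output mix of the workload, not on its size.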