gemini-2.0-flash-lite-preview-02-05 vs o3-mini-2025-01-31
Pricing, Performance & Features Comparison
Gemini 2.0 Flash-Lite delivers better quality than Gemini 1.5 Flash at the same speed and cost. It has a 1M token context window and multimodal input.
Input$0.075
Output$0.3
Latency (p50)–
Output Limit8K
Function Calling
-
JSON Mode
InputText, Image
OutputText
google-vertexCheapest
in$0.075out$0.3––
google-ai-studioCheapest
in$0.075out$0.3––
o3-mini is OpenAI's most recent small reasoning model, providing high intelligence at the same cost and latency targets of o1-mini. o3-mini also supports key developer features, like Structured Outputs, function calling, Batch API, and more. Like other models in the o-series, it is designed to excel at science, math, and coding tasks.
Input$1
Output$4.4
Latency (p50)2.2s
Output Limit100K
Function Calling
JSON Mode
-
InputText
OutputText
in$1out$4.4cache$0.55–