Skip to main content
Glama

gemini-3-1-pro vs gemini-3.1-flash-lite-preview

Pricing, Performance & Features Comparison

Authorgoogle
Context Length1M
Reasoning
Providers1
ReleasedFeb 2026
Knowledge CutoffJan 2025
License

Gemini 3.1 Pro is the next iteration in the Gemini 3 series of highly capable, natively multimodal reasoning models. As Google's most advanced model for complex tasks, it can comprehend vast datasets and challenging problems from sources including text, audio, images, video, and entire code repositories with a 1M token context window. Built to refine the performance and reliability of the Gemini 3 Pro series, it provides better thinking, improved token efficiency, and a more grounded, factually consistent experience.

Input$2
Output$12
Latency (p50)5.9s
Output Limit66K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$2out$12
Latency (24h)
Success Rate (24h)
Authorgoogle
Context Length1M
Reasoning
Providers1
ReleasedFeb 2025
Knowledge CutoffJun 2024
License

Gemini 2.0 Flash-Lite is Google's most cost-efficient Gemini 2.0 model, optimized for cost efficiency and low latency. It is a multimodal model supporting inputs including text, code, images, audio, and video, with text-only output. It features a maximum input capacity of 1,048,576 tokens and is cost-optimized for large-scale text output use cases.

Input$0.25
Output$1.5
Latency (p50)2.3s
Output Limit8K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$0.25out$1.5cache$0.025
Latency (24h)
Success Rate (24h)