gemini-3-1-pro vs gemini-3.1-flash-lite-preview

Pricing, Performance & Features Comparison

gemini-3-1-pro

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedFeb 2026

Knowledge CutoffJan 2025

License–

Gemini 3.1 Pro is the next iteration in the Gemini 3 series of highly capable, natively multimodal reasoning models. As Google's most advanced model for complex tasks, it can comprehend vast datasets and challenging problems from sources including text, audio, images, video, and entire code repositories with a 1M token context window. Built to refine the performance and reliability of the Gemini 3 Pro series, it provides better thinking, improved token efficiency, and a more grounded, factually consistent experience.

Input$2

Output$12

Latency (p50)5.9s

Output Limit66K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

google-vertex

in$2out$12––

Latency (24h)

Success Rate (24h)

gemini-3.1-flash-lite-preview

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedFeb 2025

Knowledge CutoffJun 2024

License–

Gemini 2.0 Flash-Lite is Google's most cost-efficient Gemini 2.0 model, optimized for cost efficiency and low latency. It is a multimodal model supporting inputs including text, code, images, audio, and video, with text-only output. It features a maximum input capacity of 1,048,576 tokens and is cost-optimized for large-scale text output use cases.

Input$0.25

Output$1.5

Latency (p50)2.3s

Output Limit8K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

google-vertex

in$0.25out$1.5cache$0.025–