deepseek-v4-pro vs gemini-3.1-flash-lite-preview

Pricing, Performance & Features Comparison

deepseek-v4-pro

Authordeepseek

Context Length1M

Reasoning

Providers1

ReleasedApr 2026

Knowledge Cutoff–

LicenseMIT License

Flagship Mixture-of-Experts model with 1.6T total parameters and 49B activated per token. Trained on 32T+ tokens with hybrid attention for efficient 1M context processing.

Input$1.7

Output$3.5

Latency (p50)3.7s

Output Limit384K

Function Calling

JSON Mode

InputText

OutputText

deepseek

in$1.7out$3.5cache$0.15write$1.7

Latency (24h)

Success Rate (24h)

gemini-3.1-flash-lite-preview

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedFeb 2025

Knowledge CutoffJun 2024

License–

Gemini 2.0 Flash-Lite is Google's most cost-efficient Gemini 2.0 model, optimized for cost efficiency and low latency. It is a multimodal model supporting inputs including text, code, images, audio, and video, with text-only output. It features a maximum input capacity of 1,048,576 tokens and is cost-optimized for large-scale text output use cases.

Input$0.25

Output$1.5

Latency (p50)2.1s

Output Limit8K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

google-vertex

in$0.25out$1.5cache$0.025–