Skip to main content
Glama

claude-sonnet-4-6 vs gemini-3.1-flash-lite-preview

Pricing, Performance & Features Comparison

Authoranthropic
Context Length1M
Reasoning
Providers1
ReleasedFeb 2026
Knowledge CutoffAug 2025
License

Claude Sonnet 4.6 is Anthropic's best combination of speed and intelligence, delivering frontier performance across coding, agents, and professional work at scale. It features advanced capabilities including extended and adaptive thinking, supports a 1M token context window, and excels in reasoning, coding, multilingual tasks, long-context handling, and image processing with a training data cutoff of January 2026.

Input$3
Output$15
Latency (p50)3.5s
Output Limit128K
Function Calling
JSON Mode
InputText, Image
OutputText
in$3out$15
Latency (24h)
Success Rate (24h)
Authorgoogle
Context Length1M
Reasoning
Providers1
ReleasedFeb 2025
Knowledge CutoffJun 2024
License

Gemini 2.0 Flash-Lite is Google's most cost-efficient Gemini 2.0 model, optimized for cost efficiency and low latency. It is a multimodal model supporting inputs including text, code, images, audio, and video, with text-only output. It features a maximum input capacity of 1,048,576 tokens and is cost-optimized for large-scale text output use cases.

Input$0.25
Output$1.5
Latency (p50)2s
Output Limit8K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$0.25out$1.5cache$0.025
Latency (24h)
Success Rate (24h)