deepseek-v4-flash vs deepseek-v4-pro

Pricing, Performance & Features Comparison

deepseek-v4-flash

Authordeepseek

Context Length1M

Reasoning

Providers1

ReleasedApr 2026

Knowledge Cutoff–

LicenseMIT License

Mixture-of-Experts model with 284B total parameters and 13B activated per token. Features hybrid attention architecture for efficient 1M context processing.

Input$0.14

Output$0.28

Latency (p50)4.1s

Output Limit384K

Function Calling

JSON Mode

InputText

OutputText

deepseek

in$0.14out$0.28cache$0.028write$0.14

Latency (24h)

Success Rate (24h)

deepseek-v4-pro