deepseek-v4-pro vs gpt-5.4-nano-2026-03-17
Pricing, Performance & Features Comparison
Context Length1M
Reasoning
Providers1
ReleasedApr 2026
Knowledge Cutoff-
LicenseMIT License
Flagship Mixture-of-Experts model with 1.6T total parameters and 49B activated per token. Trained on 32T+ tokens with hybrid attention for efficient 1M context processing.
Input$1.7
Output$3.5
Latency (p50)4.5s
Output Limit384K
Function Calling
JSON Mode
-
InputText
OutputText
in$1.7out$3.5cache$0.15write$1.7
Latency (24h)
Success Rate (24h)
Input$0.2
Output$1.3
Latency (p50)2.5s
Output Limit128K
Function Calling
JSON Mode
-
InputText, Image
OutputText
in$0.2out$1.3cache$0.02-