Skip to main content
Glama

gemini-2.0-flash-001 vs llama-3.3-70b-instruct

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length1M
Reasoning
-
Providers2
ReleasedDec 2024
Knowledge CutoffJun 2024
License-

Workhorse model for all daily tasks. Strong overall performance.

Input$0.1
Output$0.4
Latency (p50)956ms
Output Limit8K
Function Calling
-
JSON Mode
InputText, Image
OutputText
in$0.1out$0.4--
in$0.15out$0.6--
Latency (24h)
Success Rate (24h)
Authormeta
Context Length128K
Reasoning
-
Providers1
ReleasedDec 2024
Knowledge CutoffDec 2023
License-

Llama 3.3 is a text-only 70B instruction-tuned model that provides enhanced performance relative to Llama 3.1 70B–and to Llama 3.2 90B when used for text-only applications. Moreover, for some applications, Llama 3.3 70B approaches the performance of Llama 3.1 405B.

Input$0.45
Output$0.45
Latency (p50)-
Output Limit4K
Function Calling
JSON Mode
-
InputText
OutputText
in$0.45out$0.45--