Skip to main content
Glama

gemma-2-9b-it vs gpt-4o-mini-2024-07-18

Pricing, Performance & Features Comparison

Authorgoogle
Context Length4K
Reasoning
-
Providers1
ReleasedJun 2024
Knowledge Cutoff-
License-

google/gemma-2-9b-it is a text-to-text, decoder-only large language model optimized for tasks such as question answering, summarization, and reasoning. It was trained on 8 trillion tokens with 9 billion parameters, aiming for efficient inference and strong performance despite its size. Its design supports flexible deployment for applications like chatbots, code generation, and research in NLP.

Input$0.00
Output$0.00
Latency (p50)-
Output Limit2K
Function Calling
-
JSON Mode
-
InputText
OutputText
in$0.00out$0.00--
Authoropenai
Context Length128K
Reasoning
-
Providers1
ReleasedJul 2024
Knowledge CutoffOct 2023
License-

GPT-4o-mini is a cost-effective and high-performing large language model from OpenAI, capable of handling both text and image inputs. It supports advanced features such as JSON Mode and parallel function calling, and can handle up to 128,000 tokens in its context window. This makes it an excellent choice for a variety of AI tasks, including those requiring large-scale context processing.

Input$0.15
Output$0.6
Latency (p50)2s
Output Limit16K
Function Calling
JSON Mode
InputText, Image
OutputText
in$0.15out$0.6cache$0.075-
Latency (24h)
Success Rate (24h)