gpt-4o-mini-2024-07-18 vs llama-3.1-8b-instruct

Pricing, Performance & Features Comparison

gpt-4o-mini-2024-07-18

Authoropenai

Context Length128K

Reasoning

Providers1

ReleasedJul 2024

Knowledge CutoffOct 2023

License-

GPT-4o-mini is a cost-effective and high-performing large language model from OpenAI, capable of handling both text and image inputs. It supports advanced features such as JSON Mode and parallel function calling, and can handle up to 128,000 tokens in its context window. This makes it an excellent choice for a variety of AI tasks, including those requiring large-scale context processing.

Input$0.15

Output$0.6

Latency (p50)2s

Output Limit16K

Function Calling

JSON Mode

InputText, Image

OutputText

openai

in$0.15out$0.6cache$0.075-

Latency (24h)

Success Rate (24h)

llama-3.1-8b-instruct

Authormeta

Context Length128K

Reasoning

Providers2

ReleasedJul 2024

Knowledge CutoffDec 2023

License-

Llama 3.1-8B-Instruct is an auto-regressive language model optimized for multilingual dialogue and instruction-following tasks. It employs supervised fine-tuning and reinforcement learning with human feedback to align with human preferences. The model supports a 128k token context and is suitable for generating text and code in multiple languages.

Input$0.02

Output$0.05

Latency (p50)-

Output Limit4K

Function Calling

JSON Mode

InputText

OutputText

deepinfra

Cheapest

in$0.02out$0.05--

avian

in$0.1out$0.1--