Glama

ministral-3b-2410 vs ministral-8b-2410

Pricing, Performance & Features Comparison

Author: mistral
Context Length: 128K
Reasoning: -
Providers: 1
Released: Sep 2024
Knowledge Cutoff: Oct 2024
License: -

ministral-3b-2410 is described as the world’s best edge model, designed for resource-constrained environments. It aims to deliver high-quality outputs while keeping computational requirements low, which suits it to edge deployments.

Input: $0.04
Output: $0.04
Latency (p50): 669ms
Output Limit: 4K
Function Calling: -
JSON Mode: -
Input: -
Output: -
Author: mistral
Context Length: 128K
Reasoning: -
Providers: 1
Released: Sep 2024
Knowledge Cutoff: Oct 2023
License: -

Ministral-8B-Instruct-2410 is an instruction-tuned language model built on Mistral’s 8B-parameter dense transformer architecture. It supports large context windows (up to 128k tokens) and is particularly strong in multilingual applications, code-related tasks, and chat-based interactions. Its design targets efficient on-device and edge computing scenarios with high performance at scale.

Input: $0.1
Output: $0.1
Latency (p50): 674ms
Output Limit: 128K
Function Calling
JSON Mode: -
Input: Text
Output: Text
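
To make the price difference concrete, the listed input and output prices can be turned into a rough cost estimate for a given workload. The sketch below is illustrative only: the page does not state the price unit, so it assumes USD per 1M tokens, and the function name `estimate_cost` and the sample workload are hypothetical.

```python
# Prices copied from the comparison above. The unit is an ASSUMPTION
# (USD per 1M tokens); the page does not state the price unit.
PRICES = {
    "ministral-3b-2410": {"input": 0.04, "output": 0.04},
    "ministral-8b-2410": {"input": 0.10, "output": 0.10},
}
TOKENS_PER_PRICE_UNIT = 1_000_000  # assumption, see note above


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one workload for `model`."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

Whatever the actual unit turns out to be, the ratio is unaffected: at the listed prices, ministral-8b-2410 costs 2.5 times as much per token as ministral-3b-2410 for both input and output.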