Skip to main content
Glama

qwen-2.5-7b-instruct vs ministral-3b-2410

Pricing, Performance & Features Comparison

Price unit:
Authoralibaba
Context Length131K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge Cutoff-
LicenseApache License 2.0

Qwen/Qwen2.5-7B-Instruct is an instruction-tuned, decoder-only language model offering enhanced coding, math capabilities, and multilingual support for over 29 languages. It can handle up to 128K tokens of context and generate up to 8K tokens, making it ideal for tasks requiring extended text generation or JSON outputs. Its resilient instruction-following features make it well-suited for chatbot role-play and structured output scenarios.

Input$0.27
Output$0.27
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
InputText
OutputText
in$0.27out$0.27--
Authormistral
Context Length128K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge CutoffOct 2024
License-

Ministral/ministral-3b-2410 is described as the world’s best edge model, designed for robust performance in resource-constrained environments. It focuses on delivering high-quality outputs while keeping computational requirements low, making it ideal for edge deployments.

Input$0.04
Output$0.04
Latency (p50)659ms
Output Limit4K
Function Calling
-
JSON Mode
-
Input-
Output-
in$0.04out$0.04--
Latency (24h)
Success Rate (24h)