Skip to main content
Glama

ministral-3b-2410 vs qwen-2.5-72b-instruct

Pricing, Performance & Features Comparison

Price unit:
Authormistral
Context Length128K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge CutoffOct 2024
License-

Ministral/ministral-3b-2410 is described as the world’s best edge model, designed for robust performance in resource-constrained environments. It focuses on delivering high-quality outputs while keeping computational requirements low, making it ideal for edge deployments.

Input$0.04
Output$0.04
Latency (p50)669ms
Output Limit4K
Function Calling
-
JSON Mode
-
Input-
Output-
in$0.04out$0.04--
Latency (24h)
Success Rate (24h)
Authoralibaba
Context Length131K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge Cutoff-
License-

Qwen2.5-72B-Instruct is a 72-billion-parameter, decoder-only language model designed for advanced instruction following and long-text generation. It excels at structured data understanding and output, especially JSON, and offers improved coding and mathematical reasoning. The model also supports over 29 languages and can handle extended contexts of up to 128K tokens.

Input$0.23
Output$0.4
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
InputText
OutputText
in$0.23out$0.4--