Skip to main content
Glama

qwen-2.5-7b-instruct vs qwen-2.5-72b-instruct

Pricing, Performance & Features Comparison

Price unit:
Authoralibaba
Context Length131K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge Cutoff-
LicenseApache License 2.0

Qwen/Qwen2.5-7B-Instruct is an instruction-tuned, decoder-only language model offering enhanced coding, math capabilities, and multilingual support for over 29 languages. It can handle up to 128K tokens of context and generate up to 8K tokens, making it ideal for tasks requiring extended text generation or JSON outputs. Its resilient instruction-following features make it well-suited for chatbot role-play and structured output scenarios.

Input$0.27
Output$0.27
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
InputText
OutputText
in$0.27out$0.27--
Authoralibaba
Context Length131K
Reasoning
-
Providers1
ReleasedSep 2024
Knowledge Cutoff-
License-

Qwen2.5-72B-Instruct is a 72-billion-parameter, decoder-only language model designed for advanced instruction following and long-text generation. It excels at structured data understanding and output, especially JSON, and offers improved coding and mathematical reasoning. The model also supports over 29 languages and can handle extended contexts of up to 128K tokens.

Input$0.23
Output$0.4
Latency (p50)-
Output Limit8K
Function Calling
-
JSON Mode
InputText
OutputText
in$0.23out$0.4--