Skip to main content
Glama

deepseek-r1-distill-qwen-32b vs moonshot-v1-128k

Pricing, Performance & Features Comparison

Price unit:
Authordeepseek
Context Length128K
Reasoning
-
Providers3
ReleasedJan 2015
Knowledge CutoffJul 2024
License-

DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

Input$0.45
Output$0.7
Latency (p50)6.9s
Output Limit8K
Function Calling
-
JSON Mode
-
InputText
OutputText
glama
Cheapest
in$0.45out$1.6--
in$0.5out$4.9--
in$0.7out$0.7--
Latency (24h)
Success Rate (24h)
Authormoonshot
Context Length128K
Reasoning
-
Providers1
ReleasedJan 2023
Knowledge Cutoff-
License-

Moonshot-v1-128k is a large language model with ultra-long context processing capabilities, capable of handling up to 128,000 tokens. It is designed for generating extremely long texts and meeting the demands of complex generation tasks, making it ideal for research, academia, and large document generation.

Input$2
Output$5
Latency (p50)1.2s
Output Limit128K
Function Calling
JSON Mode
-
InputText
OutputText
in$2out$5--
Latency (24h)
Success Rate (24h)