Skip to main content
Glama

deepseek-r1-distill-qwen-32b vs moonshot-v1-8k

Pricing, Performance & Features Comparison

Price unit:
Authordeepseek
Context Length128K
Reasoning
-
Providers3
ReleasedJan 2015
Knowledge CutoffJul 2024
License-

DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

Input$0.45
Output$0.7
Latency (p50)6.9s
Output Limit8K
Function Calling
-
JSON Mode
-
InputText
OutputText
glama
Cheapest
in$0.45out$1.6--
in$0.5out$4.9--
in$0.7out$0.7--
Latency (24h)
Success Rate (24h)
Authormoonshot
Context Length8K
Reasoning
-
Providers1
ReleasedJan 2024
Knowledge CutoffJan 2023
License-

The Moonshot V1 8K model is specifically designed for short text generation tasks. It features efficient processing performance and can handle up to 8,192 tokens, making it suitable for brief dialogues, note-taking, and rapid content generation.

Input$0.2
Output$2
Latency (p50)1.4s
Output Limit8K
Function Calling
JSON Mode
InputText
OutputText
in$0.2out$2--
Latency (24h)
Success Rate (24h)