deepseek-r1-distill-qwen-32b vs moonshot-v1-128k

Pricing, Performance & Features Comparison

deepseek-r1-distill-qwen-32b

Authordeepseek

Context Length128K

Reasoning

Providers3

ReleasedJan 2015

Knowledge CutoffJul 2024

License-

DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

Input$0.45

Output$0.7

Latency (p50)7.7s

Output Limit8K

Function Calling

JSON Mode

InputText

OutputText

glama

Cheapest

in$0.45out$1.6--

cloudflare

in$0.5out$4.9--

fireworks

in$0.7out$0.7--

Latency (24h)

Success Rate (24h)

moonshot-v1-128k

Authormoonshot

Context Length128K

Reasoning

Providers1

ReleasedJan 2023

Knowledge Cutoff-

License-

Moonshot-v1-128k is a large language model with ultra-long context processing capabilities, capable of handling up to 128,000 tokens. It is designed for generating extremely long texts and meeting the demands of complex generation tasks, making it ideal for research, academia, and large document generation.

Input$2

Output$5

Latency (p50)1.4s

Output Limit128K

Function Calling

JSON Mode

InputText

OutputText

moonshot

in$2out$5--