Skip to main content
Glama

moonshot-v1-128k vs moonshot-v1-8k

Pricing, Performance & Features Comparison

Price unit:
Authormoonshot
Context Length128K
Reasoning
-
Providers1
ReleasedJan 2023
Knowledge Cutoff-
License-

Moonshot-v1-128k is a large language model with ultra-long context processing capabilities, capable of handling up to 128,000 tokens. It is designed for generating extremely long texts and meeting the demands of complex generation tasks, making it ideal for research, academia, and large document generation.

Input$2
Output$5
Latency (p50)1.2s
Output Limit128K
Function Calling
JSON Mode
-
InputText
OutputText
in$2out$5--
Latency (24h)
Success Rate (24h)
Authormoonshot
Context Length8K
Reasoning
-
Providers1
ReleasedJan 2024
Knowledge CutoffJan 2023
License-

The Moonshot V1 8K model is specifically designed for short text generation tasks. It features efficient processing performance and can handle up to 8,192 tokens, making it suitable for brief dialogues, note-taking, and rapid content generation.

Input$0.2
Output$2
Latency (p50)1.4s
Output Limit8K
Function Calling
JSON Mode
InputText
OutputText
in$0.2out$2--
Latency (24h)
Success Rate (24h)