moonshot-v1-128k vs open-mistral-7b

Pricing, Performance & Features Comparison

moonshot-v1-128k

Authormoonshot

Context Length128K

Reasoning

Providers1

ReleasedJan 2023

Knowledge Cutoff-

License-

Moonshot-v1-128k is a large language model with ultra-long context processing capabilities, capable of handling up to 128,000 tokens. It is designed for generating extremely long texts and meeting the demands of complex generation tasks, making it ideal for research, academia, and large document generation.

Input$2

Output$5

Latency (p50)-

Output Limit128K

Function Calling

JSON Mode

InputText

OutputText

moonshot

in$2out$5--

Success Rate (24h)

open-mistral-7b

Authormistral

Context Length33K

Reasoning

Providers1

ReleasedFeb 2024

Knowledge CutoffOct 2023

License-

Mistral 7B is a 7.3 billion parameter language model engineered to significantly outperform larger models like Llama 2 13B across all benchmarks and Llama 1 34B on many. It leverages advanced architectural features such as Grouped-query attention for faster inference and Sliding Window Attention for efficient handling of longer sequences, allowing it to approach CodeLlama 7B's performance on coding tasks while maintaining strong English language capabilities.

Input$0.25

Output$0.25

Latency (p50)838ms

Output Limit8K

Function Calling

JSON Mode

InputText

OutputText

mistral

in$0.25out$0.25--

moonshot-v1-128k vs open-mistral-7b

Success Rate (24h)

Latency (24h)

Success Rate (24h)