Pricing, Performance & Features Comparison
Mixtral 8x7B is a high-quality sparse mixture-of-experts (SMoE) large language model with open weights, released under the Apache 2.0 license. Although it has 46.7 billion total parameters, its router activates only two of eight experts per token, so roughly 12.9 billion parameters are used per token. This gives it the per-token compute cost of a much smaller dense model, up to 6x faster inference than Llama 2 70B, and benchmark performance that outperforms Llama 2 70B and matches or exceeds GPT-3.5 on most standard benchmarks.
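To make the "large total, small active" arithmetic concrete, here is a minimal PyTorch sketch of Mixtral-style top-2 expert routing. The class name, toy dimensions, and expert layout are illustrative assumptions, not Mixtral's actual implementation; the point is only that each token flows through 2 of 8 expert FFNs, so most weights sit idle on any given token.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Top2MoELayer(nn.Module):
    """Illustrative sparse MoE layer: 8 expert FFNs exist, but each
    token is processed by only its top-2 experts, which is why a
    ~47B-parameter SMoE can cost roughly ~13B parameters per token."""
    def __init__(self, dim=512, hidden=2048, n_experts=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(dim, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(n_experts)
        )
        self.top_k = top_k

    def forward(self, x):  # x: (tokens, dim)
        logits = self.router(x)                         # (tokens, n_experts)
        gates, idx = logits.topk(self.top_k, dim=-1)    # pick 2 experts per token
        gates = F.softmax(gates, dim=-1)                # normalize the 2 gate weights
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e in range(len(self.experts)):
                mask = idx[:, k] == e                   # tokens routed to expert e in slot k
                if mask.any():
                    out[mask] += gates[mask, k:k+1] * self.experts[e](x[mask])
        return out

layer = Top2MoELayer()
tokens = torch.randn(4, 512)
print(layer(tokens).shape)  # torch.Size([4, 512])
```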
ai21/jamba-instruct is an instruction-tuned LLM from AI21 Labs, built on a hybrid SSM-Transformer (Mamba) architecture. It provides a 256k-token context window and excels at tasks such as summarization, entity extraction, function calling, JSON-formatted output, and grounded answers with citations. It is designed specifically for enterprise use, with strong performance across multiple benchmarks.
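The 256k-token window is the practical differentiator: whole reports can be passed in a single request. Below is a minimal sketch of such a call through an OpenAI-compatible gateway, which the ai21/jamba-instruct slug suggests; the base URL, environment variable, and input file are assumptions for illustration, not confirmed details from this comparison.

```python
import os
from openai import OpenAI

# Assumed OpenAI-compatible endpoint (e.g. OpenRouter); adjust to your provider.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

# Hypothetical long document; the 256k-token window accepts inputs
# far beyond what most models can take in one request.
long_report = open("quarterly_report.txt").read()

resp = client.chat.completions.create(
    model="ai21/jamba-instruct",
    messages=[
        {"role": "system",
         "content": "Summarize the document, then list all company names mentioned as a JSON array."},
        {"role": "user", "content": long_report},
    ],
)
print(resp.choices[0].message.content)
```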