qwen3-max-2025-09-23 vs claude-3-5-haiku-20241022

Pricing, Performance & Features Comparison

qwen3-max-2025-09-23

Authoralibaba

Context Length262K

Reasoning

Providers1

ReleasedOct 2024

Knowledge Cutoff-

License-

Qwen-Max is a 100-billion-parameter large language model developed by Alibaba Cloud, optimized for Chinese and English. It is capable of tasks including text creation, translation, dialogue simulation, and data visualization. The model supports a context length of up to 8,000 tokens, with 6,000 tokens available for input in normal usage.

Input$1.6

Output$6.4

Latency (p50)3.8s

Output Limit2K

Function Calling

JSON Mode

InputText, Image, Audio

OutputText, Image

alibaba

in$1.6out$6.4--

Latency (24h)

Success Rate (24h)

claude-3-5-haiku-20241022

Authoranthropic

Context Length200K

Reasoning

Providers1

ReleasedOct 2024

Knowledge CutoffOct 2023

License-

Claude 3.5 Haiku is Anthropic’s fastest model, designed for real-time user-facing chatbots, coding, and data extraction. It offers a large 200K-token context window and excels at rapid, accurate instruction following. This model balances cost, performance, and speed, making it well-suited for enterprise use cases.

Input$0.8

Output$4

Latency (p50)-

Output Limit8K

Function Calling

JSON Mode

InputText

OutputText

anthropic

in$0.8out$4cache$0.03write$0.3