Skip to main content
Glama

qwen3-max-2025-09-23 vs claude-3-5-haiku-20241022

Pricing, Performance & Features Comparison

Price unit:
Authoralibaba
Context Length262K
Reasoning
-
Providers1
ReleasedOct 2024
Knowledge Cutoff-
License-

Qwen-Max is a 100-billion-parameter large language model developed by Alibaba Cloud, optimized for Chinese and English. It is capable of tasks including text creation, translation, dialogue simulation, and data visualization. The model supports a context length of up to 8,000 tokens, with 6,000 tokens available for input in normal usage.

Input$1.6
Output$6.4
Latency (p50)2.4s
Output Limit2K
Function Calling
-
JSON Mode
-
InputText, Image, Audio
OutputText, Image
in$1.6out$6.4--
Latency (24h)
Success Rate (24h)
Authoranthropic
Context Length200K
Reasoning
-
Providers1
ReleasedOct 2024
Knowledge CutoffOct 2023
License-

Claude 3.5 Haiku is Anthropic’s fastest model, designed for real-time user-facing chatbots, coding, and data extraction. It offers a large 200K-token context window and excels at rapid, accurate instruction following. This model balances cost, performance, and speed, making it well-suited for enterprise use cases.

Input$0.8
Output$4
Latency (p50)879ms
Output Limit8K
Function Calling
-
JSON Mode
-
InputText
OutputText
in$0.8out$4cache$0.03write$0.3
Latency (24h)
Success Rate (24h)