claude-3-5-haiku-20241022 vs qwen3-max-2025-09-23
Pricing, Performance & Features Comparison
Context Length: 200K
Reasoning: -
Providers: 1
Released: Oct 2024
Knowledge Cutoff: Oct 2023
License: -
Claude 3.5 Haiku is Anthropic’s fastest model, designed for real-time user-facing chatbots, coding, and data extraction. It offers a large 200K-token context window and excels at rapid, accurate instruction following. This model balances cost, performance, and speed, making it well-suited for enterprise use cases.
Input: $0.8
Output: $4
Latency (p50): -
Output Limit: 8K
Function Calling: -
JSON Mode: -
Input: Text
Output: Text
in $0.8, out $4, cache $0.03, write $0.3
Qwen-Max is a 100-billion-parameter large language model developed by Alibaba Cloud, optimized for Chinese and English. It is capable of tasks including text creation, translation, dialogue simulation, and data visualization. The model supports a context length of up to 8,000 tokens, with 6,000 tokens available for input in normal usage.
Input: $1.6
Output: $6.4
Latency (p50): 5.2s
Output Limit: 2K
Function Calling: -
JSON Mode: -
Input: Text, Image, Audio
Output: Text, Image
in $1.6, out $6.4, cache -, write -
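
To make the price gap concrete, the listed input and output prices can be turned into a per-request cost estimate. The sketch below is illustrative only. It assumes the prices are in USD per 1 million tokens, which the listing does not state, and the names Pricing, PRICES and estimate_cost are hypothetical helpers, not part of any provider SDK. Cache pricing is ignored.

```python
from dataclasses import dataclass

# Assumption: the listed prices are USD per 1,000,000 tokens.
# The comparison page does not state the unit.
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    input_price: float   # listed "Input" price
    output_price: float  # listed "Output" price


# Prices copied from the two cards above.
PRICES = {
    "claude-3-5-haiku-20241022": Pricing(input_price=0.8, output_price=4.0),
    "qwen3-max-2025-09-23": Pricing(input_price=1.6, output_price=6.4),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost of one request under the unit assumption above."""
    p = PRICES[model]
    total = input_tokens * p.input_price + output_tokens * p.output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 1,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 10_000, 1_000):.4f}")
```

Under that unit assumption, a request with 10,000 input tokens and 1,000 output tokens comes to about $0.0120 for claude-3-5-haiku-20241022 and about $0.0224 for qwen3-max-2025-09-23.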