Skip to main content
Glama

claude-3-5-haiku-20241022 vs qwen-turbo-2025-04-28

Pricing, Performance & Features Comparison

Price unit:
Authoranthropic
Context Length200K
Reasoning
-
Providers1
ReleasedOct 2024
Knowledge CutoffOct 2023
License-

Claude 3.5 Haiku is Anthropic’s fastest model, designed for real-time user-facing chatbots, coding, and data extraction. It offers a large 200K-token context window and excels at rapid, accurate instruction following. This model balances cost, performance, and speed, making it well-suited for enterprise use cases.

Input$0.8
Output$4
Latency (p50)879ms
Output Limit8K
Function Calling
-
JSON Mode
-
InputText
OutputText
in$0.8out$4cache$0.03write$0.3
Latency (24h)
Success Rate (24h)
Authoralibaba
Context Length1M
Reasoning
-
Providers1
ReleasedOct 2024
Knowledge Cutoff-
License-

The alibaba/qwen-turbo model is part of Alibaba Cloud's Qwen LLM series, offering capabilities like conversation, content creation, and code interpretation. It builds on advanced natural language processing techniques and supports multilingual tasks with a focus on Chinese and English.

Input$0.5
Output$0.2
Latency (p50)1.7s
Output Limit8K
Function Calling
JSON Mode
-
InputText
OutputText
in$0.5out$0.2--
Latency (24h)
Success Rate (24h)