qwen-flash-2025-07-28 vs gpt-oss-20b

Pricing, Performance & Features Comparison

Authoralibaba

Context Length1M

Reasoning

Providers1

ReleasedJul 2025

Knowledge Cutoff-

License-

Qwen-Flash is the fastest and most cost-effective model in the Qwen series and is suitable for simple jobs.

Input$0.25

Output$2

Latency (p50)2s

Output Limit33K

Function Calling

JSON Mode

Input-

Output-

in$0.25out$2--

Authoropenai

Context Length131K

Reasoning

Providers0

ReleasedAug 2025

Knowledge Cutoff-

License-

OpenAI medium-sized open weight model for low latency.

Input-

Output-

Latency (p50)-

Output Limit131K

Function Calling

JSON Mode

InputText

OutputText