Qwen: Qwen-Max is an API-access large language model from Qwen priced at $1.04/1M input tokens and $4.16/1M output tokens with a 33k context window. Context caching reduces input costs to $0.52/1M for cache reads.
Economics (OpenRouter)
Input / 1M
$1.04
Output / 1M
$4.16
Est. Monthly Cost
$65.83
Based on current usage settings
Cache Read$0.52
Cache Write$1.30
Capabilities (models.dev)
ReasoningNo
Tool CallNo
Audio InNo
Audio OutNo
Modalitytext
Context Window33k
Benchmarks (AA)
Intelligence Index
General reasoning and knowledge
—
Coding Index
Code generation & debugging
—
Math Index
Mathematical reasoning
—
Performance
—tokens/sec
* Median output throughput
Model ID: qwen/qwen-max • Data Sources: OpenRouter, Artificial Analysis, models.dev