Model reference · Synced 2025-04-29
Qwen: Qwen3 Coder Flash
Qwen: Qwen3 Coder Flash is an AI model from Kilo Gateway. 1M context window. Capabilities: tool calling. Available on 1 provider. Cheapest listing: $0.195 input / $0.975 output per 1M tokens.
Quick facts
- Cheapest input: $0.195 per 1M tokens (Kilo Gateway)
- Cheapest output: $0.975 per 1M tokens
- Context window: 1M tokens
- Max output: 66K tokens
- Release date: 2025-07-23
- Capabilities: tool calling
- Provider count: 1
Provider pricing
Same model, different providers, different prices. Cheapest first.
| Provider | Input / 1M | Output / 1M | Context | Listed |
|---|---|---|---|---|
| Kilo Gateway | $0.195 | $0.975 | 1M | 2025-07-23 |
Prices synced daily from models.dev + provider docs.
How to use this model
If you're picking Qwen: Qwen3 Coder Flash for a project, the three things that matter most:
- Compare it side-by-side with one or two alternatives in the live comparison tool. Pricing differences matter more than benchmarks at scale.
- Pick the cheapest provider that meets your latency / SLA need. Big spread across providers for the same weights.
- Re-evaluate every 3 months. Frontier prices drop fast; a model that's cheapest today may not be in a quarter.
Related models
FAQ
How much does Qwen: Qwen3 Coder Flash cost? $0.195 input / $0.975 output per 1M tokens at the cheapest listing. See the table above for other providers.
What is the context window? 1M tokens.
Which providers offer it? Kilo Gateway.
Where do these numbers come from? models.dev + provider documentation, synced daily. About the data.