Which providers offer Qwen3 30B A3B FP8?

2 providers list this model: Cloudflare AI Gateway, LLM Gateway.

Model reference · Synced 2025-04-29

Qwen3 30B A3B FP8

Q: How much does Qwen3 30B A3B FP8 cost?

$0.051 per 1M input tokens and $0.34 per 1M output tokens at the cheapest provider listing. Other providers may price it differently — see the comparison table on this page.

Qwen3 30B A3B FP8 is an AI model from Cloudflare AI Gateway. 128K context window. Capabilities: reasoning, tool calling, open weights. Available on 2 providers. Cheapest listing: $0.051 input / $0.34 output per 1M tokens.

Quick facts

Cheapest input: $0.051 per 1M tokens (Cloudflare AI Gateway)
Cheapest output: $0.34 per 1M tokens
Context window: 128K tokens
Max output: 16K tokens
Release date: 2025-11-14
Capabilities: reasoning, tool calling, open weights
Provider count: 2

→ Add Qwen3 30B A3B FP8 to the comparison tool

Provider pricing

Same model, different providers, different prices. Cheapest first.

Provider	Input / 1M	Output / 1M	Context	Listed
Cloudflare AI Gateway	$0.051	$0.34	128K	2025-11-14
LLM Gateway	$0.1	$0.1	131K	2025-04-28

Prices synced daily from models.dev + provider docs.

How to use this model

If you're picking Qwen3 30B A3B FP8 for a project, the three things that matter most:

Compare it side-by-side with one or two alternatives in the live comparison tool. Pricing differences matter more than benchmarks at scale.
Pick the cheapest provider that meets your latency / SLA need. Big spread across providers for the same weights.
Re-evaluate every 3 months. Frontier prices drop fast; a model that's cheapest today may not be in a quarter.

Related models

FAQ

How much does Qwen3 30B A3B FP8 cost? $0.051 input / $0.34 output per 1M tokens at the cheapest listing. See the table above for other providers.

What is the context window? 128K tokens.

Which providers offer it? Cloudflare AI Gateway, LLM Gateway.

Where do these numbers come from? models.dev + provider documentation, synced daily. About the data.