Model reference · Synced 2025-04-29
Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision Instruct is an AI model from OpenRouter. 131K context window. Capabilities: reasoning, tool calling, multimodal vision, audio, open weights. Available on 8 providers. Cheapest listing: $0 input / $0 output per 1M tokens.
Quick facts
- Cheapest input: $0 per 1M tokens (OpenRouter)
- Cheapest output: $0 per 1M tokens
- Context window: 131K tokens
- Max output: 8K tokens
- Release date: 2024-09-25
- Knowledge cutoff: 2023-12
- Capabilities: reasoning, tool calling, multimodal vision, audio, open weights
- Provider count: 8
Provider pricing
Same model, different providers, different prices. Cheapest first.
| Provider | Input / 1M | Output / 1M | Context | Listed |
|---|---|---|---|---|
| OpenRouter | $0 | $0 | 131K | 2024-09-25 |
| Nvidia | $0 | $0 | 128K | 2024-09-18 |
| GitHub Models | $0 | $0 | 128K | 2024-09-25 |
| Cloudflare AI Gateway | $0.049 | $0.68 | 128K | 2025-04-03 |
| Inference | $0.055 | $0.055 | 16K | 2025-01-01 |
| Vercel AI Gateway | $0.16 | $0.16 | 128K | 2024-09-25 |
| Azure Cognitive Services | $0.37 | $0.37 | 128K | 2024-09-25 |
| Azure | $0.37 | $0.37 | 128K | 2024-09-25 |
Prices synced daily from models.dev + provider docs.
How to use this model
If you're picking Llama 3.2 11B Vision Instruct for a project, the three things that matter most:
- Compare it side-by-side with one or two alternatives in the live comparison tool. Pricing differences matter more than benchmarks at scale.
- Pick the cheapest provider that meets your latency / SLA need. Big spread across providers for the same weights.
- Re-evaluate every 3 months. Frontier prices drop fast; a model that's cheapest today may not be in a quarter.
Related models
FAQ
How much does Llama 3.2 11B Vision Instruct cost? $0 input / $0 output per 1M tokens at the cheapest listing. See the table above for other providers.
What is the context window? 131K tokens.
Which providers offer it? OpenRouter, Cloudflare AI Gateway, Azure Cognitive Services, Nvidia, Inference, Vercel AI Gateway, Azure, GitHub Models.
Where do these numbers come from? models.dev + provider documentation, synced daily. About the data.