Check.AI

Model reference · Synced 2025-04-29

Llama 2 7B Chat FP16

Llama 2 7B Chat FP16 is an AI model from Cloudflare AI Gateway. 128K context window. Capabilities: text generation. Available on 1 provider. Cheapest listing: $0.56 input / $6.67 output per 1M tokens.

Quick facts

→ Add Llama 2 7B Chat FP16 to the comparison tool

Provider pricing

Same model, different providers, different prices. Cheapest first.

ProviderInput / 1MOutput / 1MContextListed
Cloudflare AI Gateway $0.56 $6.67 128K 2025-04-03

Prices synced daily from models.dev + provider docs.

How to use this model

If you're picking Llama 2 7B Chat FP16 for a project, the three things that matter most:

Related models

FAQ

How much does Llama 2 7B Chat FP16 cost? $0.56 input / $6.67 output per 1M tokens at the cheapest listing. See the table above for other providers.

What is the context window? 128K tokens.

Which providers offer it? Cloudflare AI Gateway.

Where do these numbers come from? models.dev + provider documentation, synced daily. About the data.