Model

Meta: Llama 3.1 8B Instruct

Catalog snapshot from OpenRouter. This model is discoverable on-site even when it is not currently included in the global ranking list.

Data updated:

Data version: v20260430T083453Z Data size: 100

This model is currently available from the catalog snapshot and may not be included in the latest ranked board yet.

About this model

Meta: Llama 3.1 8B Instruct is listed in our model catalog as a LLM model with 16,384 ctx and a snapshot average price around $0.04 per 1M tokens. The data below is generated from the latest catalog snapshot and integration examples are provided when an OpenRouter id is available.

You can also explore more models from Meta Llama , and browse more options from 🇺🇸 United States .

Key metrics

Rank
Not ranked
Kind
LLM
Core metric
16,384 ctx
1M tokens (avg)
$0.04
Vendor / team
Meta Llama
Origin
🇺🇸 United States
License
Custom / community
VRAM requirement
Estimate by quantization

Hippo's Quick Action

OpenRouter chat completions URL; set Authorization and body per docs.

Price calculator

Est. monthly cost (USD):

Price comparison (snapshot)

Source / aggregator Price / 1M tokens Latency
Snapshot average (board) $0.04

Figures come from the imported leaderboard snapshot; live aggregator pricing and latency can change.

Token pricing by provider

Compare per-provider token prices for this model across available platforms.

Provider Input / 1M tokens Output / 1M tokens Status Price updated
OpenRouter $0.02 $0.05 Verified 2026-04-30T08:36:20.572Z
Groq $0.15 $0.60 Snapshot 2026-04-29T05:13:01.371Z
Fireworks $0.15 $0.60 Snapshot 2026-04-29T05:13:01.371Z
DeepInfra $0.15 $0.60 Snapshot 2026-04-29T05:13:01.371Z
SiliconFlow $0.15 $0.60 Snapshot 2026-04-29T05:13:01.371Z
Together $8.00 $20.00 Snapshot 2026-04-30T06:05:58.828Z

Provider prices are sourced from the token comparison dataset and may change between snapshots.

How to integrate

OpenRouter exposes an OpenAI-compatible Chat Completions endpoint. Use the tabs below to switch example languages. Replace the model id with the one from your provider page if you route elsewhere.

// Node.js 18+ — set OPENROUTER_API_KEY in your environment
const res = await fetch('https://openrouter.ai/api/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.OPENROUTER_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: "meta-llama/llama-3.1-8b-instruct",
    messages: [{ role: 'user', content: 'Hello' }],
  }),
});
const data = await res.json();
console.log(data);

Store API keys in environment variables or a secret manager—never commit them to source control.

Alternative picks

Pick one or two more models on global rankings and use Compare to view them side by side.

Run with Ollama

Paste into your terminal (install Ollama first):