Llama 3.1 8B

Multilingual LLM pre-trained and instruction-tuned, surpassing open and closed models on key benchmarks.

About model

Llama 3.1 8B generates human-like text based on input prompts, excelling at conversational dialogue and long-form content creation. It is designed for developers and businesses seeking advanced language capabilities.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Related models

Model specifications

Model data

Model provider
Meta
Type
LLM
Chat
Main use cases
Chat
Small & Fast
Function Calling
Features
Function Calling
JSON Mode
Deployment
On-Demand Dedicated
Monthly Reserved
Parameters
8B
Context length
128K
Input price
$0.18 / 1M tokens
Output price
$0.18 / 1M tokens
Input modalities
Text
Output modalities
Text

Released
July 22, 2024
Last updated
March 5, 2026
Quantization level
FP8
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model