Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct
Chat · llama3.2
Meta Llama · Released Sep 2024 · Updated Oct 2024

Llama 3.2 3B Instruct (meta-llama/llama-3.2-3b-instruct) is a 3.2B-parameter, Llama-architecture chat model from Meta Llama with a 131,072-token context window and a maximum output of 80,000 tokens. It is priced at $0.05 per 1M input tokens and $0.34 per 1M output tokens and is available through the haimaker.ai OpenAI-compatible API.

Parameters: 3.2B
Context Window: 131K tokens
Max Output: 80K tokens
Input Price: $0.05 / 1M tokens
Output Price: $0.34 / 1M tokens

Overview

Llama 3.2 3B Instruct is a chat model from Meta Llama with 3.2B parameters and a 131K-token (131,072) context window.

Features & Capabilities

Mode: chat
Context Window: 131,072 tokens
Max Output: 80,000 tokens
Function Calling: -
Vision: -
Reasoning: -
Web Search: -
URL Context: -

Technical Details

Architecture: LlamaForCausalLM
Model Type: llama
Languages: en, de, fr, it, pt, hi, es, th
Library: transformers

API Usage

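from openai import OpenAI

# Point the OpenAI SDK at the haimaker.ai OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your haimaker.ai API key
)

# Send a single-turn chat request to Llama 3.2 3B Instruct.
response = client.chat.completions.create(
    model="meta-llama/llama-3.2-3b-instruct",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# Print the text of the first completion choice.
print(response.choices[0].message.content)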

Frequently Asked Questions

What is the context window of Llama 3.2 3B Instruct?

Llama 3.2 3B Instruct (meta-llama/llama-3.2-3b-instruct) has a 131,072-token context window and supports up to 80,000 output tokens per request.
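
The two limits above can be combined into a rough output budget for a given prompt. The sketch below assumes that prompt and output tokens share the 131,072-token window, which is common for chat models but is not stated on this page. The helper name max_tokens_budget is hypothetical and not part of any SDK.

```python
# Limits copied from the listing above.
CONTEXT_WINDOW = 131_072  # total tokens
MAX_OUTPUT = 80_000       # output tokens per request


def max_tokens_budget(prompt_tokens: int) -> int:
    """Largest output-token limit that still fits (hypothetical helper).

    Assumption: prompt and output tokens share the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Cap at the per-request output limit and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


print(max_tokens_budget(100_000))  # 31072
```

Under this assumption, a prompt shorter than 51,072 tokens leaves room for the full 80,000-token output limit.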

How much does Llama 3.2 3B Instruct cost?

Llama 3.2 3B Instruct is priced at $0.05 per 1M input tokens and $0.34 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
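
As a rough illustration of the pricing arithmetic, the sketch below estimates the cost of a single request from its token counts. The helper name estimate_cost is hypothetical, and the result ignores any rounding, minimum charges, or discounts the provider may apply.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.34


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 2,000-token prompt with a 500-token reply.
cost = estimate_cost(input_tokens=2_000, output_tokens=500)
print(f"${cost:.6f}")  # $0.000270
```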

How do I use Llama 3.2 3B Instruct via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "meta-llama/llama-3.2-3b-instruct" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
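
For readers who prefer not to use an SDK, the request described above can be sketched with the Python standard library alone. The endpoint, model name, and Bearer scheme come from this page. The helper names build_request and send are hypothetical, and the response shape read in send is an assumption based on the OpenAI chat completions format.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"
MODEL = "meta-llama/llama-3.2-3b-instruct"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request (hypothetical helper)."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # Bearer API key auth
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the reply text.

    Assumption: the response follows the OpenAI chat completions shape.
    """
    with urllib.request.urlopen(request) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Calling send(build_request("YOUR_API_KEY", "Hello, how are you?")) would perform the network call. Keeping construction and sending separate makes the request easy to inspect before any tokens are billed.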
