Haimaker.ai Logo
Meta Llama logo

Llama 4 Maverick 17B 128E Instruct

meta-llama/llama-4-maverick
Chatother
Meta Llama|
Vision
|Released Apr 2025 · Updated May 2025

Llama 4 Maverick 17B 128E Instruct (meta-llama/llama-4-maverick) is a llama4 401.6B-parameter model from Meta Llama with a 1,048,576-token context window and 16,384 max output tokens, priced at $0.15/1M input and $0.60/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.

Parameters
401.6B
Context Window
1M
tokens
Max Output
16K
tokens
Input Price
$0.15
/1M tokens
Output Price
$0.60
/1M tokens

Overview

Llama 4 Maverick is a chat model by Meta Llama. It has 401.6B parameters. It supports a 1.0M token context window. Supports vision.

Features & Capabilities

Modechat
Context Window1,048,576 tokens
Max Output16,384 tokens
Function Calling-
VisionSupported
Reasoning-
Web Search-
Url Context-

Technical Details

ArchitectureLlama4ForConditionalGeneration
Model Typellama4
Base Modelmeta-llama/Llama-4-Maverick-17B-128E
Languagesar, de, en, es, fr, hi, id, it, pt, th, tl, vi
Librarytransformers

API Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/llama-4-maverick",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

Frequently Asked Questions

What is the context window of Llama 4 Maverick 17B 128E Instruct?

Llama 4 Maverick 17B 128E Instruct (meta-llama/llama-4-maverick) has a 1,048,576-token context window and supports up to 16,384 output tokens per request.

How much does Llama 4 Maverick 17B 128E Instruct cost?

Llama 4 Maverick 17B 128E Instruct is priced at $0.15 per 1M input tokens and $0.60 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

What features does Llama 4 Maverick 17B 128E Instruct support?

Llama 4 Maverick 17B 128E Instruct supports vision.

How do I use Llama 4 Maverick 17B 128E Instruct via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "meta-llama/llama-4-maverick" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Use Llama 4 Maverick 17B 128E Instruct with the haimaker API

OpenAI-compatible endpoint. Start building in minutes.

Get API Access

More from Meta Llama