Name: Llama 4 Maverick 17B 128E Instruct
Brand: Meta Llama
SKU: meta-llama/llama-4-maverick
Price: 0.1500 USD
Availability: InStock

Question 1

What is the context window of Llama 4 Maverick 17B 128E Instruct?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct (meta-llama/llama-4-maverick) has a 1,048,576-token context window and supports up to 16,384 output tokens per request.

Question 2

How much does Llama 4 Maverick 17B 128E Instruct cost?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct is priced at $0.15 per 1M input tokens and $0.60 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

Question 3

What features does Llama 4 Maverick 17B 128E Instruct support?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct supports vision.

Question 4

How do I use Llama 4 Maverick 17B 128E Instruct via API?

Accepted Answer

Send requests to https://api.haimaker.ai/v1/chat/completions with model "meta-llama/llama-4-maverick" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Mode	chat
Context Window	1,048,576 tokens
Max Output	16,384 tokens
Function Calling	Not supported
Vision	Supported
Reasoning	Not supported
Web Search	Not supported
Url Context	Not supported

Architecture	Llama4ForConditionalGeneration
Model Type	llama4
Base Model	meta-llama/Llama-4-Maverick-17B-128E
Languages	ar, de, en, es, fr, hi, id, it, pt, th, tl, vi
Library	transformers

Llama 4 Maverick 17B 128E Instruct

Overview

Features & Capabilities

Technical Details

API Usage

Frequently Asked Questions

What is the context window of Llama 4 Maverick 17B 128E Instruct?

How much does Llama 4 Maverick 17B 128E Instruct cost?

What features does Llama 4 Maverick 17B 128E Instruct support?

How do I use Llama 4 Maverick 17B 128E Instruct via API?

Use Llama 4 Maverick 17B 128E Instruct with the haimaker API

More from Meta Llama