Haimaker.ai Logo
NVIDIA logo

Llama 3.3 Nemotron Super 49B V1.5

nvidia/llama-3.3-nemotron-super-49b-v1.5
Chat
NVIDIA|
Function CallingReasoning

Llama 3.3 Nemotron Super 49B V1.5 (nvidia/llama-3.3-nemotron-super-49b-v1.5) is an AI model from NVIDIA with a 131,072-token context window and 16,384 max output tokens, priced at $0.10/1M input and $0.40/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.

Context Window
131K
tokens
Max Output
16K
tokens
Input Price
$0.10
/1M tokens
Output Price
$0.40
/1M tokens

Overview

Llama 3.3 Nemotron Super 49B V1.5 is a chat model by NVIDIA. It supports a 131K token context window. Supports function calling, reasoning.

Features & Capabilities

Modechat
Context Window131,072 tokens
Max Output16,384 tokens
Function CallingSupported
Vision-
ReasoningSupported
Web Search-
Url Context-

API Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="nvidia/llama-3.3-nemotron-super-49b-v1.5",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

Frequently Asked Questions

What is the context window of Llama 3.3 Nemotron Super 49B V1.5?

Llama 3.3 Nemotron Super 49B V1.5 (nvidia/llama-3.3-nemotron-super-49b-v1.5) has a 131,072-token context window and supports up to 16,384 output tokens per request.

How much does Llama 3.3 Nemotron Super 49B V1.5 cost?

Llama 3.3 Nemotron Super 49B V1.5 is priced at $0.10 per 1M input tokens and $0.40 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

What features does Llama 3.3 Nemotron Super 49B V1.5 support?

Llama 3.3 Nemotron Super 49B V1.5 supports function calling, reasoning.

How do I use Llama 3.3 Nemotron Super 49B V1.5 via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "nvidia/llama-3.3-nemotron-super-49b-v1.5" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Use Llama 3.3 Nemotron Super 49B V1.5 with the haimaker API

OpenAI-compatible endpoint. Start building in minutes.

Get API Access

More from NVIDIA