Name: Llama 3.3 Nemotron Super 49B V1.5
Brand: NVIDIA
SKU: nvidia/llama-3.3-nemotron-super-49b-v1.5
Price: 0.1000 USD
Availability: InStock

Question 1

What is the context window of Llama 3.3 Nemotron Super 49B V1.5?

Accepted Answer

Llama 3.3 Nemotron Super 49B V1.5 (nvidia/llama-3.3-nemotron-super-49b-v1.5) has a 131,072-token context window and supports up to 16,384 output tokens per request.

Question 2

How much does Llama 3.3 Nemotron Super 49B V1.5 cost?

Accepted Answer

Llama 3.3 Nemotron Super 49B V1.5 is priced at $0.10 per 1M input tokens and $0.40 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

Question 3

What features does Llama 3.3 Nemotron Super 49B V1.5 support?

Accepted Answer

Llama 3.3 Nemotron Super 49B V1.5 supports function calling, reasoning.

Question 4

How do I use Llama 3.3 Nemotron Super 49B V1.5 via API?

Accepted Answer

Send requests to https://api.haimaker.ai/v1/chat/completions with model "nvidia/llama-3.3-nemotron-super-49b-v1.5" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Mode	chat
Context Window	131,072 tokens
Max Output	16,384 tokens
Function Calling	Supported
Vision	-
Reasoning	Supported
Web Search	-
Url Context	-

Llama 3.3 Nemotron Super 49B V1.5

Overview

Features & Capabilities

API Usage

Frequently Asked Questions

What is the context window of Llama 3.3 Nemotron Super 49B V1.5?

How much does Llama 3.3 Nemotron Super 49B V1.5 cost?

What features does Llama 3.3 Nemotron Super 49B V1.5 support?

How do I use Llama 3.3 Nemotron Super 49B V1.5 via API?

Use Llama 3.3 Nemotron Super 49B V1.5 with the haimaker API

More from NVIDIA