haimaker.ai provides API access to all 4 NVIDIA models, with context windows up to 1M tokens, priced from $0.04 to $0.10 per 1M input tokens. OpenAI-compatible endpoint, instant access.
NVIDIA
nvidia/nemotron-3-nano-30b-a3b
NVIDIA
nvidia/nemotron-3-super-120b-a12b
NVIDIA
nvidia/nemotron-nano-9b-v2
NVIDIA
nvidia/llama-3.3-nemotron-super-49b-v1.5