Haimaker.ai Logo
OpenAI logo

GPT Realtime 2

openai/gpt-realtime-2
Chat
OpenAI|
Function Calling

GPT Realtime 2 (openai/gpt-realtime-2) is an AI model from OpenAI with a 32,000-token context window and 4,096 max output tokens, priced at $4.00/1M input and $16.00/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.

Context Window
32K
tokens
Max Output
4K
tokens
Input Price
$4.00
/1M tokens
Output Price
$16.00
/1M tokens

Overview

OpenAI's real-time model for low-latency conversational applications with streaming audio and text.

Features & Capabilities

Modechat
Context Window32,000 tokens
Max Output4,096 tokens
Function CallingSupported
Vision-
Reasoning-
Web Search-
Url Context-

API Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="openai/gpt-realtime-2",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

Frequently Asked Questions

What is the context window of GPT Realtime 2?

GPT Realtime 2 (openai/gpt-realtime-2) has a 32,000-token context window and supports up to 4,096 output tokens per request.

How much does GPT Realtime 2 cost?

GPT Realtime 2 is priced at $4.00 per 1M input tokens and $16.00 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

What features does GPT Realtime 2 support?

GPT Realtime 2 supports function calling.

How do I use GPT Realtime 2 via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "openai/gpt-realtime-2" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Use GPT Realtime 2 with the haimaker API

OpenAI-compatible endpoint. Start building in minutes.

Get API Access

More from OpenAI