openai/gpt-5.4-mini-2026-03-17GPT 5.4 Mini 2026 03 17 (openai/gpt-5.4-mini-2026-03-17) is an AI model from OpenAI with a 272,000-token context window and 128,000 max output tokens, priced at $0.75/1M input and $4.50/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.
OpenAI's mid-tier GPT-5 model, balancing strong capability with fast responses and cost efficiency.
| Mode | chat |
| Context Window | 272,000 tokens |
| Max Output | 128,000 tokens |
| Function Calling | Supported |
| Vision | Supported |
| Reasoning | Supported |
| Web Search | Supported |
| Url Context | - |
from openai import OpenAI
client = OpenAI(
base_url="https://api.haimaker.ai/v1",
api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
model="openai/gpt-5.4-mini-2026-03-17",
messages=[
{"role": "user", "content": "Hello, how are you?"}
],
)
print(response.choices[0].message.content)GPT 5.4 Mini 2026 03 17 (openai/gpt-5.4-mini-2026-03-17) has a 272,000-token context window and supports up to 128,000 output tokens per request.
GPT 5.4 Mini 2026 03 17 is priced at $0.75 per 1M input tokens and $4.50 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
GPT 5.4 Mini 2026 03 17 supports function calling, vision, reasoning, web search.
Send requests to https://api.haimaker.ai/v1/chat/completions with model "openai/gpt-5.4-mini-2026-03-17" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
OpenAI-compatible endpoint. Start building in minutes.