openai/gpt-4.1-nano-2025-04-14

OpenAI's most lightweight GPT-4.1 model for classification, autocompletion, and latency-sensitive tasks, with a 1M-token context window.

| Mode | chat |
| Context Window | 1M tokens |
| Max Output | 32K tokens (32,768) |
| Function Calling | Supported |
| Vision | Supported |
| Reasoning | - |
| Web Search | - |
| URL Context | - |

from openai import OpenAI

# Point the standard OpenAI client at the haimaker endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your own key
)

response = client.chat.completions.create(
    model="openai/gpt-4.1-nano-2025-04-14",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# The reply text is on the first choice's message.
print(response.choices[0].message.content)

OpenAI-compatible endpoint. Start building in minutes.
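
The table lists Function Calling and Vision as supported. The sketch below shows one way the request payloads for those two features might be built, using the standard Chat Completions shapes (`tools` and `image_url` content parts). It assumes the endpoint passes these fields through unchanged, which the page does not state explicitly. The `get_weather` tool and the helper names `build_tool_request`, `build_vision_request`, and `parse_tool_arguments` are hypothetical and exist only for illustration.

```python
import base64
import json

MODEL = "openai/gpt-4.1-nano-2025-04-14"


def build_tool_request(question: str) -> dict:
    """Build chat.completions kwargs that offer the model one callable tool."""
    # get_weather is a hypothetical tool, defined here only for illustration.
    weather_tool = {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Look up the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": question}],
        "tools": [weather_tool],
    }


def build_vision_request(prompt: str, image_bytes: bytes, mime: str = "image/png") -> dict:
    """Build chat.completions kwargs that send text plus one inline image."""
    # Inline the image as a base64 data URL so no public hosting is needed.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": MODEL,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:{mime};base64,{encoded}"},
                    },
                ],
            }
        ],
    }


def parse_tool_arguments(raw_arguments: str) -> dict:
    """Tool-call arguments arrive as a JSON string; decode them before use."""
    return json.loads(raw_arguments)
```

With the `client` from the quickstart, a call would look like `client.chat.completions.create(**build_tool_request("What is the weather in Paris?"))`. If the model decides to call the tool, `response.choices[0].message.tool_calls` holds the calls, and each call's `function.arguments` string can be decoded with `parse_tool_arguments`. Keeping payload construction separate from the network call makes the request shape easy to inspect and test offline.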