Qwen3 Max
qwen/qwen3-maxQwen3 Max (qwen/qwen3-max) is an AI model from Qwen with a 262,144-token context window and 32,768 max output tokens, priced at $0.78/1M input and $3.90/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.
Overview
Qwen3 Max is a chat model by Qwen. It supports a 262K token context window. Supports function calling.
Features & Capabilities
| Mode | chat |
| Context Window | 262,144 tokens |
| Max Output | 32,768 tokens |
| Function Calling | Supported |
| Vision | Not supported |
| Reasoning | Not supported |
| Web Search | Not supported |
| Url Context | Not supported |
API Usage
from openai import OpenAI
client = OpenAI(
base_url="https://api.haimaker.ai/v1",
api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
model="qwen/qwen3-max",
messages=[
{"role": "user", "content": "Hello, how are you?"}
],
)
print(response.choices[0].message.content)Frequently Asked Questions
What is the context window of Qwen3 Max?
Qwen3 Max (qwen/qwen3-max) has a 262,144-token context window and supports up to 32,768 output tokens per request.
How much does Qwen3 Max cost?
Qwen3 Max is priced at $0.78 per 1M input tokens and $3.90 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
What features does Qwen3 Max support?
Qwen3 Max supports function calling.
How do I use Qwen3 Max via API?
Send requests to https://api.haimaker.ai/v1/chat/completions with model "qwen/qwen3-max" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
Use Qwen3 Max with the haimaker API
OpenAI-compatible endpoint. Start building in minutes.