z-ai/glm-4.5

GLM 4.5 (z-ai/glm-4.5) is an AI model from Z Ai with a 131,072-token context window and a maximum output of 98,304 tokens, priced at $0.60 per 1M input tokens and $2.20 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.
Zhipu AI's GLM (General Language Model) with reasoning and function calling capabilities.
| Property | Value |
| --- | --- |
| Mode | chat |
| Context Window | 131,072 tokens |
| Max Output | 98,304 tokens |
| Function Calling | Supported |
| Vision | - |
| Reasoning | Supported |
| Web Search | - |
| URL Context | - |
from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="z-ai/glm-4.5",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

GLM 4.5 (z-ai/glm-4.5) has a 131,072-token context window and supports up to 98,304 output tokens per request.
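To make the two limits concrete, here is a minimal sketch of a helper that picks a safe `max_tokens` value before sending a request. The helper name `clamp_max_tokens` is hypothetical, and the sketch assumes that prompt and completion tokens share the 131,072-token context window, which is common for chat models but is not stated on this page.

```python
CONTEXT_WINDOW = 131_072  # tokens, from the model card
MAX_OUTPUT = 98_304       # tokens, from the model card


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return a max_tokens value that respects both limits.

    Assumption: prompt and completion tokens count against the same
    context window, so the output can use at most what the prompt leaves.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills the context window")
    return min(requested, MAX_OUTPUT, remaining)
```

For example, a 100,000-token prompt leaves room for at most 31,072 output tokens under this assumption, even though the model's output cap is higher.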
GLM 4.5 is priced at $0.60 per 1M input tokens and $2.20 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
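The per-token arithmetic can be sketched as a small function. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is strictly linear in token count at the listed rates, with no minimums, discounts, or separate charges for reasoning tokens.

```python
INPUT_PRICE_PER_M = 0.60   # USD per 1M input tokens, from the model card
OUTPUT_PRICE_PER_M = 2.20  # USD per 1M output tokens, from the model card


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# 10,000 input tokens and 2,000 output tokens:
# 10,000 * 0.60 / 1M + 2,000 * 2.20 / 1M = 0.0060 + 0.0044
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0104
```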
GLM 4.5 supports function calling and reasoning.
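As an illustration of function calling, the sketch below builds a request in the OpenAI-style `tools` format and dispatches a returned tool call to a local function. It assumes the haimaker.ai endpoint passes the `tools` parameter through unchanged for this model. The `get_weather` tool and the helper names are hypothetical and exist only for this example.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical local tool; returns canned text instead of real data."""
    return f"Sunny in {city}"


# Tool schema in the OpenAI Chat Completions "tools" format.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

# Maps tool names the model may request to local Python callables.
LOCAL_TOOLS = {"get_weather": get_weather}


def build_tool_request(prompt: str) -> dict:
    """Build keyword arguments for client.chat.completions.create."""
    return {
        "model": "z-ai/glm-4.5",
        "messages": [{"role": "user", "content": prompt}],
        "tools": TOOLS,
    }


def run_tool_call(name: str, arguments_json: str) -> str:
    """Decode the model's JSON arguments and call the matching local tool."""
    args = json.loads(arguments_json)
    return LOCAL_TOOLS[name](**args)


# Usage with the client from the quick-start snippet (not executed here):
# response = client.chat.completions.create(**build_tool_request("Weather in Paris?"))
# for call in response.choices[0].message.tool_calls or []:
#     print(run_tool_call(call.function.name, call.function.arguments))
```

Keeping the dispatch table separate from the schema means the model can only trigger functions that are explicitly registered in `LOCAL_TOOLS`.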
Send requests to https://api.haimaker.ai/v1/chat/completions with model "z-ai/glm-4.5" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
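For readers who prefer not to use an SDK, the request described above can be sketched with the Python standard library alone. The helper names are hypothetical, and the response handling assumes the endpoint returns the usual OpenAI-style JSON body with a `choices` list.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_http_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build a POST request with Bearer authentication and a JSON body."""
    body = {
        "model": "z-ai/glm-4.5",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response (performs network I/O)."""
    with urllib.request.urlopen(request, timeout=60) as resp:
        return json.load(resp)


# Usage (needs a real key from https://app.haimaker.ai; not executed here):
# reply = send(build_http_request("YOUR_API_KEY", "Hello, how are you?"))
# print(reply["choices"][0]["message"]["content"])
```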