
Glm 5v Turbo

z-ai/glm-5v-turbo
Chat
Z Ai
Function Calling, Vision, Reasoning

Glm 5v Turbo (z-ai/glm-5v-turbo) is an AI model from Z Ai with a 202,752-token context window and 131,072 max output tokens, priced at $1.20 per 1M input tokens and $4.00 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.

Context Window: 203K tokens
Max Output: 131K tokens
Input Price: $1.20 / 1M tokens
Output Price: $4.00 / 1M tokens

Overview

Glm 5v Turbo is a model in Zhipu AI's GLM (General Language Model) family with reasoning, vision, and function calling capabilities.

Features & Capabilities

Mode: chat
Context Window: 202,752 tokens
Max Output: 131,072 tokens
Function Calling: Supported
Vision: Supported
Reasoning: Supported
Web Search: -
URL Context: -

API Usage

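from openai import OpenAI

# Point the OpenAI SDK at the haimaker.ai endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your haimaker.ai API key
)

# Send a single-turn chat request to Glm 5v Turbo.
response = client.chat.completions.create(
    model="z-ai/glm-5v-turbo",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# Print the text of the first completion choice.
print(response.choices[0].message.content)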

Frequently Asked Questions

What is the context window of Glm 5v Turbo?

Glm 5v Turbo (z-ai/glm-5v-turbo) has a 202,752-token context window and supports up to 131,072 output tokens per request.
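As a rough illustration of how these two limits interact, the sketch below clamps a requested output length so it respects both. It assumes that prompt and output tokens share the 202,752-token context window, which the listing does not state explicitly; the helper name `clamp_max_tokens` is hypothetical, and the prompt token count must come from your own tokenizer estimate.

```python
CONTEXT_WINDOW = 202_752  # tokens, from the listing
MAX_OUTPUT = 131_072      # tokens, from the listing


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return an output budget that fits both limits.

    Assumption: prompt and output tokens count against the same context window.
    """
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt alone exceeds the context window")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return min(requested, MAX_OUTPUT, remaining)


# A short prompt is limited by the max-output cap,
# a long prompt by the space left in the context window.
print(clamp_max_tokens(1_000, 200_000))
print(clamp_max_tokens(100_000, 200_000))
```

The resulting value could be passed as `max_tokens` in a chat completion request.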

How much does Glm 5v Turbo cost?

Glm 5v Turbo is priced at $1.20 per 1M input tokens and $4.00 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
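To make the pricing concrete, here is a minimal cost estimate using the listed per-token rates. The function name `estimate_cost` is illustrative only, and the sketch assumes no other fees or discounts apply.

```python
INPUT_PRICE_PER_M = 1.20   # USD per 1M input tokens (from the listing)
OUTPUT_PRICE_PER_M = 4.00  # USD per 1M output tokens (from the listing)


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at list prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a request with 10,000 input tokens and 2,000 output tokens.
print(f"${estimate_cost(10_000, 2_000):.4f}")
```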

What features does Glm 5v Turbo support?

Glm 5v Turbo supports function calling, vision, and reasoning.
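For function calling, a request would typically carry a `tools` list. The sketch below builds such a payload, assuming haimaker.ai accepts the standard OpenAI `tools` schema for this model (the listing only says function calling is supported). The `get_weather` tool and the `build_tool_request` helper are hypothetical.

```python
def build_tool_request(prompt: str) -> dict:
    """Build chat-completion arguments that offer one tool to the model."""
    return {
        "model": "z-ai/glm-5v-turbo",
        "messages": [{"role": "user", "content": prompt}],
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "get_weather",  # hypothetical tool
                    "description": "Look up the current weather for a city.",
                    "parameters": {
                        "type": "object",
                        "properties": {"city": {"type": "string"}},
                        "required": ["city"],
                    },
                },
            }
        ],
    }


payload = build_tool_request("What is the weather in Paris?")
print(payload["tools"][0]["function"]["name"])
```

With the OpenAI SDK, the payload could be sent as `client.chat.completions.create(**payload)`; if the model decides to call the tool, the call should appear under `response.choices[0].message.tool_calls`.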

How do I use Glm 5v Turbo via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "z-ai/glm-5v-turbo" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
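Without an SDK, the same call can be sketched as a plain HTTPS request using only the Python standard library. The endpoint, model name, and Bearer authentication come from the answer above; the response shape (`choices[0].message.content`) is assumed to follow the OpenAI chat completions format, and the helper names are illustrative.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for a single-turn chat completion."""
    payload = {
        "model": "z-ai/glm-5v-turbo",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the reply text.

    Assumes an OpenAI-style response body.
    """
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Calling `send(build_request("YOUR_API_KEY", "Hello, how are you?"))` would perform the network request.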
