Haimaker.ai Logo
Qwen logo

Qwen3 Coder Flash

qwen/qwen3-coder-flash
Chat
Qwen|
Function Calling

Qwen3 Coder Flash (qwen/qwen3-coder-flash) is an AI model from Qwen with a 1,000,000-token context window and 65,536 max output tokens, priced at $0.20/1M input and $0.97/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.

Context Window
1M
tokens
Max Output
66K
tokens
Input Price
$0.20
/1M tokens
Output Price
$0.97
/1M tokens

Overview

Qwen3 Coder Flash is a chat model by Qwen. It supports a 1M token context window. Supports function calling.

Features & Capabilities

Modechat
Context Window1,000,000 tokens
Max Output65,536 tokens
Function CallingSupported
Vision-
Reasoning-
Web Search-
Url Context-

API Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3-coder-flash",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

Frequently Asked Questions

What is the context window of Qwen3 Coder Flash?

Qwen3 Coder Flash (qwen/qwen3-coder-flash) has a 1,000,000-token context window and supports up to 65,536 output tokens per request.

How much does Qwen3 Coder Flash cost?

Qwen3 Coder Flash is priced at $0.20 per 1M input tokens and $0.97 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.

What features does Qwen3 Coder Flash support?

Qwen3 Coder Flash supports function calling.

How do I use Qwen3 Coder Flash via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "qwen/qwen3-coder-flash" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.

Use Qwen3 Coder Flash with the haimaker API

OpenAI-compatible endpoint. Start building in minutes.

Get API Access

More from Qwen