
Gemini 2.5 Flash

google/gemini-2.5-flash

Gemini 2.5 Flash (google/gemini-2.5-flash) is an AI model from Google with a 1,048,576-token context window and 8,192 max output tokens, priced at $0.30/1M input and $2.50/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.

Context Window: 1M tokens
Max Output: 8K tokens
Input Price: $0.30 / 1M tokens
Output Price: $2.50 / 1M tokens

Overview

Gemini 2.5 Flash is Google's fast, efficient multimodal model with built-in thinking and long-context support.

Features & Capabilities

Mode: chat
Context Window: 1,048,576 tokens
Max Output: 8,192 tokens
Function Calling: Supported
Vision: Supported
Reasoning: -
Web Search: -
URL Context: -

API Usage

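from openai import OpenAI

# Point the OpenAI SDK at the haimaker.ai OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your haimaker.ai API key
)

# Send a single-turn chat request to Gemini 2.5 Flash.
response = client.chat.completions.create(
    model="google/gemini-2.5-flash",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# Print the text of the first (and, by default, only) completion.
print(response.choices[0].message.content)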

Frequently Asked Questions

What is the context window of Gemini 2.5 Flash?

Gemini 2.5 Flash (google/gemini-2.5-flash) has a 1,048,576-token context window and supports up to 8,192 output tokens per request.
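To make the two limits concrete, here is a minimal sketch of a helper that picks a safe max_tokens value. It assumes the prompt and the reply share the 1,048,576-token context window, which this page does not state explicitly; the function name clamp_max_tokens is hypothetical, and the prompt token count must come from your own tokenizer or estimate.

```python
# Limits as listed on this page for google/gemini-2.5-flash.
CONTEXT_WINDOW = 1_048_576
MAX_OUTPUT_TOKENS = 8_192


def clamp_max_tokens(prompt_tokens: int, requested_output: int) -> int:
    """Return an output budget that respects both listed limits.

    Assumption: prompt and output tokens count against the same context window.
    """
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context window")
    # The output is capped by the request, the per-request maximum,
    # and whatever room the prompt leaves in the window.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)


print(clamp_max_tokens(1_000, 20_000))      # 8192: capped by the max output limit
print(clamp_max_tokens(1_048_000, 8_192))   # 576: capped by the remaining window
```

The returned value could be passed as max_tokens in a chat completion request.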

How much does Gemini 2.5 Flash cost?

Gemini 2.5 Flash is priced at $0.30 per 1M input tokens and $2.50 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
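As a worked example of the pricing arithmetic, the sketch below estimates the cost of a single request from its token counts. The prices are the ones listed on this page; the function name estimate_cost_usd is hypothetical, and real invoices may differ (for example through rounding or any billing rules not described here).

```python
# Prices as listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed per-token prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 100,000-token prompt with a 2,000-token reply.
cost = estimate_cost_usd(100_000, 2_000)
print(f"${cost:.4f}")  # $0.0350
```

In this example the input accounts for $0.03 and the output for $0.005, so long prompts dominate the cost even though output tokens are priced higher per token.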

What features does Gemini 2.5 Flash support?

Gemini 2.5 Flash supports function calling and vision.
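The sketch below shows what a request combining both features might look like. It assumes the haimaker.ai endpoint accepts the standard OpenAI chat-completions shapes for image input (image_url content parts) and tools, which this page implies but does not spell out. The get_weather tool, the helper name build_vision_tool_payload, and the image URL are hypothetical.

```python
def build_vision_tool_payload(question: str, image_url: str) -> dict:
    """Build an OpenAI-style chat payload with one image and one callable tool."""
    return {
        "model": "google/gemini-2.5-flash",
        "messages": [
            {
                "role": "user",
                # Vision: mix text and image parts in a single user message.
                "content": [
                    {"type": "text", "text": question},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        # Function calling: describe a tool the model may ask to invoke.
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "get_weather",  # hypothetical tool, for illustration
                    "description": "Look up the current weather for a city.",
                    "parameters": {
                        "type": "object",
                        "properties": {"city": {"type": "string"}},
                        "required": ["city"],
                    },
                },
            }
        ],
    }


payload = build_vision_tool_payload(
    "Which city is shown here, and what is the weather there?",
    "https://example.com/skyline.jpg",  # placeholder image URL
)
print(payload["tools"][0]["function"]["name"])
```

With the OpenAI SDK, the payload could be sent as client.chat.completions.create(**payload). If the model decides to call the tool, the request would be expected under response.choices[0].message.tool_calls.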

How do I use Gemini 2.5 Flash via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "google/gemini-2.5-flash" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
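For readers not using an SDK, here is a minimal sketch of the same request built with only the Python standard library. The endpoint, model ID, and Bearer scheme come from this page; the helper name build_chat_request is hypothetical, and the response shape in the commented-out section assumes the standard OpenAI chat-completions format.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a single-turn chat."""
    body = {
        "model": "google/gemini-2.5-flash",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # key from https://app.haimaker.ai
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_chat_request("YOUR_API_KEY", "Hello, how are you?")

# Sending requires a valid key and network access:
# with urllib.request.urlopen(req) as resp:
#     reply = json.load(resp)
#     print(reply["choices"][0]["message"]["content"])
```

Building the request separately from sending it makes the headers and body easy to inspect before any tokens are billed.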
