google/gemini-2.5-flash-lite-preview-09-2025Gemini 2.5 Flash Lite Preview 09 2025 (google/gemini-2.5-flash-lite-preview-09-2025) is an AI model from Google with a 1,048,576-token context window and 65,535 max output tokens, priced at $0.10/1M input and $0.40/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.
Google's Gemini 2.5 Flash, a fast and efficient multimodal model with built-in thinking and long context support.
| Mode | chat |
| Context Window | 1,048,576 tokens |
| Max Output | 65,535 tokens |
| Function Calling | Supported |
| Vision | Supported |
| Reasoning | Supported |
| Web Search | Supported |
| Url Context | Supported |
from openai import OpenAI
client = OpenAI(
base_url="https://api.haimaker.ai/v1",
api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
model="google/gemini-2.5-flash-lite-preview-09-2025",
messages=[
{"role": "user", "content": "Hello, how are you?"}
],
)
print(response.choices[0].message.content)Gemini 2.5 Flash Lite Preview 09 2025 (google/gemini-2.5-flash-lite-preview-09-2025) has a 1,048,576-token context window and supports up to 65,535 output tokens per request.
Gemini 2.5 Flash Lite Preview 09 2025 is priced at $0.10 per 1M input tokens and $0.40 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
Gemini 2.5 Flash Lite Preview 09 2025 supports function calling, vision, reasoning, web search, url context.
Send requests to https://api.haimaker.ai/v1/chat/completions with model "google/gemini-2.5-flash-lite-preview-09-2025" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
OpenAI-compatible endpoint. Start building in minutes.