Virtuoso Large

arcee-ai/virtuoso-large
Chat · other
Arcee Ai | Function Calling | Released Jun 2025 · Updated Aug 2025

Virtuoso Large (arcee-ai/virtuoso-large) is a 72.7B-parameter model from Arcee Ai, built on the qwen2 architecture, with a 131,072-token context window and up to 64,000 output tokens. It is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens, and is available via the haimaker.ai OpenAI-compatible API.

  • Parameters: 72.7B
  • Context Window: 131K tokens
  • Max Output: 64K tokens
  • Input Price: $0.75/1M tokens
  • Output Price: $1.20/1M tokens

Overview

Virtuoso-Large (72B) is our most powerful and versatile general-purpose model, designed to excel at handling complex and varied tasks across domains. With state-of-the-art performance, it offers unparalleled capability for nuanced understanding, contextual adaptability, and high accuracy.

Model Card

Model Details

  • Architecture Base: Qwen2.5-72B
  • Parameter Count: 72B
  • License: Qwen's Tongyi License

Use Cases

  • Advanced content creation, such as technical writing and creative storytelling
  • Data summarization and report generation for cross-functional domains
  • Detailed knowledge synthesis and deep-dive insights from diverse datasets
  • Multilingual support for international operations and communications

Quantizations

Quantized versions of the model are available in GGUF format.

License

Virtuoso-Large (72B) is released under the Qwen license. You are free to use, modify, and distribute this model in both commercial and non-commercial applications, subject to the terms and conditions of that license.

If you have questions or would like to share your experiences using Virtuoso-Large (72B), please feel free to connect with us on social media. We’re excited to see what you build—and how this model helps you innovate!

Features & Capabilities

  • Mode: chat
  • Context Window: 131,072 tokens
  • Max Output: 64,000 tokens
  • Function Calling: Supported
  • Vision: -
  • Reasoning: -
  • Web Search: -
  • Url Context: -

Technical Details

  • Architecture: Qwen2ForCausalLM
  • Model Type: qwen2
  • Base Model: Qwen/Qwen2.5-72B
  • Library: transformers

API Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="arcee-ai/virtuoso-large",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)
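
Since the model is listed as supporting function calling, the request above can be extended with a `tools` list. The following is a minimal sketch rather than an official example: the `get_weather` tool, its schema, and the `ask_with_tools` helper are hypothetical names introduced for illustration, and it assumes the haimaker.ai endpoint accepts the standard OpenAI `tools` parameter and returns `tool_calls` in the usual shape.

```python
import json


def get_weather(city: str) -> dict:
    """Hypothetical local tool; returns canned data for illustration only."""
    return {"city": city, "forecast": "sunny", "temp_c": 21}


# Tool description in the OpenAI "tools" format (assumed to be accepted as-is).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Maps tool names the model may request to local Python callables.
LOCAL_FUNCTIONS = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Run the requested tool with its JSON-encoded arguments; return JSON text."""
    result = LOCAL_FUNCTIONS[name](**json.loads(arguments_json))
    return json.dumps(result)


def ask_with_tools(question: str, api_key: str) -> str:
    """One round trip: let the model request tools, then answer with the results."""
    # Imported here so the helpers above can be used without the SDK installed.
    from openai import OpenAI

    client = OpenAI(base_url="https://api.haimaker.ai/v1", api_key=api_key)
    messages = [{"role": "user", "content": question}]
    first = client.chat.completions.create(
        model="arcee-ai/virtuoso-large", messages=messages, tools=TOOLS
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content or ""  # the model answered without using a tool

    messages.append(reply)  # keep the assistant's tool request in the history
    for call in reply.tool_calls:
        messages.append(
            {
                "role": "tool",
                "tool_call_id": call.id,
                "content": run_tool_call(call.function.name, call.function.arguments),
            }
        )
    final = client.chat.completions.create(
        model="arcee-ai/virtuoso-large", messages=messages, tools=TOOLS
    )
    return final.choices[0].message.content or ""
```

Calling `ask_with_tools("What is the weather in Paris?", "YOUR_API_KEY")` would send the question, execute any requested tool locally, and return the model's final text.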

Frequently Asked Questions

What is the context window of Virtuoso Large?

Virtuoso Large (arcee-ai/virtuoso-large) has a 131,072-token context window and supports up to 64,000 output tokens per request.
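
To make these two limits concrete, here is a small sketch of how a client might pick a safe output-token limit before sending a request. It assumes that prompt and output tokens share the 131,072-token window, which is common for chat models but is not stated on this page; `output_budget` is a hypothetical helper, not part of any SDK.

```python
CONTEXT_WINDOW = 131_072  # total tokens, from the listing above
MAX_OUTPUT = 64_000       # per-request output cap, from the listing above


def output_budget(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return the largest output-token limit that respects both caps.

    Assumption: prompt and output tokens count against the same window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested, MAX_OUTPUT, remaining)
```

The result could be passed as `max_tokens` in the chat completion request. With a short prompt the 64,000-token cap is the binding limit; with a very long prompt the remaining window is.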

How much does Virtuoso Large cost?

Virtuoso Large is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
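
As a worked example of this pricing, the sketch below estimates the cost of a request from its token counts. The rates are the ones quoted above; `estimate_cost` is a hypothetical helper, and actual billing may differ (for example through rounding or minimum charges, which this page does not describe).

```python
from decimal import Decimal

INPUT_PRICE_PER_M = Decimal("0.75")   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = Decimal("1.20")  # USD per 1M output tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    million = Decimal(1_000_000)
    return (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / million
```

For instance, a request with 10,000 input tokens and 2,000 output tokens comes to 10,000 × $0.75/1M + 2,000 × $1.20/1M = $0.0075 + $0.0024 = $0.0099.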

What features does Virtuoso Large support?

Virtuoso Large supports function calling.

How do I use Virtuoso Large via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "arcee-ai/virtuoso-large" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
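
For clients that do not use an SDK, the same call can be made as a plain HTTPS POST. The sketch below uses only the Python standard library; `build_request` and `send` are hypothetical helper names, and the response parsing assumes the standard OpenAI chat-completions response shape.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completions POST with Bearer auth."""
    payload = {
        "model": "arcee-ai/virtuoso-large",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the assistant's reply text.

    Assumes an OpenAI-style response body with a `choices` list.
    """
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

A call such as `send(build_request("YOUR_API_KEY", "Hello, how are you?"))` would perform the request; keeping construction separate from sending makes the headers and payload easy to inspect or log first.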
