Virtuoso Large

arcee-ai/virtuoso-large

Chat · Other · Arcee AI · Function Calling · Released Jun 2025 · Updated Aug 2025

Virtuoso Large (arcee-ai/virtuoso-large) is a 72.7B-parameter model from Arcee AI, built on the qwen2 architecture, with a 131,072-token context window and a maximum output of 64,000 tokens. It is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens and is available via the haimaker.ai OpenAI-compatible API.

Parameters	72.7B
Context Window	131K tokens
Max Output	64K tokens
Input Price	$0.75 / 1M tokens
Output Price	$1.20 / 1M tokens

Overview

Virtuoso-Large (72B) is our most powerful and versatile general-purpose model, designed for complex and varied tasks across domains. It is built for nuanced understanding, contextual adaptability, and high accuracy.

Model Card

Model Details

Architecture Base: Qwen2.5-72B
Parameter Count: 72B
License: Qwen's Tongyi License

Use Cases

Advanced content creation, such as technical writing and creative storytelling
Data summarization and report generation for cross-functional domains
Detailed knowledge synthesis and deep-dive insights from diverse datasets
Multilingual support for international operations and communications

Quantizations

Available in GGUF format.

License

Virtuoso-Large (72B) is released under the Qwen license. You are free to use, modify, and distribute this model in both commercial and non-commercial applications, subject to the terms and conditions of the license.

If you have questions or would like to share your experiences using Virtuoso-Large (72B), please feel free to connect with us on social media. We’re excited to see what you build and how this model helps you innovate!

Features & Capabilities

Mode	chat
Context Window	131,072 tokens
Max Output	64,000 tokens
Function Calling	Supported
Vision	Not supported
Reasoning	Not supported
Web Search	Not supported
URL Context	Not supported

Technical Details

Architecture	Qwen2ForCausalLM
Model Type	qwen2
Base Model	Qwen/Qwen2.5-72B
Library	transformers

API Usage

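from openai import OpenAI

# Point the OpenAI SDK at the haimaker.ai OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your key from https://app.haimaker.ai
)

# Send a single-turn chat request to Virtuoso Large.
response = client.chat.completions.create(
    model="arcee-ai/virtuoso-large",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# Print the text of the first (and here, only) completion.
print(response.choices[0].message.content)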

Frequently Asked Questions

What is the context window of Virtuoso Large?

Virtuoso Large (arcee-ai/virtuoso-large) has a 131,072-token context window and supports up to 64,000 output tokens per request.
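The two limits above can be combined into a simple budget check before sending a request. The sketch below assumes that prompt and output tokens share the 131,072-token window, which is the usual convention but is not stated on this page; the helper name `output_budget` is hypothetical.

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page
MAX_OUTPUT = 64_000       # tokens, as listed on this page


def output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return the largest output-token limit that fits the listed limits.

    Assumption: prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Cap by the request, the per-request output limit, and the space left.
    return min(requested_output, MAX_OUTPUT, remaining)


# A 100,000-token prompt leaves 31,072 tokens of room for output.
print(output_budget(100_000, 64_000))  # 31072
```

The returned value could be passed as `max_tokens` in a chat completion request.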

How much does Virtuoso Large cost?

Virtuoso Large is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
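As a rough illustration of these rates, the sketch below estimates the cost of a single request from its token counts. The function name `estimate_cost_usd` is hypothetical, and the calculation assumes billing is strictly linear in tokens with no minimums or other fees.

```python
# Prices listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.75
OUTPUT_PRICE_PER_M = 1.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 10,000-token prompt with a 2,000-token reply.
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # $0.0099
```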

What features does Virtuoso Large support?

Virtuoso Large supports function calling. It does not support vision, reasoning, web search, or URL context.
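A function-calling request might look like the sketch below, which builds the request arguments in the standard OpenAI `tools` format without sending them. The `get_weather` tool and the helper names are hypothetical, and it is an assumption that the haimaker.ai endpoint accepts `tool_choice` in the same way as the OpenAI API.

```python
import json

# Hypothetical tool definition in the OpenAI "tools" format.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_request(user_message: str) -> dict:
    """Build keyword arguments for client.chat.completions.create."""
    return {
        "model": "arcee-ai/virtuoso-large",
        "messages": [{"role": "user", "content": user_message}],
        "tools": [get_weather_tool],
        "tool_choice": "auto",  # assumption: let the model decide whether to call
    }


def parse_tool_arguments(arguments_json: str) -> dict:
    """Decode the JSON string carried in a tool call's function.arguments."""
    return json.loads(arguments_json)


request_kwargs = build_request("What's the weather in Lisbon?")

# With an OpenAI SDK client configured for haimaker.ai:
# response = client.chat.completions.create(**request_kwargs)
# for call in response.choices[0].message.tool_calls or []:
#     args = parse_tool_arguments(call.function.arguments)
```

The application is responsible for running the named function with the parsed arguments and returning its result to the model in a follow-up message.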

How do I use Virtuoso Large via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "arcee-ai/virtuoso-large" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
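For callers who prefer not to use an SDK, the request described above can be sketched with Python's standard library alone. The helper name `build_chat_request` is hypothetical, and the response shape in the commented-out section assumes the endpoint mirrors the OpenAI chat completions format.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a single-turn chat."""
    body = {
        "model": "arcee-ai/virtuoso-large",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # Bearer API key auth
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_chat_request("YOUR_API_KEY", "Hello, how are you?")

# To send it (requires a valid key and network access):
# with urllib.request.urlopen(req) as resp:
#     reply = json.load(resp)
#     print(reply["choices"][0]["message"]["content"])
```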