Virtuoso Large
arcee-ai/virtuoso-large
Virtuoso Large (arcee-ai/virtuoso-large) is a 72.7B-parameter model from Arcee AI, built on the Qwen2 architecture, with a 131,072-token context window and up to 64,000 output tokens per request. It is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens and is available via the haimaker.ai OpenAI-compatible API.
Overview
Virtuoso-Large (72B) is our most powerful and versatile general-purpose model, designed to excel at handling complex and varied tasks across domains. With state-of-the-art performance, it offers unparalleled capability for nuanced understanding, contextual adaptability, and high accuracy.
Model Card
Model Details
- Architecture Base: Qwen2.5-72B
- Parameter Count: 72B
- License: Qwen's Tongyi License
Use Cases
- Advanced content creation, such as technical writing and creative storytelling
- Data summarization and report generation for cross-functional domains
- Detailed knowledge synthesis and deep-dive insights from diverse datasets
- Multilingual support for international operations and communications
Quantizations
GGUF format available here
License
Virtuoso-Large (72B) is released under the Qwen license. You are free to use, modify, and distribute this model in both commercial and non-commercial applications, subject to the terms and conditions of the license.
If you have questions or would like to share your experiences using Virtuoso-Large (72B), please feel free to connect with us on social media. We're excited to see what you build and how this model helps you innovate!
Features & Capabilities
| Mode | chat |
| Context Window | 131,072 tokens |
| Max Output | 64,000 tokens |
| Function Calling | Supported |
| Vision | Not supported |
| Reasoning | Not supported |
| Web Search | Not supported |
| URL Context | Not supported |
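Since the table lists function calling as supported, a request can presumably carry tool definitions in the OpenAI Chat Completions `tools` format. The following is a minimal sketch under that assumption: it only builds the request payload and decodes tool calls from a response-shaped dictionary, without contacting the API. The `get_weather` tool, its schema, and the helper names are hypothetical, and whether haimaker.ai passes `tools` through unchanged should be checked against its documentation.

```python
import json

# Hypothetical tool definition in the OpenAI Chat Completions "tools" format.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",  # illustrative name, not part of any real API
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_tool_request(user_message: str) -> dict:
    """Build a chat-completions payload that offers one tool to the model."""
    return {
        "model": "arcee-ai/virtuoso-large",
        "messages": [{"role": "user", "content": user_message}],
        "tools": [WEATHER_TOOL],
    }


def extract_tool_calls(response: dict) -> list[tuple[str, dict]]:
    """Return (function name, parsed arguments) pairs from a response dict."""
    message = response["choices"][0]["message"]
    calls = []
    for call in message.get("tool_calls") or []:
        function = call["function"]
        # In the OpenAI format, arguments arrive as a JSON-encoded string.
        calls.append((function["name"], json.loads(function["arguments"])))
    return calls
```

The payload from `build_tool_request` can be passed as keyword arguments to an OpenAI-compatible client, and `extract_tool_calls` applied to the response after converting it to a plain dictionary.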
Technical Details
| Architecture | Qwen2ForCausalLM |
| Model Type | qwen2 |
| Base Model | Qwen/Qwen2.5-72B |
| Library | transformers |
API Usage
from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
    model="arcee-ai/virtuoso-large",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)
print(response.choices[0].message.content)

Frequently Asked Questions
What is the context window of Virtuoso Large?
Virtuoso Large (arcee-ai/virtuoso-large) has a 131,072-token context window and supports up to 64,000 output tokens per request.
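An output budget can be checked against these two limits before a request is sent. The sketch below assumes that prompt and output tokens share the 131,072-token window (a common convention, but not stated on this page) and that the prompt's token count is already known. `clamp_max_tokens` is a hypothetical helper, not part of any SDK.

```python
CONTEXT_WINDOW = 131_072  # tokens, as stated for this model
MAX_OUTPUT = 64_000       # tokens, as stated for this model


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return the largest output budget that respects both limits.

    Assumes prompt and output tokens share one context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested, MAX_OUTPUT, remaining)
```

Under these assumptions, a 100,000-token prompt leaves room for at most 31,072 output tokens, even though the per-request cap is 64,000.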
How much does Virtuoso Large cost?
Virtuoso Large is priced at $0.75 per 1M input tokens and $1.20 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
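The per-request cost follows directly from these two rates. A minimal sketch of the arithmetic, assuming billing is strictly linear in token counts with no minimums, caching discounts, or rounding rules (none of which this page specifies); `estimate_cost_usd` is a hypothetical helper name.

```python
from decimal import Decimal

INPUT_USD_PER_MTOK = Decimal("0.75")   # USD per 1M input tokens
OUTPUT_USD_PER_MTOK = Decimal("1.20")  # USD per 1M output tokens
MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD, assuming linear pricing."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / MILLION
```

For example, a request with 20,000 input tokens and 1,500 output tokens comes to (20,000 × 0.75 + 1,500 × 1.20) / 1,000,000 = $0.0168.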
What features does Virtuoso Large support?
Virtuoso Large supports function calling. Vision, reasoning, web search, and URL context are not supported.
How do I use Virtuoso Large via API?
Send requests to https://api.haimaker.ai/v1/chat/completions with model "arcee-ai/virtuoso-large" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
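For environments without an SDK, the same call can be made as a plain HTTP POST. The sketch below uses only the Python standard library and assumes the endpoint accepts and returns the standard OpenAI chat-completions JSON shape. `build_request` and `send` are hypothetical helper names, and `send` requires network access and a valid key.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build a POST request with Bearer authentication and a JSON body."""
    body = json.dumps({
        "model": "arcee-ai/virtuoso-large",
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the reply text.

    Assumes an OpenAI-style response: choices[0].message.content.
    """
    with urllib.request.urlopen(request, timeout=60) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]
```

Calling `send(build_request("YOUR_API_KEY", "Hello, how are you?"))` would then return the model's reply as a string.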