Aion RP Llama 3.1 8B

aion-labs/aion-rp-llama-3.1-8b

Chat · apache-2.0

Aion Labs · Released Nov 2024 · Updated Nov 2024

Aion RP Llama 3.1 8B (aion-labs/aion-rp-llama-3.1-8b) is an 8.0B-parameter Llama-architecture model from Aion Labs with a 32,768-token context window and up to 32,768 output tokens, priced at $0.80 per 1M input tokens and $1.60 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.

Parameters	8.0B
Context Window	32,768 tokens
Max Output	32,768 tokens
Input Price	$0.80 / 1M tokens
Output Price	$1.60 / 1M tokens

Overview

This is a completely uncensored model trained for roleplaying and creative writing. According to the model card, it can use the full 131K context; this listing serves it with a 32,768-token context window.

Model Card

Prompt Format

The model uses the following prompt format:

__SYSTEM__: system message here
__USER__: user message here
__ASSISTANT__: assistant message here

The model supports multi-turn conversations. A system message is not required, but detailed instructions in a system message can improve performance.
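The tagged format above can be produced from OpenAI-style message dictionaries. The following is a minimal sketch, not the model's official template: the helper name `format_prompt`, the single-newline separator between turns, and the trailing open assistant tag are all assumptions based on the three-line example.

```python
# Role tags as given in the prompt format above.
ROLE_TAGS = {
    "system": "__SYSTEM__",
    "user": "__USER__",
    "assistant": "__ASSISTANT__",
}


def format_prompt(messages, add_generation_tag=True):
    """Render OpenAI-style messages into the tagged prompt format.

    Assumption: turns are separated by a single newline, as in the
    three-line example in the model card.
    """
    lines = []
    for message in messages:
        tag = ROLE_TAGS[message["role"]]  # raises KeyError on unknown roles
        lines.append(f"{tag}: {message['content']}")
    if add_generation_tag:
        # Assumption: ending on an open assistant tag cues the model's reply.
        lines.append(f"{ROLE_TAGS['assistant']}:")
    return "\n".join(lines)


prompt = format_prompt([
    {"role": "system", "content": "You are a terse narrator."},
    {"role": "user", "content": "Describe the tavern."},
])
print(prompt)
```

Manual formatting like this is likely only needed when sending raw text to the model; a chat-style endpoint would normally apply the template itself.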

Important: the recommended default temperature for this model is 0.7. Setting the temperature to 1.0 can sometimes result in nonsensical output.
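One way to make the recommended temperature the default is to assemble the request arguments in a small helper. This is a sketch only: `build_chat_request` is a hypothetical name, and the warning at 1.0 and above is an illustrative guard, not behavior of the API.

```python
import warnings

RECOMMENDED_TEMPERATURE = 0.7  # default recommended by the model card


def build_chat_request(user_message, system_message=None,
                       temperature=RECOMMENDED_TEMPERATURE):
    """Assemble keyword arguments for a chat completions call."""
    if temperature >= 1.0:
        # The model card notes that 1.0 can sometimes give nonsensical output.
        warnings.warn("temperature >= 1.0 may produce nonsensical output")
    messages = []
    if system_message:
        # A system message is optional but can improve results.
        messages.append({"role": "system", "content": system_message})
    messages.append({"role": "user", "content": user_message})
    return {
        "model": "aion-labs/aion-rp-llama-3.1-8b",
        "messages": messages,
        "temperature": temperature,
    }


request_kwargs = build_chat_request(
    "Continue the scene in the tavern.",
    system_message="You are the narrator of a fantasy story.",
)
print(request_kwargs["temperature"])
```

The resulting dictionary can be unpacked into an OpenAI-compatible client call, for example `client.chat.completions.create(**request_kwargs)`.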

Features & Capabilities

Mode	chat
Context Window	32,768 tokens
Max Output	32,768 tokens
Function Calling	-
Vision	-
Reasoning	-
Web Search	-
URL Context	-

Technical Details

Architecture	LlamaForCausalLM
Model Type	llama
Base Model	meta-llama/Llama-3.1-8B

API Usage

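from openai import OpenAI

# Point the OpenAI SDK at the haimaker.ai OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",  # replace with your haimaker.ai API key
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="aion-labs/aion-rp-llama-3.1-8b",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)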

Frequently Asked Questions

What is the context window of Aion RP Llama 3.1 8B?

Aion RP Llama 3.1 8B (aion-labs/aion-rp-llama-3.1-8b) has a 32,768-token context window and supports up to 32,768 output tokens per request.

How much does Aion RP Llama 3.1 8B cost?

Aion RP Llama 3.1 8B is priced at $0.80 per 1M input tokens and $1.60 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
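As a worked example of these rates, a request with 2,000 input tokens and 500 output tokens would cost 2,000 × $0.80 / 1M + 500 × $1.60 / 1M = $0.0016 + $0.0008 = $0.0024. The sketch below does the same arithmetic; the function name `estimate_cost_usd` is hypothetical, and it assumes billing is strictly proportional to token counts with no minimums or rounding.

```python
from decimal import Decimal

INPUT_PRICE_PER_M = Decimal("0.80")   # USD per 1M input tokens (from this page)
OUTPUT_PRICE_PER_M = Decimal("1.60")  # USD per 1M output tokens (from this page)
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request in USD.

    Assumption: cost scales linearly with token counts.
    """
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost


cost = estimate_cost_usd(2_000, 500)
print(f"${cost}")
```

`Decimal` is used instead of floats so that small per-request amounts add up without binary rounding error.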

How do I use Aion RP Llama 3.1 8B via API?

Send requests to https://api.haimaker.ai/v1/chat/completions with model "aion-labs/aion-rp-llama-3.1-8b" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
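For callers who prefer not to use an SDK, the same request can be built with the Python standard library. This is a minimal sketch that assumes the endpoint accepts the standard OpenAI chat completions JSON body, which the "OpenAI-compatible" description suggests but this page does not spell out; `build_http_request` is a hypothetical helper name.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_http_request(api_key, user_message):
    """Build (but do not send) a POST request for a single-turn chat."""
    payload = {
        "model": "aion-labs/aion-rp-llama-3.1-8b",
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # Bearer API key auth
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_http_request("YOUR_API_KEY", "Hello, how are you?")

# To actually send it (requires a valid key and network access):
# with urllib.request.urlopen(request) as response:
#     body = json.load(response)
#     print(body["choices"][0]["message"]["content"])
```

The send step is left commented out so the example runs without credentials; the response parsing shown there also assumes the OpenAI response shape.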
