sao10k/l3.1-70b-hanami-x1
L3.1 70B Hanami x1 (sao10k/l3.1-70b-hanami-x1) is a 70.6B-parameter Llama model from Sao10k with a 16,000-token context window and a 16,000-token maximum output, priced at $3.00 per 1M input tokens and $3.00 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.
Llama-3.1-70B-Hanami-x1
This is an experiment over Euryale v2.2, which I think worked out nicely.
It feels different from it, in a good way. From testing, I prefer it over both 2.1 and 2.2.
As usual, the Euryale v2.1 and v2.2 settings work on it.
A min_p of at least 0.1 is recommended for Llama 3 models.
I like it, so try it out?
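The min_p recommendation above could be applied through the OpenAI Python SDK roughly as follows. This is a minimal sketch, assuming the serving endpoint accepts a non-standard `min_p` field in the request body, which the listing does not confirm. `build_chat_kwargs` is a hypothetical helper name; `extra_body` is the SDK's documented way to pass fields outside the standard schema.

```python
def build_chat_kwargs(prompt, min_p=0.1):
    """Build keyword arguments for client.chat.completions.create().

    Assumption: the backend honours a non-standard `min_p` sampling field.
    """
    if not 0.0 <= min_p <= 1.0:
        raise ValueError("min_p must be between 0 and 1")
    return {
        "model": "sao10k/l3.1-70b-hanami-x1",
        "messages": [{"role": "user", "content": prompt}],
        # extra_body is merged into the JSON request body by the SDK.
        "extra_body": {"min_p": min_p},
    }
```

With a configured client, the call would be `client.chat.completions.create(**build_chat_kwargs("Hello"))`. If the endpoint ignores or rejects unknown fields, the `extra_body` entry would need to be dropped.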
| Property | Value |
| --- | --- |
| Mode | chat |
| Context Window | 16,000 tokens |
| Max Output | 16,000 tokens |
| Function Calling | - |
| Vision | - |
| Reasoning | - |
| Web Search | - |
| URL Context | - |
| Architecture | LlamaForCausalLM |
| Model Type | llama |
| Languages | en |
from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="sao10k/l3.1-70b-hanami-x1",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)

print(response.choices[0].message.content)

L3.1 70B Hanami x1 (sao10k/l3.1-70b-hanami-x1) has a 16,000-token context window and supports up to 16,000 output tokens per request.
L3.1 70B Hanami x1 is priced at $3.00 per 1M input tokens and $3.00 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
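As a worked example of the pricing above, the per-request cost can be estimated from token counts. This is a small sketch using only the listed rates; `estimate_cost_usd` is a hypothetical helper, and actual billing may round or count tokens differently.

```python
# Listed rates in USD per one million tokens.
INPUT_USD_PER_MILLION = 3.00
OUTPUT_USD_PER_MILLION = 3.00


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000
```

For instance, a request with 1,000,000 input tokens and 1,000,000 output tokens would come to $6.00 at these rates.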
Send requests to https://api.haimaker.ai/v1/chat/completions with model "sao10k/l3.1-70b-hanami-x1" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
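For readers not using an SDK, the request described above can be sketched with the Python standard library. The endpoint URL, model name, and Bearer scheme come from the listing; the helper name `build_request` is hypothetical, and the JSON body shape is assumed to follow the usual OpenAI chat completions format.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_request(api_key, prompt):
    """Build (but do not send) a chat completions POST request."""
    payload = {
        "model": "sao10k/l3.1-70b-hanami-x1",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Bearer authentication with the key from app.haimaker.ai.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would send it; building the request separately keeps the key handling and payload easy to inspect before any network call is made.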