thedrummer/cydonia-24b-v4.1Cydonia 24B v4.1 (thedrummer/cydonia-24b-v4.1) is a mistral 23.6B-parameter model from Thedrummer with a 131,072-token context window and 131,072 max output tokens, priced at $0.30/1M input and $0.50/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.
- Mistral v7 Tekken
Cydonia Evolved again.
I have to praise this model for good focus. I said earlier that it still remembers it at 12K. I think my personal evaluation of it has already beaten the rest.
Damn okay this model is actually pretty good. I don't have enough vram to test it on longer chats to 16k, but on 6k chats it's looking good and without deepseek's slop.
Wow, for a 24B this thing has some writing chops. Like it nails mood and nuance and shit with the prose, descriptive but not purple prose. you may have cracked the Cydonias for good with this one. The more I play with it, the more it feels like a level up from the prior ones. Haven't got into long context yet though. My cards tend to favor the opposite or at best neutral. Its rolling with the card, and nailing it, its a bit fallen and its doing good prose to match. Yeah this one's a banger.
Very good. For 24B, the best I've come across. Like even swipes, it stays creative and writes just as well as the swipes before it but doesn't recycle anything from them. It doesn't go overboard on the creativity like Gemma can do, it'll write what you tell it or if RP pick up on things pretty accurately. The prose isn't purple either, it's good.
I dunno how you have broken the spell R1 Cydonia had on me or what made me try this on a whim but you have gold on your hands with this tune. Again.
it really doesn't feel like a mistral tune which is honestly the best compliment I can give it. I'm not getting the usual mistral tuneisms from it.
It's probably the best Cydonia.
Hoping to make SleepDeprived proud with this one. RIP.
config-v4j
| Mode | chat |
| Context Window | 131,072 tokens |
| Max Output | 131,072 tokens |
| Function Calling | - |
| Vision | - |
| Reasoning | - |
| Web Search | - |
| Url Context | - |
| Architecture | MistralForCausalLM |
| Model Type | mistral |
| Base Model | mistralai/Mistral-Small-3.2-24B-Instruct-2507 |
from openai import OpenAI
client = OpenAI(
base_url="https://api.haimaker.ai/v1",
api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
model="thedrummer/cydonia-24b-v4.1",
messages=[
{"role": "user", "content": "Hello, how are you?"}
],
)
print(response.choices[0].message.content)Cydonia 24B v4.1 (thedrummer/cydonia-24b-v4.1) has a 131,072-token context window and supports up to 131,072 output tokens per request.
Cydonia 24B v4.1 is priced at $0.30 per 1M input tokens and $0.50 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
Send requests to https://api.haimaker.ai/v1/chat/completions with model "thedrummer/cydonia-24b-v4.1" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
OpenAI-compatible endpoint. Start building in minutes.