thedrummer/skyfall-36b-v2
Skyfall 36B v2 (thedrummer/skyfall-36b-v2) is a 36.9B-parameter Mistral-architecture model from TheDrummer with a 32,768-token context window and up to 32,768 output tokens, priced at $0.55 per 1M input tokens and $0.80 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.
Skyfall v2 is an upscaled version of Mistral Small 2501 with continued training for creativity and roleplay (RP).
Creativity, good writing style, good instruct, chain of thought capability, mathematics understanding, and solid tool use performance... This model is peak! This will be my new daily model over all the 70Bs I have used.
Skyfall v2 is without a doubt my favorite model I've ever managed to run locally, bar none
skyfall is kinda nuts i am quite impressed
The biggest stand out for me is how good Skyfall handles size differences especially. It actually beats all of the 70b's I have used for descriptions of how the character worked around our size difference.
I played with the Skyfall 3bit model, taking a new character card with which I had not previously RP'd, and damn, it was so alive! The character's speech was conceptually correct, not as dry as 24b, there was a comedy tag and damn I really laughed in places. I really liked it, maybe it was the specific prompt that played great together with Skyfall.
Seriously though, Skyfall is just insanely good for some reason
How did you make skyfall so good

| Mode | chat |
| Context Window | 32,768 tokens |
| Max Output | 32,768 tokens |
| Function Calling | - |
| Vision | - |
| Reasoning | - |
| Web Search | - |
| URL Context | - |
| Architecture | MistralForCausalLM |
| Model Type | mistral |
from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="thedrummer/skyfall-36b-v2",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)
print(response.choices[0].message.content)

Skyfall 36B v2 (thedrummer/skyfall-36b-v2) has a 32,768-token context window and supports up to 32,768 output tokens per request.
Skyfall 36B v2 is priced at $0.55 per 1M input tokens and $0.80 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
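As a rough illustration of what those rates mean per request, the sketch below multiplies token counts by the quoted prices. The helper name `estimate_cost` and the example token counts are hypothetical, and the result is only an estimate: actual billing depends on how the provider counts tokens.

```python
from decimal import Decimal

# Prices quoted above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.55")
OUTPUT_PRICE_PER_M = Decimal("0.80")


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    million = Decimal(1_000_000)
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / million


# Example: a 10,000-token prompt with a 2,000-token reply.
cost = estimate_cost(10_000, 2_000)
print(f"${cost:.4f}")  # prints $0.0071
```

Decimal is used instead of float so that small per-request amounts do not pick up binary rounding noise when summed over many calls.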
Send requests to https://api.haimaker.ai/v1/chat/completions with model "thedrummer/skyfall-36b-v2" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
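For readers who prefer not to install an SDK, the same call can be sketched with only the Python standard library. This is a minimal sketch, not official client code: the helper name `build_request` is hypothetical, and the response shape in the commented-out section is assumed to follow the usual OpenAI chat-completions format, which the page implies but does not spell out.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"
MODEL = "thedrummer/skyfall-36b-v2"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for the chat completions endpoint (hypothetical helper)."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Bearer authentication with the key from https://app.haimaker.ai
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_request("YOUR_API_KEY", "Hello, how are you?")

# Sending is left commented out so the sketch runs without a real key.
# Assumption: the reply mirrors the OpenAI response format.
# with urllib.request.urlopen(request) as resp:
#     body = json.load(resp)
#     print(body["choices"][0]["message"]["content"])
```

Separating request construction from sending makes it easy to inspect the URL, headers, and JSON body before spending any tokens.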