Skyfall 36B v2
thedrummer/skyfall-36b-v2
Skyfall 36B v2 (thedrummer/skyfall-36b-v2) is a 36.9B-parameter Mistral-architecture model from TheDrummer with a 32,768-token context window and a 32,768-token output limit, priced at $0.55 per 1M input tokens and $0.80 per 1M output tokens. It is available via the haimaker.ai OpenAI-compatible API.
Overview
Skyfall v2 is an upscaled version of Mistral Small 2501 with continued training for creativity and RP.
Model Card
Join our Discord! https://discord.gg/Nbv9pQ88Xb
Nearly 4000 members strong 💪 Now with more channels! A hub for users and makers alike!
BeaverAI proudly presents...
Skyfall 36B v2
Special Thanks
- Thank you to each and every one of you who donated and subscribed on Ko-Fi to make our venture a little bit easier.
- I'm also recently unemployed. I am a Software Developer with 8 years of experience in Web, API, AI, and adapting to new tech and requirements. If you're hiring, feel free to reach out to me however.
- To commercial hosters of my models: If you profit off someone's work, kindly consider contributing to the cause rather than turning a blind eye to those who provide value and are in need. A subscription/donation to my Ko-Fi would be greatly appreciated!
Supported Chat Templates
- Mistral v7 Tekken (highly recommended)
- Metharme (not recommended)
- Alpaca (may be interesting, especially for cyoa / story)
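When the model is run locally and prompts are assembled by hand, the recommended template has to be rendered explicitly. The sketch below shows the Mistral v7 Tekken layout as it is commonly described for Mistral Small 2501; that layout is an assumption here and should be checked against the chat template shipped with the tokenizer. The helper name `format_v7_tekken` is hypothetical.

```python
def format_v7_tekken(system, turns):
    """Render a conversation in the assumed Mistral v7 Tekken layout.

    turns is a list of (user_message, assistant_reply) pairs. Pass None as
    the last assistant_reply to leave the prompt open for generation.
    """
    parts = ["<s>"]  # BOS token; some backends add this themselves
    if system:
        parts.append(f"[SYSTEM_PROMPT]{system}[/SYSTEM_PROMPT]")
    for user, assistant in turns:
        parts.append(f"[INST]{user}[/INST]")
        if assistant is not None:
            # Completed assistant turns are closed with the EOS token.
            parts.append(f"{assistant}</s>")
    return "".join(parts)


prompt = format_v7_tekken("You are a narrator.", [("Begin the story.", None)])
print(prompt)
```

Hosted chat endpoints normally apply the template server-side, so this only matters for raw text-completion backends. If the backend already prepends the BOS token, drop the leading `<s>` to avoid doubling it.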
Description
Creativity, good writing style, good instruct, chain of thought capability, mathematics understanding, and solid tool use performance... This model is peak! This will be my new daily model over all the 70Bs I have used.
Skyfall v2 is without a doubt my favorite model I've ever managed to run locally, bar none
skyfall is kinda nuts i am quite impressed
The biggest stand out for me is how good Skyfall handles size differences especially. It actually beats all of the 70b's I have used for descriptions of how the character worked around our size difference.
I played with the Skyfall 3bit model, taking a new character card with which I had not previously RP'd, and damn, it was so alive! The character's speech was conceptually correct, not as dry as 24b, there was a comedy tag and damn I really laughed in places. I really liked it, maybe it was the specific prompt that played great together with Skyfall.
Seriously though, Skyfall is just insanely good for some reason
How did you make skyfall so good
Links
- Original: https://huggingface.co/TheDrummer/Skyfall-36B-v2
- GGUF: https://huggingface.co/TheDrummer/Skyfall-36B-v2-GGUF
- iMatrix (recommended): https://huggingface.co/bartowski/TheDrummer_Skyfall-36B-v2-GGUF

Features & Capabilities
| Mode | chat |
| Context Window | 32,768 tokens |
| Max Output | 32,768 tokens |
| Function Calling | Not supported |
| Vision | Not supported |
| Reasoning | Not supported |
| Web Search | Not supported |
| URL Context | Not supported |
Technical Details
| Architecture | MistralForCausalLM |
| Model Type | mistral |
API Usage
from openai import OpenAI

client = OpenAI(
    base_url="https://api.haimaker.ai/v1",
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="thedrummer/skyfall-36b-v2",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ],
)
print(response.choices[0].message.content)

Frequently Asked Questions
What is the context window of Skyfall 36B v2?
Skyfall 36B v2 (thedrummer/skyfall-36b-v2) has a 32,768-token context window and supports up to 32,768 output tokens per request.
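A small helper can keep a requested completion length within these limits. This is a sketch that assumes the prompt and the completion share the 32,768-token context window, which is typical for chat models but is not stated by the listing; `clamp_max_tokens` is a hypothetical name, and the prompt token count must come from a tokenizer of your choice.

```python
CONTEXT_WINDOW = 32_768  # tokens, as stated in the answer above
MAX_OUTPUT = 32_768      # tokens, as stated in the answer above


def clamp_max_tokens(prompt_tokens, requested):
    """Return the largest usable max_tokens value.

    Assumes prompt and completion share the context window (an assumption,
    not a documented rule of the API).
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = max(CONTEXT_WINDOW - prompt_tokens, 0)
    return min(requested, MAX_OUTPUT, remaining)


print(clamp_max_tokens(30_000, 4_096))  # prints 2768
```

The result can be passed as `max_tokens` in a chat completion request. A result of 0 means the prompt alone already fills the window and should be shortened.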
How much does Skyfall 36B v2 cost?
Skyfall 36B v2 is priced at $0.55 per 1M input tokens and $0.80 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
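To turn these rates into a per-request estimate, multiply each token count by its rate and divide by one million. The sketch below does that with `Decimal` to avoid floating-point rounding; the function name `estimate_cost_usd` is hypothetical, and the token counts in the example are made up for illustration.

```python
from decimal import Decimal

INPUT_USD_PER_MILLION = Decimal("0.55")   # rate quoted above
OUTPUT_USD_PER_MILLION = Decimal("0.80")  # rate quoted above


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request in US dollars."""
    million = Decimal(1_000_000)
    total = (Decimal(input_tokens) * INPUT_USD_PER_MILLION
             + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION)
    return total / million


cost = estimate_cost_usd(20_000, 1_500)
print(f"${cost:.4f}")  # prints $0.0122
```

For a request with 20,000 input tokens and 1,500 output tokens, this works out to (20,000 × 0.55 + 1,500 × 0.80) / 1,000,000 = $0.0122.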
How do I use Skyfall 36B v2 via API?
Send requests to https://api.haimaker.ai/v1/chat/completions with model "thedrummer/skyfall-36b-v2" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
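For environments without an SDK, the same call can be made as a plain HTTP POST. The following is a minimal sketch using only the Python standard library; it assumes the endpoint accepts and returns the usual OpenAI chat-completions JSON shape, and `build_request` and `send` are hypothetical helper names. Building the request is kept separate from sending it so the headers and body can be inspected first.

```python
import json
import urllib.request

API_URL = "https://api.haimaker.ai/v1/chat/completions"


def build_request(api_key, user_message):
    """Build (but do not send) a chat completion request."""
    payload = {
        "model": "thedrummer/skyfall-36b-v2",
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # Bearer API key auth
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the first choice's text.

    Assumes an OpenAI-style response body (choices[0].message.content).
    """
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


req = build_request("YOUR_API_KEY", "Hello, how are you?")
print(req.get_method(), req.full_url)
```

Calling `send(req)` with a real key performs the network request. `urllib` raises `urllib.error.HTTPError` on non-2xx responses, so wrap the call in a `try` block if you need to handle authentication or rate-limit errors.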