nex-agi/deepseek-v3.1-nex-n1DeepSeek V3.1 Nex N1 (nex-agi/deepseek-v3.1-nex-n1) is a deepseek_v3 671.1B-parameter model from Nex Agi with a 131,072-token context window and 163,840 max output tokens, priced at $0.14/1M input and $0.50/1M output tokens. Available via the haimaker.ai OpenAI-compatible API.
Deepseek V3.1 Nex N1 is a chat model by Nex Agi. It has 671.1B parameters. It supports a 131K token context window. Supports function calling.
Nex is a next-generation, full-stack agentic platform that brings foundation models, synthetic data pipelines, RL training, agent frameworks, and deployment tools together in one unified ecosystem.
DeepSeek-V3.1-Nex-N1 is the flagship release of the Nex-N1 series — a post-trained model designed to highlight agent autonomy, tool use, and real-world productivity.
We are committed to making it easier than ever to build and deploy AI agents by offering researchers and entrepreneurs a high-performance, reliable, and cost-effective "out-of-the-box" agent system.
Nex-N1 is evaluated on six representative agentic benchmarks (general + professional). The model consistently ranks at or near the top across tool-using, web-search, and coding-heavy evaluations, showing strong readiness for real-world agent workflows.
Nex-N1 provides various size models from 8B to 671B for different usage scenarios.
| Model | GAIA2 | τ2-Bench | SWE-bench Verified | Terminal-Bench2 | BaxBench | BFCL v4 |
| --- | --- | --- | --- | --- | --- | --- |
| DeepSeek-V3.1-Nex-N1 | 29.5 | 80.2 | 70.6 | 31.8 | 59.7 | 65.3 |
| Qwen3-32B-Nex-N1 | 16.7 | 72.1 | 50.5 | 16.7 | 34.8 | 60.5 |
| Qwen3-30B-A3B-Nex-N1 | 11.3 | 65.3 | 29.7 | 8.3 | 13.6 | 51.9 |
| internlm3-8B-Nex-N1 | 8.6 | 63.0 | 20.3 | - | - | 44.5 |
Nex-N1 demonstrates competitive performance across all evaluation scenarios, showing particularly strong results in practical coding and HTML generation tasks.
Refer to
We recommend sglang for serving Nex-series models locally:
python -m sglang.launch_server --model-path /path/to/your/model
Nex-series models support robust function-calling capabilities. To maximize the function-calling capabilities of the Nex-series models, we modified the tool parser of qwen3_coder, see: --tool-call-parser qwen3_coder flag when launching the server:
python -m sglang.launch_server --model-path /path/to/your/model --tool-call-parser qwen3_coder
Nex-N1 is optimized for mini program development. For optimal performance, we recommend using Claude Code configured with both context7 and a search MCP.
claude mcp add --transport http context7 https://mcp.context7.com/mcp --header "CONTEXT7_API_KEY: [CONTEXT7_API_KEY]"
claude mcp add --transport stdio serper-search --env SERPER_API_KEY=[SERPER_API_KEY] -- npx -y serper-search-scrape-mcp-server
Refer to context7.
| Mode | chat |
| Context Window | 131,072 tokens |
| Max Output | 163,840 tokens |
| Function Calling | Supported |
| Vision | - |
| Reasoning | - |
| Web Search | - |
| Url Context | - |
| Architecture | DeepseekV3ForCausalLM |
| Model Type | deepseek_v3 |
| Library | transformers |
from openai import OpenAI
client = OpenAI(
base_url="https://api.haimaker.ai/v1",
api_key="YOUR_API_KEY",
)
response = client.chat.completions.create(
model="nex-agi/deepseek-v3.1-nex-n1",
messages=[
{"role": "user", "content": "Hello, how are you?"}
],
)
print(response.choices[0].message.content)DeepSeek V3.1 Nex N1 (nex-agi/deepseek-v3.1-nex-n1) has a 131,072-token context window and supports up to 163,840 output tokens per request.
DeepSeek V3.1 Nex N1 is priced at $0.14 per 1M input tokens and $0.50 per 1M output tokens when accessed via the haimaker.ai OpenAI-compatible API.
DeepSeek V3.1 Nex N1 supports function calling.
Send requests to https://api.haimaker.ai/v1/chat/completions with model "nex-agi/deepseek-v3.1-nex-n1" using any OpenAI-compatible SDK. Authentication uses a Bearer API key from https://app.haimaker.ai.
OpenAI-compatible endpoint. Start building in minutes.