Haimaker.ai Logo

Inference Benchmarks

Performance metrics across hardware, software, and model configurations

Back to Home

Early Preview

We are working to certify more internal benchmarks to be published. If you're interested in providing hardware or have questions, email benchmarks@haimaker.ai.

Filters

qwen
Clear
Active Filters:Tag: qwen

Found 5 benchmark suites

DateSuite NameGPUModelOutput TPSInput TPSEnergy Cost
(kWh/MT)
2/19/2026
Tenstorrent Wormhole (32x) - Qwen3-32B
Wormhole
32x 384GB
Qwen3-32B
qwen
1,438.105,640.690.11
11/7/2025
NVIDIA H200 NVL (2x) - qwen3-30b-a3b
NVIDIA H200 NVL
2x 280GB
qwen3-30b-a3b
qwen
6,124.3851,413.770.00
11/5/2025
NVIDIA H200 NVL (2x) - qwen3-coder-30b-a3b-instruct
NVIDIA H200 NVL
2x 280GB
qwen3-coder-30b-a3b-instruct
qwen
5,757.7643,900.390.01
10/24/2025
NVIDIA H20 (8x) - qwen2.5-vl-72b-instruct
NVIDIA H20
8x 760GB
qwen2.5-vl-72b-instruct
qwen
2,266.046,375.820.11
10/24/2025
NVIDIA H20 (8x) - qwen3-coder-30b-a3b-instruct
NVIDIA H20
8x 760GB
qwen3-coder-30b-a3b-instruct
qwen
8,987.3830,699.040.02