Haimaker.ai Logo

Inference Benchmarks

Performance metrics across hardware, software, and model configurations

Back to Home

Early Preview

We are working to certify more internal benchmarks to be published. If you're interested in providing hardware or have questions, email benchmarks@haimaker.ai.

Filters

NVIDIA H20
Clear
Active Filters:Tag: NVIDIA H20

Found 7 benchmark suites

DateSuite NameGPUModelOutput TPSInput TPSEnergy Cost
(kWh/MT)
10/26/2025
NVIDIA H20 (8x) - deepseek-v3.1
NVIDIA H20
8x 760GB
deepseek-v3.1
deepseek-ai
865.084,142.630.16
10/25/2025
NVIDIA H20 (8x) - llama-3.3-70b-instruct (High Throughput)
NVIDIA H20
8x 760GB
llama-3.3-70b-instruct
meta-llama
5,091.047,327.230.10
10/24/2025
NVIDIA H20 (8x) - llama-3.3-70b-instruct
NVIDIA H20
8x 760GB
llama-3.3-70b-instruct
meta-llama
3,370.986,350.240.11
10/24/2025
NVIDIA H20 (8x) - qwen2.5-vl-72b-instruct
NVIDIA H20
8x 760GB
qwen2.5-vl-72b-instruct
qwen
2,266.046,375.820.11
10/24/2025
NVIDIA H20 (8x) - qwen3-coder-30b-a3b-instruct
NVIDIA H20
8x 760GB
qwen3-coder-30b-a3b-instruct
qwen
8,987.3830,699.040.02
10/24/2025
NVIDIA H20 (8x) - mistral-nemo-instruct-2407
NVIDIA H20
8x 760GB
mistral-nemo-instruct-2407
mistralai
12,605.4324,969.580.02
10/24/2025
NVIDIA H20 (8x) - gemma-3-27b-it
NVIDIA H20
8x 760GB
gemma-3-27b-it
google
6,567.8012,358.950.05