NVIDIA H200 NVL (2x) - qwen3-30b-a3b

November 7, 2025 at 01:50 AM

Dataset: reference (v1.0)

Best Performance

Click a metric to highlight the best run in the table below

Best Output TPS
6,124.38
Peak generation speed
Best Input TPS
51,413.77
Peak prefill speed
Best Energy Efficiency
0.00 kWh/MT
Energy cost per 1M tokens
Best TTFT (P95)
80.05 ms
Lowest latency
Best E2E (P95)
1,029.03 ms
Lowest latency

Test Matrix Results

Performance across different input/output token combinations and concurrency levels

Input TokensOutput TokensConcurrencyOutput TPSInput TPSEnergy Cost
(kWh/MT)
TTFT MeanTTFT P95E2E P95Success Rate
Best Run for Output TPS
1285121024x6,124.381,619.440.0321,801.4844,878.0854,102.09100.0%
1281281x91.3087.020.36583.52583.521,401.52100.0%
1281282x175.34171.230.21357.86612.151,424.48100.0%
1281284x193.43191.920.171,190.231,532.292,640.21100.0%
1281288x573.67575.350.08294.52333.601,775.38100.0%
12812816x620.23624.770.091,090.831,607.703,291.78100.0%
12812832x2,030.752,034.720.03365.43421.701,980.77100.0%
12812864x3,045.393,033.480.03391.63455.272,555.71100.0%
128128128x2,584.912,618.960.02515.97466.334,645.84100.0%
1285121x145.2934.620.4283.0183.013,523.96100.0%
1285122x242.3159.160.27191.52192.934,224.10100.0%
1285124x437.79108.590.22144.05167.114,668.53100.0%
1285128x638.19162.470.18183.55201.146,303.05100.0%
12851216x1,128.64288.770.11236.00278.187,111.92100.0%
12851232x1,917.19485.270.07252.10274.248,369.90100.0%
12851264x3,537.10885.820.05299.43358.249,011.70100.0%
128512128x4,207.471,065.920.04681.03480.039,924.89100.0%
128512256x5,160.931,355.650.033,665.938,372.8017,941.22100.0%
128512512x5,538.101,466.770.039,186.7419,399.5028,896.77100.0%
1281,0241x148.5317.700.5187.2187.216,891.73100.0%
1281,0242x270.5133.020.30125.27127.267,564.85100.0%
1281,0244x422.4452.390.27127.51148.709,684.35100.0%
1281,0248x812.21107.510.16171.35182.949,537.79100.0%
1281,02416x1,121.23145.660.14229.80284.2014,120.79100.0%
1281,02432x1,973.20252.730.09269.35308.7416,129.97100.0%
1281,02464x3,663.31471.380.05306.31369.3917,075.42100.0%
1281,024128x3,924.64500.760.041,515.5216,059.4630,348.55100.0%
1281,024256x5,962.24782.640.037,654.8516,624.1235,944.46100.0%
1281,024512x5,878.08771.430.0322,537.8745,301.7864,370.00100.0%
1282,0481x147.118.760.5587.8687.8613,918.64100.0%
1282,0482x278.2616.980.36101.35116.0514,673.75100.0%
1282,0484x496.4931.800.24129.50140.4815,817.64100.0%
1282,0488x584.5643.640.22180.70204.8323,024.68100.0%
1282,04816x1,232.6382.830.13220.17236.0824,853.65100.0%
1282,04832x1,745.28120.360.11599.47854.9834,001.95100.0%
1282,04864x3,185.52216.290.07315.98393.2537,508.12100.0%
1282,048128x4,775.11325.370.04795.16506.8241,881.24100.0%
1282,048256x4,779.97331.570.0418,368.6439,975.4281,514.20100.0%
1282,048512x5,501.68388.290.0450,010.75108,425.07146,593.38100.0%
1282,0481024x5,634.65402.100.0458,659.48120,161.33154,871.7556.6%
5121281x117.22454.210.11190.02190.021,092.12100.0%
5121282x241.51938.680.06139.70192.981,052.04100.0%
5121284x353.841,366.970.07251.92295.251,440.48100.0%
5121288x562.642,170.330.04275.59316.971,808.46100.0%
51212816x963.753,736.350.03309.62360.502,104.37100.0%
51212832x1,710.346,681.040.02381.93453.072,322.49100.0%
51212864x2,957.6911,526.220.01420.31585.782,655.11100.0%
512128128x3,248.0212,630.160.01581.182,553.104,370.43100.0%
512128256x4,161.5317,353.870.01844.132,324.465,174.83100.0%
512128512x4,548.1319,088.770.011,679.113,133.325,818.13100.0%
5121281024x4,393.1720,312.100.014,094.478,631.4011,388.98100.0%
5125121x143.18138.700.3280.0580.053,575.47100.0%
5125122x204.19198.400.2394.86103.504,929.03100.0%
5125124x421.40407.000.14143.06168.114,851.35100.0%
5125128x656.20632.810.11180.79205.386,230.99100.0%
51251216x1,162.171,125.990.08234.27267.837,023.93100.0%
51251232x2,306.362,297.870.04272.34300.116,887.76100.0%
51251264x3,506.833,427.990.03326.76359.239,098.20100.0%
512512128x4,933.274,865.920.02346.08404.159,649.20100.0%
512512256x4,993.625,042.900.023,799.918,247.8017,978.17100.0%
512512512x5,469.175,553.630.0210,009.9019,416.6028,895.73100.0%
5125121024x5,554.695,577.920.0225,698.7058,247.6765,436.90100.0%
5121,0241x138.1566.920.44173.55173.557,411.20100.0%
5121,0242x278.75135.430.2680.3784.757,312.41100.0%
5121,0244x447.21215.960.19204.19252.799,147.75100.0%
5121,0248x639.00308.110.14263.09303.0412,809.59100.0%
5121,02416x1,101.01583.400.11244.83327.6013,587.52100.0%
5121,02432x1,855.75943.990.07363.79460.4816,841.32100.0%
5121,02464x3,577.981,794.280.04382.96446.7117,637.94100.0%
5121,024128x4,728.082,340.720.03704.70499.9419,940.60100.0%
5121,024256x5,151.712,604.340.038,997.2618,963.1339,167.75100.0%
5121,024512x5,548.742,826.220.0324,527.9252,039.2774,252.09100.0%
5121,0241024x5,722.642,922.340.0357,390.37118,995.24139,735.3294.6%
5122,0481x146.3139.720.46191.85191.8512,477.67100.0%
5122,0482x250.7568.130.33133.47178.6314,398.93100.0%
5122,0484x394.19105.040.25188.99266.5418,668.21100.0%
5122,0488x599.27168.180.19324.60363.9023,472.43100.0%
5122,04816x972.65290.950.13334.85368.2627,249.02100.0%
5122,04832x1,773.23502.490.09419.56519.0731,709.22100.0%
5122,04864x3,098.13846.120.06433.08568.0237,541.11100.0%
5122,048128x5,173.691,410.280.04569.53590.9841,142.50100.0%
5122,048256x4,647.681,271.630.0418,464.8743,501.5084,356.32100.0%
1,0241281x124.39947.520.06192.17192.171,029.03100.0%
1,0241282x242.191,848.630.04139.67188.751,045.51100.0%
1,0241284x366.502,798.140.04244.53291.981,390.52100.0%
1,0241288x743.115,684.330.02231.23261.171,367.39100.0%
1,02412816x941.617,223.450.01376.77473.052,153.15100.0%
1,02412832x1,622.5212,461.140.01465.61549.322,488.07100.0%
1,02412864x2,702.7820,842.940.01458.93707.012,887.47100.0%
1,024128128x3,191.3024,536.660.01741.972,918.974,529.83100.0%
1,024128256x3,455.2028,255.890.011,711.014,364.126,397.15100.0%
1,024128512x3,919.6633,237.720.013,058.175,914.639,072.96100.0%
1,0241281024x4,461.5936,953.900.014,988.5611,282.0614,100.32100.0%
1,0245121x137.04260.970.21165.29165.293,735.81100.0%
1,0245122x269.26513.800.12137.38138.763,799.87100.0%
1,0245124x516.78986.370.08195.22224.863,955.41100.0%
1,0245128x653.061,248.880.08232.02299.576,252.01100.0%
1,02451216x1,112.592,133.780.06246.13317.617,340.56100.0%
1,02451232x2,311.414,436.340.03342.46399.267,003.86100.0%
1,02451264x3,408.316,550.450.02390.24567.689,456.00100.0%
1,024512128x3,713.597,153.160.021,254.269,618.9416,841.95100.0%
1,024512256x4,301.368,504.750.025,193.0610,015.4321,849.95100.0%
1,024512512x5,180.6010,267.660.0211,659.1725,085.4234,795.04100.0%
1,0245121024x5,402.4210,752.240.0127,083.0754,730.2964,918.02100.0%
1,0241,0241x148.56141.450.31202.25202.256,892.44100.0%
1,0241,0242x268.59256.260.18230.18232.577,622.23100.0%
1,0241,0244x443.77423.510.15229.98296.129,220.87100.0%
1,0241,0248x671.26641.840.12240.59297.8912,190.18100.0%
1,0241,02416x1,140.471,093.620.09400.57475.7514,333.18100.0%
1,0241,02432x1,951.031,888.190.05452.79577.7916,579.44100.0%
1,0241,02464x3,523.363,453.420.03490.80681.2518,107.45100.0%
1,0241,024128x3,608.773,536.370.031,668.749,540.4328,320.37100.0%
1,0241,024256x4,600.084,552.310.0210,717.2320,415.7441,083.54100.0%
1,0241,024512x5,258.525,268.510.0227,367.0157,314.2177,299.10100.0%
1,0241,0241024x5,599.615,598.470.0255,063.58115,462.51135,729.8189.0%
1,0242,0481x145.5469.290.41183.52183.5214,070.26100.0%
1,0242,0482x257.55122.860.25210.11211.6615,900.72100.0%
1,0242,0484x502.07248.520.17200.97292.0615,690.43100.0%
1,0242,0488x643.98316.540.16283.30340.4624,729.41100.0%
1,0242,04816x1,136.93566.730.10317.46387.4627,679.28100.0%
1,0242,04832x1,802.38938.430.08999.481,891.1333,430.48100.0%
1,0242,04864x3,166.881,673.470.05507.88702.5337,439.60100.0%
1,0242,048128x4,579.822,419.510.031,230.11953.6743,198.59100.0%
1,0242,048256x4,616.742,445.810.0319,940.0343,404.1787,613.48100.0%
1,0242,048512x4,972.642,661.330.0348,631.00114,271.31149,219.9987.7%
2,0481281x124.151,908.830.04203.81203.811,030.91100.0%
2,0481282x232.943,565.970.02192.80211.111,091.94100.0%
2,0481284x417.966,399.180.02199.14225.961,221.70100.0%
2,0481288x539.238,242.230.01316.29355.681,891.74100.0%
2,04812816x871.8613,316.730.01337.09541.542,312.39100.0%
2,04812832x1,455.5522,251.780.01524.30711.402,760.41100.0%
2,04812864x2,252.5434,429.440.01730.661,218.403,544.02100.0%
2,048128128x2,530.1438,689.340.011,106.543,225.315,488.79100.0%
2,048128256x2,825.1343,251.210.012,680.375,569.048,997.70100.0%
2,048128512x3,249.1551,413.770.004,780.7910,467.9413,726.57100.0%
2,0481281024x3,156.3249,914.450.0111,850.9026,135.5829,770.75100.0%
2,0485121x134.63517.490.14177.93177.933,802.73100.0%
2,0485122x254.03972.220.08256.68257.764,027.76100.0%
2,0485124x402.121,539.170.06259.32307.295,089.00100.0%
2,0485128x620.412,468.380.05255.03330.876,326.22100.0%
2,04851216x1,242.444,778.640.03423.18524.006,521.55100.0%
2,04851232x1,744.356,665.100.02521.97781.469,314.58100.0%
2,04851264x3,136.2311,989.940.01582.961,018.6010,302.16100.0%
2,048512128x3,637.0513,960.180.011,271.711,977.7913,596.17100.0%
2,048512256x4,008.3615,336.180.018,100.7216,176.5527,136.01100.0%
2,048512512x4,582.7118,022.620.0114,955.9331,034.4242,838.00100.0%
2,0485121024x4,353.1216,892.410.0141,756.3481,539.1995,689.78100.0%
2,0481,0241x147.30283.080.20195.00195.006,950.93100.0%
2,0481,0242x268.38513.560.14141.04186.727,581.70100.0%
2,0481,0244x418.62875.080.10275.38342.238,945.72100.0%
2,0481,0248x557.981,253.560.07362.89457.4112,469.39100.0%
2,0481,02416x962.682,122.620.05459.62694.9914,703.88100.0%
2,0481,02432x1,714.283,606.660.04546.90959.5417,259.36100.0%
2,0481,02464x3,057.896,229.830.02654.521,123.2019,967.54100.0%
2,0481,024128x3,480.206,892.830.022,103.6313,641.0231,546.38100.0%
2,0481,024256x4,880.799,682.730.0211,641.8524,412.6247,193.27100.0%
2,0481,024512x5,086.6810,127.120.0233,041.7168,961.4290,690.26100.0%
2,0481,0241024x5,024.209,906.060.0261,462.20119,826.44141,925.5375.2%
2,0482,0481x150.33156.450.29217.06217.0612,570.07100.0%
2,0482,0482x266.25257.570.19275.99305.3915,193.66100.0%
2,0482,0484x350.67447.050.15807.311,467.4817,061.78100.0%
2,0482,0488x658.93865.090.10330.37408.6718,078.46100.0%
2,0482,04816x882.091,201.770.08415.50543.7426,000.27100.0%
2,0482,04832x1,611.791,927.720.06530.69781.8932,388.14100.0%
2,0482,04864x2,840.953,163.490.04667.421,193.9439,402.31100.0%
2,0482,048128x3,414.813,654.470.033,339.4324,196.8856,045.27100.0%
2,0482,048256x4,765.425,132.700.0321,082.9046,978.6892,241.68100.0%
2,0482,048512x4,707.855,106.030.0348,177.25108,485.30147,303.8383.6%

Hardware Configuration

GPU ManufacturerNVIDIA
GPU ModelNVIDIA H200 NVL
GPU Count2
GPU Memory (Total)280 GB
GPU Driver580.95.05
CUDA VersionUnknown
Compute Capability9.0
Power Limit (per GPU)600 W
CPU ModelIntel(R) Xeon(R) 6960P
RAM2,267 GB

Software Configuration

Inference FrameworkvLLM
Framework Version0.11.0
OSUbuntu
OS Version22.04.5 LTS (Jammy Jellyfish)
Kernel Version5.15.0-88-generic
Python Version3.10.12

Model Configuration

Providerqwen
Model Nameqwen3-30b-a3b
QuantizationFP16

Inference Configuration

Runtime parameters used across all benchmark runs

Max Model LengthUnknown
Tensor Parallel Size1
Pipeline Parallel Size1
GPU Memory Utilization0.90
Temperature0.70
Top-P1.00
Top-K-1