NVIDIA A100 80GB PCIe (2x) - gemma-3-27b-it

November 2, 2025 at 09:37 PM

Dataset: reference (v1.0)

Best Performance

Click a metric to highlight the best run in the table below

Best Output TPS
1,834.30
Peak generation speed
Best Input TPS
4,909.53
Peak prefill speed
Best Energy Efficiency
0.03 kWh/MT
Energy cost per 1M tokens
Best TTFT (P95)
80.32 ms
Lowest latency
Best E2E (P95)
5,447.75 ms
Lowest latency

Test Matrix Results

Performance across different input/output token combinations and concurrency levels

Input TokensOutput TokensConcurrencyOutput TPSInput TPSEnergy Cost
(kWh/MT)
TTFT MeanTTFT P95E2E P95Success Rate
Best Run for Output TPS
128128256x1,834.301,837.610.044,777.469,628.8917,018.42100.0%
1281281x13.6212.982.494,152.624,152.629,396.89100.0%
1281282x26.8626.231.384,320.554,354.149,528.95100.0%
1281284x92.7492.010.79308.88344.665,513.05100.0%
1281288x184.54185.080.39250.83309.545,545.32100.0%
12812816x342.93345.450.20400.06516.975,957.65100.0%
12812832x586.23587.090.12672.08731.576,449.38100.0%
12812864x1,045.431,040.580.07883.791,202.047,684.86100.0%
128128128x1,164.331,164.190.061,273.822,055.399,145.84100.0%
128128512x1,577.361,579.820.0513,534.3826,816.8134,711.40100.0%
1285121x24.105.743.41297.84297.8421,245.41100.0%
1285122x47.3311.562.67474.73637.5621,617.18100.0%
1285124x94.6223.471.35330.53382.4721,637.97100.0%
1285128x178.3646.870.71320.94390.9421,900.00100.0%
12851216x347.3389.590.37400.96486.8522,980.55100.0%
12851232x664.91167.750.19548.04649.3724,434.30100.0%
12851264x1,192.09298.290.11812.661,020.0527,259.75100.0%
128512128x1,674.19418.420.081,141.311,711.1237,835.43100.0%
128512256x1,426.00357.810.0918,357.9236,390.4069,700.5498.8%
1281,0241x23.932.853.75299.56299.5642,786.41100.0%
1281,0242x47.895.851.90301.09326.0042,756.49100.0%
1281,0244x94.1811.681.50263.52274.1843,486.01100.0%
1281,0248x170.6323.250.83206.66254.0544,163.08100.0%
1281,02416x345.5945.110.42277.85317.4145,714.38100.0%
1281,02432x629.7081.610.22436.29594.7250,201.52100.0%
1281,02464x1,114.99141.910.13596.76901.8157,353.71100.0%
1281,024128x1,057.38133.750.131,969.372,075.37103,104.07100.0%
1282,0481x24.022.723.8180.3280.3244,800.66100.0%
1282,0482x42.004.112.22148.05181.6660,052.09100.0%
1282,0484x85.907.241.73144.02180.2369,989.56100.0%
1282,0488x130.3011.611.04200.22237.8182,964.39100.0%
1282,04816x289.4825.380.50332.71426.8980,950.15100.0%
1282,04832x503.0043.170.30405.57587.8790,677.70100.0%
1282,04864x870.5573.950.17583.43802.92105,871.88100.0%
1282,048128x971.1480.390.152,108.371,884.70186,177.52100.0%
1282,048256x976.1181.740.1510,353.29120,575.72203,997.2654.7%
5121281x23.4991.040.84176.10176.105,447.75100.0%
5121282x45.85178.190.43243.29307.705,579.84100.0%
5121284x91.44353.280.33178.65230.445,597.27100.0%
5121288x174.45672.910.18385.87462.195,865.29100.0%
51212816x302.291,171.370.10658.671,037.686,760.84100.0%
51212832x543.382,117.410.06775.691,526.577,467.25100.0%
51212864x795.803,095.980.041,531.872,890.2110,059.09100.0%
512128128x814.233,163.500.042,190.304,982.7916,272.00100.0%
512128256x858.703,336.160.0411,657.4622,272.1232,641.36100.0%
512128512x880.523,420.790.0429,592.0356,120.5267,103.07100.0%
5121281024x925.953,595.350.0459,996.64117,297.72128,748.4695.9%
5125121x23.8623.122.16168.88168.8821,457.16100.0%
5125122x47.6646.311.66167.36167.7421,483.96100.0%
5125124x92.4889.320.85298.04452.4622,139.55100.0%
5125128x183.30176.770.45290.36449.6222,340.59100.0%
51251216x334.03332.020.24572.37675.3923,892.49100.0%
51251232x597.50592.380.13843.431,486.9626,855.45100.0%
51251264x1,026.141,003.750.081,371.072,736.9631,549.84100.0%
512512128x935.23912.510.092,830.785,351.4265,564.65100.0%
5121,0241x23.7611.512.92170.54170.5443,102.69100.0%
5121,0242x47.3122.991.43173.78237.9643,282.33100.0%
5121,0244x94.7645.761.11236.76314.6743,216.00100.0%
5121,0248x179.7087.480.61403.71481.8445,147.94100.0%
5121,02416x329.22165.210.32547.92877.8847,987.78100.0%
5121,02432x571.31294.000.181,074.721,746.9954,188.76100.0%
5121,02464x870.59431.960.121,334.032,813.2265,613.62100.0%
5121,024128x934.32463.190.125,335.747,645.09133,033.10100.0%
5121,024256x921.12459.160.1225,354.6781,778.51185,696.8569.1%
5122,0481x23.697.533.26185.72185.7265,793.64100.0%
5122,0482x44.6513.392.62128.52165.9573,836.81100.0%
5122,0484x81.4024.701.42271.22348.1078,430.80100.0%
5122,0488x165.9253.630.71432.50594.9273,406.66100.0%
5122,04816x267.86102.420.44717.33974.1676,198.84100.0%
5122,04832x487.80170.870.24706.811,286.1288,675.57100.0%
5122,04864x812.15272.680.151,503.612,944.34108,729.82100.0%
5122,048128x864.44290.590.144,263.047,552.00202,964.31100.0%
5122,048256x898.40302.180.144,281.607,631.59202,180.0350.0%
1,0241281x22.58172.020.50315.86315.865,665.77100.0%
1,0241282x45.09344.200.37203.01300.895,663.92100.0%
1,0241284x86.05656.970.20450.45588.555,948.06100.0%
1,0241288x161.061,231.990.10621.17836.346,353.42100.0%
1,02412816x267.712,053.730.06861.281,602.457,598.02100.0%
1,02412832x420.233,224.270.041,295.672,772.019,602.70100.0%
1,02412864x629.524,833.940.032,081.424,853.4012,804.67100.0%
1,024128128x513.543,943.830.0411,855.3723,113.4331,341.12100.0%
1,0245121x23.4444.641.50308.17308.1721,839.53100.0%
1,0245122x46.9189.511.14294.35296.4021,828.25100.0%
1,0245124x92.67176.890.58328.16533.9622,090.17100.0%
1,0245128x174.53333.760.31595.66852.8023,447.66100.0%
1,02451216x324.44622.220.17853.211,588.1225,216.93100.0%
1,02451232x559.051,083.730.101,367.742,734.6528,880.21100.0%
1,02451264x729.311,400.040.072,346.535,184.3538,222.52100.0%
1,024512128x807.841,572.450.0723,111.1848,167.9879,419.21100.0%
1,024512256x708.501,367.120.0844,510.3088,790.50123,295.9878.5%
1,0241,0241x23.5322.402.20291.82291.8243,520.17100.0%
1,0241,0242x46.9844.821.11330.81351.6643,589.89100.0%
1,0241,0244x93.4189.150.86436.34571.7143,834.64100.0%
1,0241,0248x180.42172.520.45639.99870.2445,401.79100.0%
1,0241,02416x327.79322.330.25830.581,235.1048,711.97100.0%
1,0241,02432x573.71565.400.141,469.482,801.1855,474.69100.0%
1,0241,02464x727.69720.480.102,757.075,901.9069,010.16100.0%
1,0241,024128x753.78750.760.1037,507.4279,812.52145,994.24100.0%
1,0241,024256x723.89719.740.1139,612.8281,305.35147,599.4652.3%
1,0242,0481x23.5413.442.77299.43299.4372,521.45100.0%
1,0242,0482x43.1926.962.13286.70289.5171,812.95100.0%
1,0242,0484x86.9150.511.12395.48586.6277,264.13100.0%
1,0242,0488x142.2286.590.69682.591,092.6387,586.86100.0%
1,0242,04816x275.83179.620.35873.091,673.9782,485.92100.0%
1,0242,04832x470.11305.880.211,369.042,774.19101,004.87100.0%
1,0242,04864x730.31496.250.134,429.617,985.87121,306.76100.0%
1,0242,048128x744.37508.420.1341,455.41114,732.23203,274.0485.9%
2,0481281x21.19325.720.27571.48571.486,042.43100.0%
2,0481282x43.21661.550.20321.52519.455,905.05100.0%
2,0481284x72.461,109.400.11930.761,479.197,055.29100.0%
2,0481288x125.121,912.510.071,224.392,410.818,159.45100.0%
2,04812816x240.913,679.680.041,271.112,282.468,450.72100.0%
2,04812832x321.384,909.530.032,664.855,749.0012,581.70100.0%
2,04812864x306.484,682.220.037,446.9519,557.9125,754.00100.0%
2,0485121x23.0288.500.93539.77539.7722,234.93100.0%
2,0485122x45.33173.480.46799.161,027.0722,587.95100.0%
2,0485124x90.34345.770.36590.321,003.9822,661.48100.0%
2,0485128x160.83626.810.191,113.801,931.3824,949.99100.0%
2,04851216x283.721,133.900.111,644.423,126.3827,553.98100.0%
2,04851232x485.301,936.610.072,680.545,600.6732,087.26100.0%
2,04851264x473.821,845.560.0710,150.8942,856.0966,706.49100.0%
2,0481,0241x23.3144.791.51545.80545.8043,934.24100.0%
2,0481,0242x46.7189.381.16538.24546.2043,840.09100.0%
2,0481,0244x80.10178.740.57702.971,018.1643,848.00100.0%
2,0481,0248x144.96341.130.33910.541,404.0245,856.48100.0%
2,0481,02416x265.75637.110.181,075.652,277.9549,021.03100.0%
2,0481,02432x505.581,103.950.101,684.104,409.1856,491.28100.0%
2,0481,02464x534.981,130.360.0911,317.8944,270.6697,610.50100.0%
2,0481,024128x666.511,370.220.0844,801.7790,058.54153,034.3693.0%
2,0481,024256x661.721,348.640.0845,463.5789,444.37152,945.2146.5%
2,0482,0481x23.3729.881.91559.10559.1065,812.09100.0%
2,0482,0482x41.9159.831.45544.45552.8564,769.45100.0%
2,0482,0484x69.81133.430.79708.071,082.7858,450.36100.0%
2,0482,0488x110.21212.480.461,095.701,894.3069,456.32100.0%
2,0482,04816x203.71368.770.271,337.302,820.3181,592.39100.0%
2,0482,04832x360.93576.840.163,500.176,652.96105,053.43100.0%
2,0482,04864x560.51832.700.1111,440.9541,217.92129,621.59100.0%
2,0482,048128x608.21900.750.1142,563.21115,326.01195,998.5477.3%

Hardware Configuration

GPU ManufacturerNVIDIA
GPU ModelNVIDIA A100 80GB PCIe
GPU Count2
GPU Memory (Total)160 GB
GPU Driver570.195.03
CUDA VersionUnknown
Compute Capability8.0
Power Limit (per GPU)300 W
CPU ModelIntel Xeon Processor (Icelake)
RAM31 GB

Software Configuration

Inference FrameworkvLLM
Framework Version0.11.0
OSUbuntu
OS Version22.04.5 LTS (Jammy Jellyfish)
Kernel Version5.15.0-88-generic
Python Version3.10.12

Model Configuration

Providergoogle
Model Namegemma-3-27b-it
QuantizationBF16

Inference Configuration

Runtime parameters used across all benchmark runs

Max Model Length8192
Tensor Parallel Size1
Pipeline Parallel Size1
GPU Memory Utilization0.90
Temperature0.70
Top-P1.00
Top-K-1