Tenstorrent Wormhole (32x) - Llama-3.3-70B-Instruct

February 19, 2026 at 06:18 PM

Dataset: reference (v1.0)

Best Performance

Click a metric to highlight the best run in the table below

Best Output TPS
1,091.26
Peak generation speed
Best Input TPS
4,654.03
Peak prefill speed
Best Energy Efficiency
0.13 kWh/MT
Energy cost per 1M tokens
Best TTFT (P95)
195.78 ms
Lowest latency
Best E2E (P95)
2,244.78 ms
Lowest latency

Test Matrix Results

Performance across different input/output token combinations and concurrency levels

Input TokensOutput TokensConcurrencyOutput TPSInput TPSEnergy Cost
(kWh/MT)
TTFT MeanTTFT P95E2E P95Success Rate
Best Run for Output TPS
12851232x1,091.26286.230.405,617.575,629.0614,356.75100.0%
1281281x25.5026.107.313,048.703,048.705,018.98100.0%
1281282x108.06108.481.88370.77371.122,367.81100.0%
1281284x186.79187.161.03719.82721.422,740.22100.0%
1281288x298.02299.191.021,421.911,425.133,434.87100.0%
12812816x421.83424.920.762,819.222,825.254,851.99100.0%
12812832x533.40535.490.605,609.555,621.757,672.03100.0%
12812864x496.12496.980.657,480.3310,812.5212,857.72100.0%
1285121x61.4415.846.28196.02196.028,267.94100.0%
1285122x120.4030.222.95372.19372.588,503.87100.0%
1285124x230.6057.761.57724.17728.148,878.98100.0%
1285128x386.16105.721.091,422.021,425.309,720.78100.0%
12851216x673.64184.410.642,819.902,824.7111,182.59100.0%
12851264x949.19245.860.479,350.4617,537.4926,030.85100.0%
1281,0241x61.5210.766.69196.57196.5712,154.90100.0%
1281,0242x103.5915.234.11371.22371.7216,625.65100.0%
1281,0244x207.5629.741.95720.42721.9917,052.71100.0%
1281,0248x336.5056.841.301,419.891,422.5617,969.51100.0%
1281,02416x564.40104.360.802,821.802,826.9318,329.48100.0%
1281,02432x1,028.82179.850.445,612.455,623.4022,548.80100.0%
1281,02464x939.03164.500.509,925.5621,165.6836,751.53100.0%
1282,0481x61.949.986.48196.12196.1213,107.21100.0%
1282,0482x91.9211.834.44372.32372.8221,185.25100.0%
1282,0484x213.9631.362.00719.11722.1816,261.30100.0%
1282,0488x254.7741.851.681,424.081,428.3821,522.24100.0%
1282,04816x533.6895.850.832,820.312,824.8619,505.73100.0%
1282,04832x810.90139.580.555,613.845,625.1423,723.55100.0%
1282,04864x941.12156.770.499,920.5220,978.3038,115.77100.0%
1282,048128x885.71148.240.519,783.5520,509.2137,137.06100.0%
5121281x57.02216.931.49198.20198.202,244.78100.0%
5121282x106.58413.820.78372.75373.282,401.55100.0%
5121284x146.79708.210.48724.09726.612,799.35100.0%
5121288x291.651,127.310.431,426.321,428.883,509.63100.0%
51212816x413.071,598.630.322,820.092,825.944,954.80100.0%
51212832x524.462,037.640.235,619.955,632.687,802.94100.0%
51212864x487.881,893.660.287,534.9310,935.4013,081.45100.0%
5125121x60.6657.694.04201.37201.378,441.38100.0%
5125122x118.50115.031.96376.45377.068,640.31100.0%
5125124x154.30218.391.23724.17725.509,078.86100.0%
5125128x352.36399.150.681,423.741,427.229,913.63100.0%
51251216x665.25687.190.422,823.932,831.3011,527.85100.0%
51251232x1,086.901,096.690.265,614.735,625.5714,495.86100.0%
51251264x910.74922.330.309,707.8218,217.7126,893.67100.0%
5121,0241x60.1332.875.20197.54197.5414,800.86100.0%
5121,0242x105.8366.502.70373.22373.6914,772.17100.0%
5121,0244x136.07111.961.86722.95727.0817,384.95100.0%
5121,0248x303.46212.811.001,425.001,428.3418,504.60100.0%
5121,02416x563.14392.670.562,821.852,826.7619,060.17100.0%
5121,02432x988.34674.720.335,628.805,640.5322,745.57100.0%
5121,02464x950.01615.990.3410,523.0322,681.7338,824.61100.0%
5122,0481x60.4936.594.67195.78195.7813,292.61100.0%
5122,0482x110.5260.522.82372.25372.8816,302.67100.0%
5122,0484x129.49105.592.05723.97725.8718,323.10100.0%
5122,0488x308.97192.251.011,422.721,425.5219,431.29100.0%
5122,04816x587.08388.740.562,824.782,829.9819,075.79100.0%
5122,04832x923.77603.860.365,616.635,628.0124,911.69100.0%
5122,04864x917.09595.250.3510,330.2922,827.9837,146.60100.0%
1,0241281x55.77424.400.84201.55201.552,294.73100.0%
1,0241282x102.73787.720.45375.79376.312,491.56100.0%
1,0241284x180.731,392.870.26725.36726.782,832.28100.0%
1,0241288x286.192,201.510.251,425.911,429.363,576.01100.0%
1,02412816x407.973,140.240.182,827.192,832.835,015.69100.0%
1,02412832x511.173,974.370.135,626.015,636.357,913.48100.0%
1,02412864x467.263,588.670.157,730.6911,469.4213,660.05100.0%
1,0245121x58.75111.762.60197.13197.138,715.40100.0%
1,0245122x114.79220.041.30372.88372.978,920.40100.0%
1,0245124x218.92421.810.75722.27723.619,353.35100.0%
1,0245128x377.54770.440.471,430.001,433.5510,220.86100.0%
1,02451216x600.241,334.010.292,825.322,828.9211,812.26100.0%
1,02451232x1,039.052,101.400.185,620.115,634.8514,970.83100.0%
1,02451264x891.301,784.630.219,532.3318,414.0227,499.91100.0%
1,0241,0241x56.4893.093.02197.64197.6410,442.40100.0%
1,0241,0242x94.41120.342.20371.91372.5916,002.81100.0%
1,0241,0244x190.83249.671.09722.76724.3015,753.59100.0%
1,0241,0248x272.39412.390.721,428.181,432.5219,097.08100.0%
1,0241,02416x457.35757.080.422,832.182,837.2019,122.70100.0%
1,0241,02432x833.341,293.540.265,630.515,641.5822,414.08100.0%
1,0241,02464x850.951,216.980.269,494.6221,026.9738,560.44100.0%
1,0241,024128x864.671,261.720.269,566.9120,542.7636,670.33100.0%
1,0241,024256x892.341,307.570.269,858.6121,267.3736,106.22100.0%
1,0241,024512x854.151,209.940.269,771.9721,382.7337,159.62100.0%
1,0242,0481x58.6390.793.22199.86199.8610,710.86100.0%
1,0242,0482x101.94144.801.85372.17372.8013,384.59100.0%
1,0242,0484x192.74272.991.02723.68726.7114,310.52100.0%
1,0242,0488x252.26346.240.841,427.771,430.1820,697.39100.0%
1,0242,04816x553.30845.660.352,835.602,841.4517,925.79100.0%
1,0242,04832x787.901,236.250.265,626.795,637.2922,403.45100.0%
1,0242,04864x825.941,218.040.279,690.4620,825.7535,711.46100.0%
1,0242,048128x845.181,190.920.289,571.3021,310.0836,857.89100.0%
1,0242,048256x766.581,083.020.299,581.0020,292.4836,461.90100.0%
2,0481281x49.82761.390.49367.97367.972,569.27100.0%
2,0481282x87.701,336.420.28710.45711.102,918.26100.0%
2,0481284x140.202,141.290.251,395.131,397.753,651.32100.0%
2,0481288x202.133,087.840.192,764.772,768.755,064.53100.0%
2,04812816x258.883,954.620.145,503.055,507.517,907.99100.0%
2,04812832x304.494,654.030.1310,967.6510,978.8913,444.00100.0%
2,04812864x290.464,443.600.1414,067.8919,584.0421,979.73100.0%
2,0485121x55.38222.451.60364.38364.388,776.10100.0%
2,0485122x99.62405.640.94709.40710.079,557.55100.0%
2,0485124x189.45723.400.531,397.131,400.1910,807.84100.0%
2,0485128x339.561,310.570.342,763.632,767.5711,931.22100.0%
2,04851216x508.622,051.610.225,499.775,506.4415,241.78100.0%
2,04851232x745.642,972.040.1610,971.5210,980.3221,051.29100.0%
2,04851264x668.032,628.920.1816,116.8527,419.9337,170.19100.0%
2,0481,0241x54.84238.921.60374.16374.168,173.72100.0%
2,0481,0242x91.96328.511.13706.09706.7311,688.90100.0%
2,0481,0244x173.00566.050.681,396.881,399.4113,791.70100.0%
2,0481,0248x245.37731.530.512,761.162,764.2719,493.38100.0%
2,0481,02416x438.341,267.520.315,505.075,511.3622,968.88100.0%
2,0481,02432x702.812,046.150.2110,971.2910,980.5729,729.92100.0%
2,0481,02464x705.681,969.180.2216,205.4130,332.3146,023.82100.0%
2,0481,024128x703.711,940.040.2216,387.2430,966.0447,578.63100.0%
2,0482,0481x54.47237.291.59374.04374.048,224.48100.0%
2,0482,0482x90.34242.391.42709.42709.9415,818.84100.0%
2,0482,0484x165.97554.410.701,391.511,394.9214,034.30100.0%
2,0482,0488x223.51676.280.562,764.492,767.1120,901.13100.0%
2,0482,04816x307.10733.360.485,496.345,500.8231,920.57100.0%
2,0482,04832x646.791,971.720.2210,970.3710,980.0829,207.73100.0%
2,0482,04864x550.841,434.290.2716,473.9630,400.6348,767.76100.0%

Hardware Configuration

GPU ManufacturerTenstorrent
GPU ModelWormhole
GPU Count32
GPU Memory (Total)384 GB
CPU ModelAMD EPYC 9354P 32-Core Processor
RAM566 GB

Software Configuration

Inference FrameworkvLLM
Framework VersionUnknown
OSUbuntu
OS Version22.04.5 LTS (Jammy Jellyfish)
Kernel Version6.8.0-94-generic
Python Version3.10.12

Model Configuration

Providermeta-llama
Model NameLlama-3.3-70B-Instruct
QuantizationFP16

Inference Configuration

Runtime parameters used across all benchmark runs

Max Model LengthUnknown
Tensor Parallel Size1
Pipeline Parallel Size1
GPU Memory UtilizationUnknown
Temperature0.70
Top-P1.00
Top-K-1