⚡ WebGPU Benchmark Results (62.80x speedup)

#74
by a414166402 - opened
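| Batch Size | WASM (int8) | WASM (fp16) | WASM (fp32) | WebGPU (fp16) | WebGPU (fp32) |
|---|---|---|---|---|---|
| 1 | 400.70 | 513.20 | 467.20 | 17.40 | 19.90 |
| 2 | 796.50 | 1013.90 | 917.50 | 58.00 | 36.00 |
| 4 | 1552.70 | 2013.80 | 1849.20 | 56.10 | 65.90 |
| 8 | 3176.20 | 4164.10 | 3827.00 | 168.20 | 116.70 |
| 16 | 6805.90 | 8905.30 | 8092.00 | 258.60 | 145.70 |
| 32 | 14272.10 | 18282.50 | 16072.30 | 477.00 | 291.10 |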
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36
  • GPU: vendor=nvidia, architecture=turing, device=, description=
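
For anyone who wants to check the headline number: the post does not say how the 62.80x figure was derived, but it is reproduced by dividing the slowest WASM timing by the fastest WebGPU timing at batch size 32. The sketch below assumes that definition. The `results` dictionary and the `best_speedup` helper are illustrative names, not part of the benchmark tool, and the timings are copied from the table in the units the benchmark reported.

```python
# Timings copied from the results table, keyed by batch size.
# Units are whatever the benchmark reported; only ratios are used here.
results = {
    1:  {"WASM (int8)": 400.70,   "WASM (fp16)": 513.20,   "WASM (fp32)": 467.20,
         "WebGPU (fp16)": 17.40,  "WebGPU (fp32)": 19.90},
    2:  {"WASM (int8)": 796.50,   "WASM (fp16)": 1013.90,  "WASM (fp32)": 917.50,
         "WebGPU (fp16)": 58.00,  "WebGPU (fp32)": 36.00},
    4:  {"WASM (int8)": 1552.70,  "WASM (fp16)": 2013.80,  "WASM (fp32)": 1849.20,
         "WebGPU (fp16)": 56.10,  "WebGPU (fp32)": 65.90},
    8:  {"WASM (int8)": 3176.20,  "WASM (fp16)": 4164.10,  "WASM (fp32)": 3827.00,
         "WebGPU (fp16)": 168.20, "WebGPU (fp32)": 116.70},
    16: {"WASM (int8)": 6805.90,  "WASM (fp16)": 8905.30,  "WASM (fp32)": 8092.00,
         "WebGPU (fp16)": 258.60, "WebGPU (fp32)": 145.70},
    32: {"WASM (int8)": 14272.10, "WASM (fp16)": 18282.50, "WASM (fp32)": 16072.30,
         "WebGPU (fp16)": 477.00, "WebGPU (fp32)": 291.10},
}


def best_speedup(row):
    """Slowest WASM timing divided by fastest WebGPU timing (assumed definition)."""
    slowest_wasm = max(t for name, t in row.items() if name.startswith("WASM"))
    fastest_webgpu = min(t for name, t in row.items() if name.startswith("WebGPU"))
    return slowest_wasm / fastest_webgpu


# Print the ratio for every batch size in the table.
for batch_size, row in results.items():
    print(f"batch {batch_size:>2}: {best_speedup(row):.2f}x")
```

Under this definition the ratio grows with batch size, and the batch size 32 row gives the 62.80x in the title. Comparing like for like (for example fp32 against fp32) would give a smaller number.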
