⚡ WebGPU Benchmark Results (233.94x speedup) - Ubuntu 3090 Ti

#51
by pcuenq HF staff - opened
Batch SizeWASM (int8)WASM (fp16)WASM (fp32)WebGPU (int8)WebGPU (fp32)
1485.50706.90664.20793.6030.00
2934.401425.301330.801249.8051.40
41904.802929.602771.002100.9097.70
83898.605863.205502.903860.90287.60
167886.6012020.4011108.707249.20330.30
3216368.7024586.7022498.9013512.10105.10
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (int8), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=nvidia, architecture=lovelace, device=, description=

Sign up or log in to comment