⚡ WebGPU Benchmark Results (39.03x speedup) – M1 Max

#48, opened by pcuenq (HF staff)
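| Batch Size | WASM (int8) | WASM (fp16) | WASM (fp32) | WebGPU (int8) | WebGPU (fp16) | WebGPU (fp32) |
|---|---|---|---|---|---|---|
| 1 | 372.50 | 392.80 | 375.00 | 358.40 | 21.00 | 16.30 |
| 2 | 743.50 | 782.90 | 749.90 | 654.00 | 54.00 | 26.50 |
| 4 | 1482.00 | 1561.00 | 1510.40 | 1235.70 | 46.10 | 45.90 |
| 8 | 3009.80 | 3164.60 | 3049.50 | 2440.60 | 108.40 | 77.20 |
| 16 | 6134.60 | 6451.50 | 6127.90 | 4888.50 | 146.80 | 156.30 |
| 32 | 12237.00 | 13093.60 | 12447.60 | 10082.60 | 335.50 | 343.60 |
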
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (int8), WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=apple, architecture=common-3, device=, description=
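
The 39.03x in the title is consistent with dividing the WASM (fp16) time by the WebGPU (fp16) time at batch size 32 (13093.60 / 335.50). The post does not say how the benchmark page derives its headline number, so treat that reading as an assumption. A minimal TypeScript sketch for recomputing per-dtype speedups from the timings above, where the `Row` type and `speedup` helper are hypothetical names used only for illustration:

```typescript
// Timings copied from the results table in this post.
// Assumption: speedup = WASM time / WebGPU time for the same dtype and batch size.
type Dtype = "int8" | "fp16" | "fp32";
type Timings = Record<Dtype, number>;

interface Row {
  batch: number;
  wasm: Timings;
  webgpu: Timings;
}

const rows: Row[] = [
  { batch: 1,  wasm: { int8: 372.5,   fp16: 392.8,   fp32: 375.0 },   webgpu: { int8: 358.4,   fp16: 21.0,  fp32: 16.3 } },
  { batch: 2,  wasm: { int8: 743.5,   fp16: 782.9,   fp32: 749.9 },   webgpu: { int8: 654.0,   fp16: 54.0,  fp32: 26.5 } },
  { batch: 4,  wasm: { int8: 1482.0,  fp16: 1561.0,  fp32: 1510.4 },  webgpu: { int8: 1235.7,  fp16: 46.1,  fp32: 45.9 } },
  { batch: 8,  wasm: { int8: 3009.8,  fp16: 3164.6,  fp32: 3049.5 },  webgpu: { int8: 2440.6,  fp16: 108.4, fp32: 77.2 } },
  { batch: 16, wasm: { int8: 6134.6,  fp16: 6451.5,  fp32: 6127.9 },  webgpu: { int8: 4888.5,  fp16: 146.8, fp32: 156.3 } },
  { batch: 32, wasm: { int8: 12237.0, fp16: 13093.6, fp32: 12447.6 }, webgpu: { int8: 10082.6, fp16: 335.5, fp32: 343.6 } },
];

// Ratio of WASM time to WebGPU time for one batch size and dtype.
function speedup(batch: number, dtype: Dtype): number {
  const row = rows.find((r) => r.batch === batch);
  if (!row) throw new Error(`no row for batch size ${batch}`);
  return row.wasm[dtype] / row.webgpu[dtype];
}

console.log(`${speedup(32, "fp16").toFixed(2)}x`); // prints "39.03x"
```

Calling `speedup` with other batch sizes or dtypes gives the remaining ratios. For int8 the ratio stays close to 1 at every batch size, since the WebGPU (int8) timings are only slightly lower than the WASM (int8) ones.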
