⚡ WebGPU Benchmark Results (35.43x speedup) – M1 Max

#49
by pcuenq HF staff - opened
Batch SizeWebGPU (int8)WebGPU (fp16)WebGPU (fp32)
1356.9026.6018.40
2652.9030.4023.30
41234.0049.5042.00
82410.9073.8077.30
164801.80113.00157.00
329923.90224.50343.50
6420731.30429.00664.00
12847839.602693.801350.40
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WebGPU (int8), WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=apple, architecture=common-3, device=, description=

Sign up or log in to comment