⚡ WebGPU Benchmark Results (1.88x speedup) – M1 Max WebGPU up to bs=128

#54
by pcuenq HF staff - opened
Batch SizeWebGPU (fp16)WebGPU (fp32)
115.9019.20
223.3031.20
435.6052.80
853.3099.70
1694.20197.20
32192.70389.20
64399.60764.40
128796.601494.60
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=apple, architecture=common-3, device=, description=

Sign up or log in to comment