⚡ WebGPU Benchmark Results (183.84x speedup)

#50
by omaryshchenko - opened
Batch SizeWASM (int8)WASM (fp16)WASM (fp32)WebGPU (int8)WebGPU (fp16)WebGPU (fp32)
1298.30481.00460.60340.8033.5052.10
2609.80978.90928.30679.3088.70115.90
41209.901873.701763.801257.70124.0034.30
82425.403794.403536.902285.10152.10190.00
164977.407933.207318.603498.40207.4088.00
3210469.9019541.8015913.807471.70370.30106.30
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (int8), WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=nvidia, architecture=ampere, device=, description=

Sign up or log in to comment