Convert and iOS
#7 opened by l0d0v1c
Very impressive work! Two questions:
1- I tried to convert fine-tuned Phi-3 models with Optimum, but I only get a 15 GB ONNX file from the quantized model. Is there a script to convert models? (A sketch of roughly what I ran is below, after the second question.)
2- On iPad it is possible to enable the WebGPU setting, but it only loads the first part of the model, not the second one. Could this be a RAM limitation?
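For reference, here is a minimal sketch of the conversion I tried. The model ID `my-org/phi-3-finetune` is a placeholder for my fine-tuned checkpoint, and the dynamic INT8 quantization step is my own assumption about how to shrink the export, not necessarily the script used for the original models:

```python
# Sketch of the export + quantization attempt (paths and settings are assumptions).
from pathlib import Path

from optimum.onnxruntime import ORTModelForCausalLM
from onnxruntime.quantization import QuantType, quantize_dynamic

model_id = "my-org/phi-3-finetune"  # placeholder: my fine-tuned Phi-3 checkpoint
onnx_dir = Path("phi3-onnx")

# 1. Export the PyTorch checkpoint to ONNX via Optimum.
model = ORTModelForCausalLM.from_pretrained(model_id, export=True)
model.save_pretrained(onnx_dir)

# 2. Dynamically quantize the exported weights to INT8 with ONNX Runtime.
quantize_dynamic(
    onnx_dir / "model.onnx",
    onnx_dir / "model_quantized.onnx",
    weight_type=QuantType.QInt8,
)
```

Even after this, the quantized ONNX I get is around 15 GB, so I suspect I'm missing a step that the original conversion used.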