Upload ONNX weights

#1
by Xenova HF staff - opened
Owner
No description provided.
Owner

@fxmarty I converted this repo's safetensors model to ONNX (same config), but it's significantly larger than the pytorch equivalent (~500MB vs ~8MB).

I'm using the LlamaOnnxConfig from Optimum - any ideas for what could cause this? :) Thanks!

Xenova changed pull request status to closed

Sign up or log in to comment