ONNX Conversion Tutorial

by qnguyen3 - opened 15 days ago

15 days ago

Hi @Xenova , thank you for your awesome work.
I recently fine-tuned this model for information extraction from images using JSON Schema, with the intention of embedding it into a web application. I was wondering if you could recommend any existing tutorials that would guide me through the process of converting the model into the ONNX format. This would enable me to perform the conversion independently in the future. Thank you for your awesome work!

Xenova

Owner 15 days ago

I must admit, the current process to export the model is a bit complicated, and is very manual/hacky at the moment... I'll eventually turn it into a script, but in the meantime, just ping me and I'd be happy to help out with it.

Xenova

Owner 15 days ago

There are 3 components:

Vision model + multimodal projection (vision_encoder.onnx)
Embedding layer (embed_tokens.onnx)
Language model without embedding layer (decoder_model_merged.onnx)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment