Faster inference?

#1
by DoctorSlimm - opened

Great model, I'm a huge fan! Is there any way to make it faster?

Is there anything along the lines of vLLM (or similar) for this model architecture?
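For example, if the architecture is on vLLM's supported-models list, I'd hope for something like this minimal sketch ("your-org/your-model" is just a placeholder repo id for this model):

```python
# Minimal vLLM sketch -- assumes the architecture is supported by vLLM,
# and "your-org/your-model" is a placeholder repo id.
from vllm import LLM, SamplingParams

llm = LLM(model="your-org/your-model")
params = SamplingParams(temperature=0.7, max_tokens=128)

# vLLM handles continuous batching internally, so a list of prompts
# is served efficiently in a single call.
outputs = llm.generate(["Hello, my name is", "The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```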

Batching? bfloat16? ONNX? Quantization? Rough sketches of what I mean below.
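For bfloat16 plus batched generation, I'm picturing something like this with plain transformers (a sketch, assuming a standard causal-LM checkpoint; same placeholder repo id):

```python
# Sketch: bfloat16 weights + batched generation with plain transformers.
# Assumes a standard causal-LM checkpoint; the repo id is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-org/your-model"
tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.padding_side = "left"           # left-pad for decoder-only generation
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half the memory of fp32, fast on Ampere+ GPUs
    device_map="auto",
)

# Batching: tokenize several prompts into one padded tensor, one generate call.
prompts = ["Hello, my name is", "The capital of France is"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True).to(model.device)
with torch.inference_mode():
    out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```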
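And for quantization and ONNX, maybe 8-bit loading via bitsandbytes, or an export through Optimum? Both sketches below assume the architecture is actually supported by those libraries:

```python
# 8-bit quantization sketch -- needs bitsandbytes and a CUDA GPU.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model_8bit = AutoModelForCausalLM.from_pretrained(
    "your-org/your-model",
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
```

```python
# ONNX export sketch via Optimum -- works only if the arch has an ONNX config.
from optimum.onnxruntime import ORTModelForCausalLM

ort_model = ORTModelForCausalLM.from_pretrained("your-org/your-model", export=True)
```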
