OpenVINO IR model with int8 quantization

#2
by fakezeta - opened

Hi, I've converted you model to OpenVINO IR format.
Template to use it on LocalAI are added to the README.md file.

NousResearch org

is there instructions on how to configure function calling for Heremes-2-Pro with LocalAI?

Support for function calling on transformers Is in this PR with example instruction.

I still haven't had the chance to test it: will report here as soon as possible.

Sign up or log in to comment