|
If you want to use vision functionality: |
|
|
|
Make sure you are using the latest version of KoboldCpp. |
|
To use the multimodal capabilities of this model, such as vision, you also need to load the specified mmproj file, which you can get here: |
|
|
|
https://huggingface.co/LeroyDyer/Mixtral_AI_Vision_128k/blob/main/mmproj-model-f16.gguf |
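
The link above points at the Hugging Face file viewer page (`/blob/`), not at the raw file. If you prefer to fetch the mmproj from a script, the sketch below rewrites the link to its `/resolve/` form, which serves the file itself. The function names and the local file name are illustrative choices, not part of KoboldCpp or Hugging Face tooling.

```python
import urllib.request


def blob_to_resolve_url(url):
    """Turn a Hugging Face '/blob/' page link into a direct '/resolve/' download link."""
    return url.replace("/blob/", "/resolve/", 1)


def download_mmproj(url, destination="mmproj-model-f16.gguf"):
    """Download the mmproj file to `destination` (a hypothetical local path)."""
    direct_url = blob_to_resolve_url(url)
    urllib.request.urlretrieve(direct_url, destination)
    return destination


if __name__ == "__main__":
    page_url = (
        "https://huggingface.co/LeroyDyer/Mixtral_AI_Vision_128k"
        "/blob/main/mmproj-model-f16.gguf"
    )
    # Only print the direct link here; call download_mmproj(page_url)
    # when you actually want to fetch the file.
    print(blob_to_resolve_url(page_url))
```

Downloading through the browser works just as well; this is only a convenience for scripted setups.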
|
|
|
You can load the mmproj by using the corresponding section in the interface: |
|
|
|
|
|
KoboldCpp now supports Vision via Multimodal Projectors (aka LLaVA), allowing it to perceive and react to images! Load a suitable --mmproj file or select it in the GUI launcher to use vision capabilities. (Vision does not work on Vulkan.) |
|
Note: This is NOT limited to LLaVA models; any compatible model of the same size and architecture can gain vision capabilities! |
|
Simply grab a 200 MB mmproj file for your architecture here, |
|
|
|
https://huggingface.co/koboldcpp/mmproj |
|
|
|
load it with --mmproj alongside your favorite compatible model, and it will be able to see images as well! |
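
As a rough illustration of the command-line route, the sketch below assembles a KoboldCpp launch command that passes both the model and the projector. It assumes you run KoboldCpp from source as `koboldcpp.py` (swap in the path to your binary otherwise); `my-model.gguf` is a placeholder file name, and `build_launch_command` is a hypothetical helper, not something KoboldCpp provides.

```python
import shlex
import sys


def build_launch_command(model_path, mmproj_path, launcher="koboldcpp.py"):
    """Return the argument list for starting KoboldCpp with a vision projector.

    `launcher` is an assumption: adjust it to wherever your KoboldCpp
    script lives.
    """
    return [
        sys.executable,          # the current Python interpreter
        launcher,
        "--model", str(model_path),
        "--mmproj", str(mmproj_path),
    ]


if __name__ == "__main__":
    # Placeholder file names; replace them with your own files.
    cmd = build_launch_command("my-model.gguf", "mmproj-model-f16.gguf")
    # Print the command so it can be inspected or pasted into a terminal.
    print(shlex.join(cmd))
```

To actually start the server from Python, the same list can be handed to `subprocess.run(cmd)`.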