LeroyDyer
/

Mixtral_AI_Vision_X_128k_7b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mixtral_AI_Vision_X_128k_7b / README.md

LeroyDyer's picture

Update README.md

44cb2d8 verified 4 months ago

|

history blame contribute delete

No virus

983 Bytes

	If you want to use vision functionality:

	Make sure you are using the latest version of KoboldCpp.
	To use the multimodal capabilities of this model, such as vision, you also need to load the specified mmproj file, you can get it here.

	https://huggingface.co/LeroyDyer/Mixtral_AI_Vision_128k/blob/main/mmproj-model-f16.gguf

	You can load the mmproj by using the corresponding section in the interface:


	KoboldCpp now supports Vision via Multimodal Projectors (aka LLaVA), allowing it to perceive and react to images! Load a suitable --mmproj file or select it in the GUI launcher to use vision capabilities. (Not working on Vulkan)
	Note: This is NOT limited to only LLaVA models, any compatible model of the same size and architecture can gain vision capabilities!
	Simply grab a 200mb mmproj file for your architecture here,

	https://huggingface.co/koboldcpp/mmproj

	load it with --mmproj and stick it into your favorite compatible model, and it will be able to see images as well!