Multi-Image, Multi-Audio, Multi-Turn, Multimodal Bilingual TinyLlama

SigLIP encoder + Whisper encoder + TinyLlama. Source code: https://github.com/mesolitica/multimodal-LLM
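
As a rough illustration of how a vision encoder, an audio encoder, and a language model can share one input sequence, the sketch below lays out a single conversation turn position by position. It assumes the common approach of placing image and audio embeddings inline with the text embeddings; this card does not confirm that the repository does it this way. The `Segment` class, the `build_sequence` function, and all segment lengths are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    kind: str    # "text", "image", or "audio"
    length: int  # number of embedding positions this segment occupies (made-up values)


def build_sequence(segments: List[Segment]) -> List[str]:
    """Flatten segments into a per-position layout of the language model input."""
    layout: List[str] = []
    for seg in segments:
        if seg.kind not in {"text", "image", "audio"}:
            raise ValueError(f"unknown segment kind: {seg.kind}")
        # Each position is labeled by the modality whose embedding would sit there.
        layout.extend([seg.kind] * seg.length)
    return layout


# Hypothetical turn: some text, two images, one audio clip, then more text.
turn = [
    Segment("text", 4),
    Segment("image", 3),
    Segment("image", 3),
    Segment("audio", 5),
    Segment("text", 6),
]
layout = build_sequence(turn)
print(len(layout), layout.count("image"), layout.count("audio"))
```

In a real model each labeled position would hold an embedding vector: text positions from the language model's token embeddings, image and audio positions from the respective encoder outputs after projection to the language model's hidden size.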

Model size: 1.62B params (Safetensors, tensor type BF16)