Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • 6 days ago • 41
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 13 days ago • 76
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 126
Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive By bpan • Apr 9 • 26
A Modular End-to-End Multimodal Learning Method for Structured and Unstructured Data Paper • 2403.04866 • Published Mar 7 • 5
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression Paper • 2311.10794 • Published Nov 17, 2023 • 22