EVLM: An Efficient Vision-Language Model for Visual Understanding Paper • 2407.14177 • Published 15 days ago • 41
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • 29 days ago • 48
Article Build Agentic Workflow using OpenAGI and HuggingFace models By lucifertrj • Jun 26 • 6
Article Building a Vision Mixture-of-Expert Model from several fine-tuned Phi-3-Vision Models By mjbuehler • Jun 12 • 5
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20 • 80
Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 By m-ric • Jun 20 • 25
Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23 • 51
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 88
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 151
Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive By bpan • Apr 9 • 28
A Modular End-to-End Multimodal Learning Method for Structured and Unstructured Data Paper • 2403.04866 • Published Mar 7 • 5
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression Paper • 2311.10794 • Published Nov 17, 2023 • 24