Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese • Paper • 2408.12480 • Published Aug 22, 2024
EraX-Multimodal • Collection • EraX's collection of vision-language models • 5 items • Updated Jan 11
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024