Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 112
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 19 days ago • 113
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 6 days ago • 10
ReSearch Collection Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning" • 5 items • Updated 10 days ago • 4
Model Merging Collection Model merging is a very popular technique in LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 27 days ago • 96
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 26 days ago • 372
Article HuggingFace, IISc partner to supercharge model building on India's diverse languages Feb 27 • 18
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 • 73