DRAMA Collection A collection of small (sub-1B) multilingual dense retrievers that generalize well across a number of tasks and languages. • 3 items • Updated about 1 month ago • 5
view article Article Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required Nov 4, 2024 • 42
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community Dec 9, 2024 • 56
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 11 days ago • 108
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Paper • 2503.12937 • Published 12 days ago • 27
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 119
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 25 days ago • 70
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 9 items • Updated 2 days ago • 30
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 2 days ago • 146
Jamba 1.6 Collection The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, outperforming open model competitors on quality and speed. • 2 items • Updated 23 days ago • 11
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 25 days ago • 68
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated 26 days ago • 25
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 10 days ago • 102
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 16 days ago • 68
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 16 days ago • 299
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 111
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models Paper • 2502.10458 • Published Feb 12 • 34