Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 4 days ago • 89
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago • 240
Article HuggingFace, IISc partner to supercharge model building on India's diverse languages 16 days ago • 14
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 11 days ago • 65
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 8 days ago • 79
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 7 days ago • 72
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 70
Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 198
Running 2.25k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters