view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago • 234
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Paper • 2503.06749 • Published 5 days ago • 21
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 11 days ago • 65
view article Article The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29, 2024 • 25
view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • 25 days ago • 19
view article Article Small Language Models (SLMs): A Comprehensive Overview By jjokah • 20 days ago • 15
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 34 items • Updated 4 days ago • 28
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 203
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published Dec 5, 2024 • 61
Biomedical Collection Models for biomedical research applications, such as radiology report generation and biomedical language understanding. • 9 items • Updated Jan 8 • 10
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 111