view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago β’ 234
π©βπ» OlympicCoder Collection Reasoning datasets and models for competitive coding β’ 4 items β’ Updated 3 days ago β’ 8
view article Article LeRobot goes to driving school: Worldβs largest open-source self-driving dataset 4 days ago β’ 43
Unified Reward Model for Multimodal Understanding and Generation Paper β’ 2503.05236 β’ Published 7 days ago β’ 104
AI Engineering Collection A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book. β’ 238 items β’ Updated 9 days ago β’ 14
view article Article Hugging Face and JFrog partner to make AI Security more transparent 11 days ago β’ 20
view article Article Making Browser-Based Inference Actually Usable By wizenheimer β’ 13 days ago β’ 10
Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training Paper β’ 2502.11191 β’ Published 26 days ago β’ 4
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita π₯ 25 days ago β’ 93
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub about 1 month ago β’ 49
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published Feb 4 β’ 203
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper β’ 2501.18512 β’ Published Jan 30 β’ 27