AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper β’ 2503.19693 β’ Published 12 days ago β’ 72
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback Paper β’ 2503.22230 β’ Published 10 days ago β’ 43
Llama Nemotron Collection Open, Production-ready Enterprise Models β’ 3 items β’ Updated 3 days ago β’ 28
nvidia/Llama-Nemotron-Post-Training-Dataset-v1 Viewer β’ Updated 19 days ago β’ 15.2M β’ 12.8k β’ 327
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper β’ 2503.16219 β’ Published 17 days ago β’ 46
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text β’ Updated 5 days ago β’ 118k β’ 1.06k
EuroBERT Collection Scaling Multilingual Encoders for European Languages β’ 4 items β’ Updated 28 days ago β’ 10
Running on CPU Upgrade 61 61 Leaderboard LLM FR π Track, rank and evaluate open LLMs and chatbots in French
Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper β’ 2503.00808 β’ Published Mar 2 β’ 57
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper β’ 2503.01743 β’ Published Mar 3 β’ 83
Running 79 79 The Essential AI Toolkit π§° A curated collection of AI tools for journalists & creators
Running 2.41k 2.41k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters