view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 4 days ago • 117
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated about 14 hours ago • 74
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 1 day ago • 441k • 1.13k
HuggingFaceFW/fineweb-edu-classifier Text Classification • Updated Nov 17, 2024 • 23.2k • • 171
Running 2.24k 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated 18 days ago • 368k • • 629
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • Updated 18 days ago • 679k • • 466