Running 2.25k 2.25k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 872
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • Jan 31 • 44
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated 18 days ago • 15
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 159