Article • Hugging Face welcomes the Aya Expanse family of multilingual models • By ariG23498 • Oct 24, 2024 • 10
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 135
Article • Improving Hugging Face Training Efficiency Through Packing with Flash Attention • Aug 21, 2024 • 25
Understanding Reference Policies in Direct Preference Optimization Paper • 2407.13709 • Published Jul 18, 2024 • 16
Article • RegMix: Data Mixture as Regression for Language Model Pre-training • By SivilTaram • Jul 11, 2024 • 10
Paloma • Collection • Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated Nov 27, 2024 • 15