Donación Somos600M Collection Collection of the corpora donated for the SomosNLP 2024 Hackathon: #somos600M • 4 items • Updated Mar 9 • 1
Awesome SFT datasets Collection A curated list of interesting datasets for fine-tuning language models. • 43 items • Updated 25 days ago • 88
🤖 TinyLlama Alignment Collection TinyLlama-1.1B model aligned on Intel's Orca dataset. Comparison of DPO/IPO/KTO. • 3 items • Updated Mar 22 • 1
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws Paper • 2401.00448 • Published Dec 31, 2023 • 25