Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated 14 days ago • 41
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 1 day ago • 43
view article Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • May 16 • 17
Donación Somos600M Collection Colección de los corpus donados para el Hackathon de SomosNLP 2024: #somos600M • 4 items • Updated Mar 9 • 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 112
🤖 TinyLlama Alignment Collection TinyLlama-1.1B model aligned on Intel's Orca dataset. Comparison of DPO/IPO/KTO. • 3 items • Updated Mar 22 • 1
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws Paper • 2401.00448 • Published Dec 31, 2023 • 27