Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 35 items • Updated 11 days ago • 31
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 20
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 214