Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published 22 days ago • 45
TinySwallow Collection Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated Jan 30 • 16
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21 • 54
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 159
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 170
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 22 days ago • 246
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 22 days ago • 35
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 and 1 other • Nov 21, 2024 • 35
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 71
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other • Oct 14, 2024 • 77
view article Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML • Oct 2, 2024 • 56