SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 9 days ago • 160
view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11 • 74
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 97
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 975
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 123
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 171
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 209
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 81
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 594
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 13 days ago • 21
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 11 days ago • 61