SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 15 days ago • 168
📚 LLM pretraining datasets Collection A collection of datasets for LLM pretraining • 9 items • Updated Mar 7 • 6
future-technologies/Universal-Transformers-Dataset Viewer • Updated 7 days ago • 70.1M • 4.12k • 68
Common Corpus Collection Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 10
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 116
view article Article You could have designed state of the art positional encoding Nov 25, 2024 • 233
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 144