SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 16 days ago • 170
📚 LLM pretraining datasets Collection A collection of datasets for LLM pretraining • 9 items • Updated Mar 7 • 6
Common Corpus Collection Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 10
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 116
Article You could have designed state of the art positional encoding Nov 25, 2024 • 233
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 144