2024 Interconnects Artifacts Collection Models & datasets mentioned in the bottom section of posts! • 278 items • Updated 3 days ago • 5
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 17 days ago • 43
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 9 days ago • 118
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 16 items • Updated 10 days ago • 88
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 16 days ago • 87
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 26 days ago • 29
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 24 days ago • 256
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31 • 17
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 20 days ago • 195
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python Oct 22 • 43
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 24 days ago • 289
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 16 days ago • 545
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 24 days ago • 435
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224