🪐 SmolLM A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos. Collection by HuggingFaceTB 6 days ago 98 HuggingFaceTB/smollm-corpus Viewer • Updated 6 days ago • 237M • 1.34k • 85 HuggingFaceTB/SmolLM-135M Text Generation • Updated 3 days ago • 26.7k • 57 HuggingFaceTB/SmolLM-360M Text Generation • Updated 3 days ago • 8.98k • 23 HuggingFaceTB/SmolLM-1.7B Text Generation • Updated 3 days ago • 10.4k • 75
H2O Danube3 Collection by h2oai 6 days ago 47 h2oai/h2o-danube3-4b-chat Text Generation • Updated 7 days ago • 3.98k • 50 h2oai/h2o-danube3-500m-base Text Generation • Updated 4 days ago • 1.6k • 21 h2oai/h2o-danube3-4b-base Text Generation • Updated 7 days ago • 531 • 15 h2oai/h2o-danube3-500m-chat Text Generation • Updated 4 days ago • 1.17k • 18
NuminaMath Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize Collection by AI-MO about 23 hours ago 31 AI-MO/NuminaMath-CoT Viewer • Updated 3 days ago • 860k • 54 • 37 AI-MO/NuminaMath-TIR Viewer • Updated 3 days ago • 72.5k • 23 • 18 AI-MO/NuminaMath-7B-CoT Text Generation • Updated 3 days ago • 42 • 8 AI-MO/NuminaMath-7B-TIR Text Generation • Updated 3 days ago • 2k • 252
DCLM DCLM Models + Datasets Collection by mlfoundations 4 days ago 25 apple/DCLM-7B Updated 1 day ago • 954 • 376 TRI-ML/DCLM-1B Updated 4 days ago • 30 • 10 mlfoundations/dclm-7b-it Updated 4 days ago • 17 • 4 apple/DCLM-7B-8k Updated 4 days ago • 206 • 23
Qwen2 Qwen2 language models: pretrained and instruction-tuned models in 5 sizes (0.5B, 1.5B, 7B, 57B-A14B, and 72B). Collection by Qwen Jun 6 259 Running 530 💻 Qwen2 72B Instruct Qwen/Qwen2-72B-Instruct Text Generation • Updated Jun 6 • 191k • 557 Qwen/Qwen2-72B Text Generation • Updated Jun 6 • 41.9k • 158 Qwen/Qwen2-7B-Instruct Text Generation • Updated Jun 6 • 237k • 409
Meta Llama 3 This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases. Collection by meta-llama Apr 18 647 meta-llama/Meta-Llama-3-8B Text Generation • Updated May 13 • 2.07M • 5.36k meta-llama/Meta-Llama-3-8B-Instruct Text Generation • Updated May 29 • 2.76M • 3.16k meta-llama/Meta-Llama-3-70B-Instruct Text Generation • Updated May 29 • 485k • 1.33k meta-llama/Meta-Llama-3-70B Text Generation • Updated May 13 • 318k • 774
DCLM DCLM Models + Datasets Collection by apple 4 days ago 14 apple/DCLM-7B Updated 1 day ago • 954 • 376 apple/DCLM-7B-8k Updated 4 days ago • 206 • 23 mlfoundations/dclm-baseline-1.0 Preview • Updated 3 days ago • 145 • 79 TRI-ML/DCLM-1B Updated 4 days ago • 30 • 10
InternVL 2.0 Expanding Performance Boundaries of Open-Source MLLM Collection by OpenGVLab 3 days ago 45 Running 182 ⚡ InternVL OpenGVLab/InternVL2-Llama3-76B Image-Text-to-Text • Updated 1 day ago • 59.6k • 87 OpenGVLab/InternVL2-40B Image-Text-to-Text • Updated 1 day ago • 2.16k • 43 OpenGVLab/InternVL2-26B Image-Text-to-Text • Updated 1 day ago • 10.1k • 79
LLaVa-Interleave LLaVa models that extend the model capabilities to multi-image, multi-frame (video), and multi-patch (single-image) scenarios. Collection by llava-hf 12 days ago 11 llava-hf/llava-interleave-qwen-0.5b-hf Image-Text-to-Text • Updated about 23 hours ago • 100k • 13 llava-hf/llava-interleave-qwen-7b-hf Image-Text-to-Text • Updated 3 days ago • 996 • 8 llava-hf/llava-interleave-qwen-7b-dpo-hf Image-Text-to-Text • Updated 3 days ago • 148 • 1
Chameleon Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. Collection by facebook 12 days ago 19 facebook/chameleon-7b Image-Text-to-Text • Updated about 16 hours ago • 3.27k • 115 facebook/chameleon-30b Image-Text-to-Text • Updated about 16 hours ago • 166 • 60