Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 8 items β’ Updated 19 days ago β’ 397
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated 1 day ago β’ 93
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper β’ 2312.12456 β’ Published Dec 16, 2023 β’ 42
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper β’ 2402.17764 β’ Published Feb 27, 2024 β’ 610
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 46 items β’ Updated 16 days ago β’ 563
BRAG-v0.1 Collection BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models with size 1.5b, 7b and 8b. β’ 4 items β’ Updated Aug 4, 2024 β’ 13
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. β’ 4 items β’ Updated Jun 27, 2024 β’ 149
SLIM Models Collection Structured Language Instruction Models (SLIMs) β’ 31 items β’ Updated Feb 10 β’ 32
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 121 items β’ Updated Jan 31, 2024 β’ 521