REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper β’ 2501.03262 β’ Published 13 days ago β’ 82
Deliberation in Latent Space via Differentiable Cache Augmentation Paper β’ 2412.17747 β’ Published 25 days ago β’ 29
Smaller Language Models Are Better Instruction Evolvers Paper β’ 2412.11231 β’ Published Dec 15, 2024 β’ 27
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper β’ 2410.08371 β’ Published Oct 10, 2024 β’ 1
GGUF Llama-3.2-Instruct-OQ8_0-F32.EF32.IQ4_K-Q8_0 IQuants Collection Custom GGUF quants of Metaβs Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32 β’ 3 items β’ Updated Dec 13, 2024 β’ 2
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 β’ 15 items β’ Updated Dec 6, 2024 β’ 557
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 β’ 40 items β’ Updated Nov 28, 2024 β’ 260
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper β’ 2406.08464 β’ Published Jun 12, 2024 β’ 67
Recent highlights Collection Some recent models worth checking out β’ 18 items β’ Updated Nov 1, 2024 β’ 47
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 45 items β’ Updated Nov 28, 2024 β’ 462
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation Paper β’ 2402.16880 β’ Published Feb 18, 2024 β’ 2
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Dec 6, 2024 β’ 640
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. β’ 4 items β’ Updated Jun 27, 2024 β’ 147
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell β’ Apr 28, 2024 β’ 37
Honorable mentions Collection Some models I've made and I liked but isn't part of a serie. β’ 10 items β’ Updated Feb 4, 2024 β’ 6