llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins • 9 items • Updated Mar 12 • 30
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 308
Recommended small models Collection This is everything recent smaller than ~25B parameters that are high quality/reputable • 19 items • Updated Nov 30, 2024 • 102
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 256
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 258