opensearch-project/opensearch-neural-sparse-encoding-doc-v2-distill Fill-Mask • Updated 3 days ago • 1.83M • 7
view article Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 110
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 Text Generation • Updated Aug 7, 2024 • 354k • 66
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 130