Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 574
Meta-Llama-3.1-Quantized Collection Collection of quantized Llama 3.1 models (8B & 70B versions for now), using bitsandbytes. • 4 items • Updated Aug 28, 2024 • 1
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 77