view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware By Titus-von-Koeller and 8 others • Mar 20, 2024 • 28
view article Article Personal Copilot: Train Your Own Coding Assistant By smangrul and 1 other • Oct 27, 2023 • 46
view article Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 22
view article Article The Falcon has landed in the Hugging Face ecosystem By lvwerra and 7 others • Jun 5, 2023 • 12
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others • May 24, 2023 • 129
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 43
view article Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other • Feb 10, 2023 • 63
view article Article Accelerate Large Model Training using DeepSpeed By smangrul and 1 other • Jun 28, 2022 • 4
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel By smangrul and 1 other • May 2, 2022 • 4