Fine-tune Llama3-8B using torchtune

#84
by realdanielbyrne - opened

Fine-tuning Llama3-8B with torchtune: https://github.com/pytorch/torchtune

torchtune provides LoRA, QLoRA, and full fine-tuning recipes for Llama3-8B that run on one or more GPUs.

Fine-tuning recipes

torchtune provides the following fine-tuning recipes.

| Training | Fine-tuning Method |
|---|---|
| Distributed Training [1 to 8 GPUs] | Full [code, example], LoRA [code, example] |
| Single Device / Low Memory [1 GPU] | Full [code, example], LoRA + QLoRA [code, example] |
| Single Device [1 GPU] | DPO [code, example] |

 

The basic command for a single-device LoRA fine-tune is:

 tune run lora_finetune_single_device --config llama3/8B_lora_single_device
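
The other rows of the table follow the same `tune run <recipe> --config <config>` pattern. Below is a rough sketch of what the variants might look like, wrapped in a small dry-run helper so the commands can be inspected before anything is launched. The helper `run_tune` and the `DRY_RUN` variable are hypothetical names made up for this example. The QLoRA config name, the distributed recipe and config names, and the `batch_size` / `epochs` override keys are assumptions based on the torchtune repository and may differ between versions, so check them against your install first.

```shell
#!/usr/bin/env bash
# Dry-run helper (hypothetical, not part of torchtune): prints the command
# instead of executing it. Set DRY_RUN=0 to actually launch training, which
# requires torchtune to be installed and the model weights to be downloaded.
DRY_RUN="${DRY_RUN:-1}"

run_tune() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "tune run $*"
  else
    tune run "$@"
  fi
}

# The command from the post: single-device LoRA.
run_tune lora_finetune_single_device --config llama3/8B_lora_single_device

# Same recipe with key=value config overrides appended.
# Assumption: batch_size and epochs are keys in this config.
run_tune lora_finetune_single_device --config llama3/8B_lora_single_device \
  batch_size=2 epochs=1

# Assumption: the QLoRA config reuses the single-device LoRA recipe.
run_tune lora_finetune_single_device --config llama3/8B_qlora_single_device

# Assumption: distributed LoRA on 4 GPUs uses this recipe and config name.
run_tune --nproc_per_node 4 lora_finetune_distributed --config llama3/8B_lora
```

With `DRY_RUN` left at its default, the script only echoes the four command lines, which makes it easy to compare them against the recipe and config names your torchtune version actually ships.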
