Fine-Tuning GPT-NeoX-20B using Hugging Face Transformers

#16
by Dulanjaya

Hi, I am new to GPT-NeoX-20B.

Can you please explain whether I can fine-tune the slim version (40 GB) of this model on 2x A6000 GPUs using the transformers library?

Thank you!

As far as I know, you need at least 42 GB of free memory to load the model checkpoint, even with the low_cpu_mem_usage=True argument. For fine-tuning you might need even more.
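
For reference, a minimal sketch of loading the slim fp16 checkpoint with transformers might look like the following (this assumes the EleutherAI/gpt-neox-20b model id and that the accelerate package is installed for device_map; the fp16 weights alone are roughly 40 GB, so this only covers loading, not training):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-neox-20b"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# low_cpu_mem_usage=True avoids materializing a second full copy of the
# weights in CPU RAM while the checkpoint shards are loaded.
# torch_dtype=torch.float16 keeps the weights at ~40 GB instead of ~80 GB.
# device_map="auto" (requires the accelerate package) splits the layers
# across the two A6000s so neither card has to hold the whole model.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
    device_map="auto",
)
```

For actual fine-tuning on 2x A6000 (96 GB combined), full optimizer states for 20B parameters are much larger than the available GPU memory, so people typically reach for parameter-efficient methods (e.g. LoRA via the peft library) or DeepSpeed ZeRO with CPU offloading rather than plain full fine-tuning.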
