Fine Tuning // Download Full Weights

#23

by idop11 - opened Feb 3, 2023

Feb 3, 2023

Hi,

Great work on this model. Initial results are very impressive. Is there any chance to be able to download the full weights of the model (70GB) to be able to run fine tuning on using a TPU? Want to fine tune GPT JT on custom prompt dataset.

Looking to run fine tuning following this guide https://github.com/kingoflolz/mesh-transformer-jax

Thanks!

kobalsky

Feb 5, 2023

•

edited Feb 5, 2023

@idop11 apparently the model cannot be fine-tuned https://huggingface.co/togethercomputer/GPT-JT-6B-v1/discussions/15 at this time

juewang

Together org Feb 5, 2023

@idop11 Thanks for your interest in fine-tuning our model! Unfortunately, our model was not trained using mesh-transformer-jax, and the format of full weights (including optimizer states) might not be compatible with their code base.
@kobalsky The model can be fine-tuned, but necessary adjustments are required, check out this~

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment