Finetune mpt-7b like storywriter but with another dataset
We want to finetune the mpt-7b model the same way the storywriter model was created, but on a different dataset. However, we could not find any instructions for doing this. Is there any published code for finetuning mpt-7b into a storywriter-style model? Can anyone help us?
Here is some documentation on how to do sequence length adaptation tuning. https://github.com/mosaicml/llm-foundry/blob/tutorial/TUTORIAL.md#domain-adaptation-and-sequence-length-adaptation
You will need to set your device microbatch size lower to fit the long sequences, and you may need a few nodes of 80GB A100s... which we have! Sign up here
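For anyone who wants a rough picture of what such a run involves before the llm-foundry tutorial, here is a minimal sketch using the Hugging Face Trainer rather than the Composer/llm-foundry recipe linked above. The dataset path, extended sequence length, and hyperparameters below are placeholders, and overriding `max_seq_len` on the MPT config relies on its ALiBi positional scheme as described on the MPT-7B model card; in practice a context this long also needs multi-GPU sharding (FSDP or DeepSpeed) and a tiny per-device batch size, as noted above.

```python
# Illustrative sketch only: finetune mosaicml/mpt-7b at a longer sequence length
# on a custom text dataset. File names and hyperparameters are placeholders.
import torch
from datasets import load_dataset
from transformers import (AutoConfig, AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

# MPT-7B uses ALiBi, so the context window can be extended via the config override.
config = AutoConfig.from_pretrained("mosaicml/mpt-7b", trust_remote_code=True)
config.max_seq_len = 8192  # StoryWriter went much further (65k); start smaller.

tokenizer = AutoTokenizer.from_pretrained("mosaicml/mpt-7b")
tokenizer.pad_token = tokenizer.eos_token  # tokenizer has no pad token by default

model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b", config=config, trust_remote_code=True, torch_dtype=torch.bfloat16
)

# Placeholder dataset: one long story per line in a plain-text file.
raw = load_dataset("text", data_files={"train": "my_long_stories.txt"})

tokenized = raw.map(lambda batch: tokenizer(batch["text"]),
                    batched=True, remove_columns=["text"])

block_size = config.max_seq_len

def group_texts(batch):
    # Concatenate all tokens, then split into fixed-length blocks of block_size.
    concat = {k: sum(batch[k], []) for k in batch.keys()}
    total = (len(concat["input_ids"]) // block_size) * block_size
    result = {k: [v[i:i + block_size] for i in range(0, total, block_size)]
              for k, v in concat.items()}
    result["labels"] = [ids.copy() for ids in result["input_ids"]]
    return result

lm_dataset = tokenized.map(group_texts, batched=True)

training_args = TrainingArguments(
    output_dir="mpt-7b-long-context-ft",
    per_device_train_batch_size=1,      # keep the microbatch tiny for long sequences
    gradient_accumulation_steps=16,     # recover an effective batch size
    learning_rate=1e-5,                 # placeholder; tune for your data
    num_train_epochs=1,
    bf16=True,
    logging_steps=10,
    save_strategy="epoch",
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=lm_dataset["train"],
    data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False),
)
trainer.train()
```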
@sam-mosaic what's the wait time for getting in on that waitlist? I applied a while back and haven't heard, not sure if I should look elsewhere or what.