Model does not stop generating new tokens.

#4
by MuntasirHossain - opened

I have followed the guide https://huggingface.co/blog/mlabonne/orpo-llama-3 (and the Colab notebook) to fine-tune Mistral-7B-v0.3 on a 2.5k-sample subset. However, the model does not stop generating new tokens. I tried adding `eos_token_id=tokenizer.eos_token_id` to signal the end of a sequence to the model, but that didn't work either. Any clue?

Here is the fine-tuned model: https://huggingface.co/MuntasirHossain/Orpo-Mistral-7B-v0.3.
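
For context, here is roughly the generation call I'm using (a minimal sketch; the prompt and generation settings are just illustrative):

```python
# Minimal sketch of what I tried; prompt and settings are illustrative
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MuntasirHossain/Orpo-Mistral-7B-v0.3"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("What is ORPO fine-tuning?", return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    eos_token_id=tokenizer.eos_token_id,  # explicitly passed, yet generation still runs to max_new_tokens
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```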

I checked your tokenizer config and everything is correct. I think you might want to train the model on more tokens so it correctly learns to output the EOS token (2.5k is quite small).
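
As a quick sanity check, you could also verify that the formatted training samples actually end with the EOS token. A rough sketch, assuming the same dataset and chat-template setup as in the guide (the exact column names may differ):

```python
# Rough sketch: verify the formatted training examples terminate with the EOS token
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("MuntasirHossain/Orpo-Mistral-7B-v0.3")
print(tokenizer.eos_token, tokenizer.eos_token_id)

# Dataset used in the guide; "chosen" holds the preferred conversation
dataset = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train")
chosen = tokenizer.apply_chat_template(dataset[0]["chosen"], tokenize=False)

# If this prints False, the model never sees EOS at the end of its targets
# during training and so never learns to stop
print(chosen.rstrip().endswith(tokenizer.eos_token))
```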

Thank you! I thought about that. I tested the model you fine-tuned with only 1k samples, and that worked fine, with no issues stopping generation. So I thought 2.5k would be good enough for the demo, but then I was a bit surprised by the issue!

I think I had the same issue with the version trained on 1k samples. The current version has been trained on the full dataset (but for just 1 epoch, I believe).

Oh I see! Your model card says it was fine-tuned on 1k samples (you might want to update it!), so I didn't want to start with a large sample size for a demo.
Btw, thanks again for the excellent guide on ORPO.

You're right, just updated it. Thanks and good luck!

In my case, the training prompt was different from the inference prompt when I followed the tutorial. I added some comments here: #6
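
A quick way to check for that mismatch is to build the inference prompt with the same chat template used during training. A rough sketch (assuming the ChatML template that the tutorial sets up via trl's `setup_chat_format`, as I recall):

```python
# Rough sketch: format the inference prompt with the training-time chat template
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("MuntasirHossain/Orpo-Mistral-7B-v0.3")

messages = [{"role": "user", "content": "What is ORPO fine-tuning?"}]

# Same template the trainer applied to the data, plus the generation prompt
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(repr(prompt))

# Feeding this formatted prompt to model.generate() (instead of a raw string)
# gives the model the same special tokens it saw in training, so it can emit EOS.
```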
