Finetuning
#10
by kaidanti · opened
I was trying to fine-tune the model, but kept running into issues with training examples being skipped:

`This instance will be ignored in loss calculation. Note, if this happens often, consider increasing the max_seq_length.`
I increased `max_seq_length` up to 128_000 but still keep running into this issue.
What are you using for fine-tuning? Here is a reference implementation from the Meta team:
https://github.com/meta-llama/llama-recipes/tree/main/recipes/quickstart/finetuning
Found the issue: I was using `DataCollatorForCompletionOnlyLM` and had the wrong response templates.
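For anyone who hits the same warning: `DataCollatorForCompletionOnlyLM` searches each tokenized example for the response template's token ids and masks everything before it so that loss is only computed on the completion. If the template never matches (e.g. it doesn't tokenize to the same ids as in the formatted prompt), the whole example gets masked and is "ignored in loss calculation". A minimal sketch of that masking logic (hypothetical token ids, not the real TRL source):

```python
# Sketch of completion-only label masking, assuming hypothetical token ids.
IGNORE_INDEX = -100  # labels set to -100 are excluded from the loss

def mask_prompt(input_ids, response_template_ids):
    """Mask labels up to and including the response template.

    If the template is not found, every label is masked, so the
    instance contributes nothing to the loss (the warning's case).
    """
    n = len(response_template_ids)
    for start in range(len(input_ids) - n + 1):
        if input_ids[start:start + n] == response_template_ids:
            end = start + n
            return [IGNORE_INDEX] * end + input_ids[end:]
    # Template not found -> whole example ignored in loss calculation.
    return [IGNORE_INDEX] * len(input_ids)

ids = [1, 5, 9, 42, 43, 7, 8]      # pretend 42, 43 encode "### Response:"
print(mask_prompt(ids, [42, 43]))  # completion tokens 7, 8 keep their labels
print(mask_prompt(ids, [99]))      # wrong template: all labels masked
```

So raising `max_seq_length` doesn't help if the template simply never appears in the tokenized text; the fix is to make the `response_template` match the chat/prompt format exactly (in token-id space).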
kaidanti changed discussion status to closed
Thanks! Looking forward to hearing about your experiments :)