finetune reproduction

#9
by ZhangAI - opened

Thank you for open-sourcing this work. I am reproducing your fine-tuning process from the code on GitHub. Do my results of train_loss = 0.16 and eval_loss = 0.21 on the 75k dataset match yours? I will continue training on the 110k dataset.
I trained for 4 epochs and indeed started overfitting after the second epoch.
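Watching per-epoch eval loss is the usual way to spot the point where overfitting begins. A minimal sketch of that check (the loss values below are hypothetical, not from the actual run, and `find_overfit_epoch` is an illustrative helper, not part of the repository's code):

```python
def find_overfit_epoch(eval_losses, patience=1):
    """Return the 1-based epoch with the best eval loss, stopping once
    the loss has failed to improve for `patience` consecutive epochs."""
    best, best_epoch, bad = float("inf"), 0, 0
    for epoch, loss in enumerate(eval_losses, start=1):
        if loss < best:
            best, best_epoch, bad = loss, epoch, 0
        else:
            bad += 1
            if bad >= patience:
                break
    return best_epoch

# Hypothetical per-epoch eval losses: improvement stops after epoch 2,
# matching the overfitting pattern described above.
losses = [0.30, 0.21, 0.24, 0.28]
print(find_overfit_epoch(losses))  # → 2
```

With the Hugging Face `Trainer`, the same effect is typically achieved by setting `load_best_model_at_end=True` and adding an `EarlyStoppingCallback`, so training halts once eval loss stops improving.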
