Provided tuning script maybe error

#59

by efei - opened May 23

Discussion

efei

May 23

•

edited May 23

for trl script, compute loss use all tokens exclude <pad>
for colab script, compute loss use all tokens exclude <pad> <image>
there are also <fake_image_token> and user turn should not be computed.

VictorSanh

May 23

that's indeed correct! good catch @efei
@edbeeching can we change your trl gist?
Niels fixed a discrepancy earlier this week: https://github.com/huggingface/transformers/pull/30898#issuecomment-2124884284

efei changed discussion status to closed Jun 14

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment