How to limit the loss computation to the answer?

#44
by schwarzwalder - opened

In the idefics2 paper, there is a mention of computing the loss only on the answer part of the VQA task. I could not find this in the fine-tuning Colab.
Could you please provide a short snippet for that?

Thanks in advance.

HuggingFaceM4 org

Yes, it's true that it's not present in the Google Colab.

In our codebase, it is done in a hacky way in the packing: we tokenize the input, find the positions between Assistant: and the next <end_of_utterance>, and compute the loss only on those ids.
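
For reference, here is a minimal sketch of that masking, assuming the idefics2 chat format where each answer sits between Assistant: and <end_of_utterance>. The helper name is illustrative (not our actual packing code), and the token ids for the Assistant: marker can depend on the surrounding context in your template, so check them against your own tokenization.

```python
import torch
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("HuggingFaceM4/idefics2-8b")
tokenizer = processor.tokenizer


def mask_labels_to_answers(input_ids: torch.Tensor) -> torch.Tensor:
    """Return labels where only the answer tokens contribute to the loss.

    input_ids is a (batch, seq) tensor. Everything outside the spans between
    "Assistant:" and the next <end_of_utterance> is set to -100, which the
    cross-entropy loss in Transformers ignores.
    """
    labels = input_ids.clone()
    assistant_ids = tokenizer("Assistant:", add_special_tokens=False).input_ids
    eou_id = tokenizer.convert_tokens_to_ids("<end_of_utterance>")

    for row in range(labels.size(0)):
        ids = input_ids[row].tolist()
        in_answer = False
        i = 0
        while i < len(ids):
            if not in_answer and ids[i : i + len(assistant_ids)] == assistant_ids:
                # Mask the "Assistant:" marker itself, then keep the loss
                # on the tokens that follow it.
                labels[row, i : i + len(assistant_ids)] = -100
                i += len(assistant_ids)
                in_answer = True
            elif in_answer and ids[i] == eou_id:
                # Keep the loss on <end_of_utterance> so the model learns
                # when to stop, then mask again until the next answer.
                in_answer = False
                i += 1
            else:
                if not in_answer:
                    labels[row, i] = -100
                i += 1
    return labels
```

In a data collator you would then pass the masked tensor as the labels, e.g. `batch["labels"] = mask_labels_to_answers(batch["input_ids"])`, so that `model(**batch).loss` only covers the answer tokens.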

schwarzwalder changed discussion status to closed
