evaluation loss not calculated during training?

#43
by Iamexperimenting - opened

Hi @suryabhupa @ybelkada, I looked at the examples (https://huggingface.co/google/gemma-7b/tree/main/examples) and noticed that only the training dataset is passed during training; no evaluation dataset is included.

I would like to know whether there is any theoretical reason for not evaluating the model during training.

Also, I just wanted to clarify one thing: what loss metric is used during training? The script only reports a training loss while it runs.

Google org

The loss during training is the usual log-likelihood (cross-entropy) loss. Sometimes we also track other things, like gradient or update norms, to make sure training is working properly.
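For reference, a minimal sketch of how that loss is computed with the `transformers` causal-LM API (the checkpoint and input text here are just placeholders):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint; any causal LM checkpoint works the same way.
model_id = "google/gemma-7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

batch = tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")

# Passing labels makes the model return the mean per-token negative
# log-likelihood (cross-entropy) over the sequence -- the same quantity
# the Trainer reports as the training loss.
with torch.no_grad():
    outputs = model(**batch, labels=batch["input_ids"])
print(outputs.loss)
```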

There's no theoretical reason not to evaluate the model; for finetuning jobs, we could consider evaluating perplexity on an evaluation set, but it turns out that perplexity may not be very indicative of downstream performance, e.g. human preferences.
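If you do want an evaluation loss (and perplexity) during fine-tuning, a minimal sketch along these lines should work; the dataset, hyperparameters, and checkpoint below are placeholders, not the exact setup from the linked examples:

```python
import math
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_id = "google/gemma-7b"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Placeholder dataset; swap in your own train/validation splits.
raw = load_dataset("Abirate/english_quotes")

def tokenize(batch):
    return tokenizer(batch["quote"], truncation=True, max_length=512)

tokenized = raw.map(tokenize, batched=True, remove_columns=raw["train"].column_names)
splits = tokenized["train"].train_test_split(test_size=0.1)

args = TrainingArguments(
    output_dir="gemma-finetune",
    eval_strategy="steps",   # "evaluation_strategy" on older transformers versions
    eval_steps=100,
    per_device_train_batch_size=1,
    logging_steps=10,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=splits["train"],
    eval_dataset=splits["test"],  # this is what the linked examples leave out
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)

trainer.train()
metrics = trainer.evaluate()
# eval_loss is the mean NLL on the held-out split; exp(loss) gives perplexity.
print(metrics["eval_loss"], math.exp(metrics["eval_loss"]))
```

As noted above, the resulting perplexity is easy to track but may not correlate well with downstream performance, so treat it as a sanity check rather than the main selection criterion.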

Thanks very much, @suryabhupa.

Iamexperimenting changed discussion status to closed
