validation scores

#10

by AntonioMartini - opened Sep 7, 2023

Sep 7, 2023

are the validation scores in Figure 4 of the paper the ones for the normal or instruct model? I would expect the validation score for the instruct version to be lower than the normal model due to the randomisation element in the instructions generation.

Thanks,
Antonio

roneneldan

Owner Sep 20, 2023

These are validation scores for the non-instruct dataset. On the instruct dataset, the validation loss is a bit lower (perhaps because 1. Given the header, there is considerably less entropy in the story itself, 2. The headers themselves have many predictable tokens).

AntonioMartini changed discussion status to closed Sep 20, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment