How to optimize the loss function?

#1
by nidong - opened

According to the InstructGPT paper, the current loss function is a pairwise loss, but I have found that the gap between the output scores cannot be widened. Is there any direction for solving this problem?
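
For context, the pairwise loss described in the InstructGPT paper only penalizes the relative ordering of each chosen/rejected pair, not the size of the gap. A minimal PyTorch sketch (the function name is illustrative, not taken from this repo):

```python
import torch
import torch.nn.functional as F

def pairwise_loss(chosen_rewards: torch.Tensor,
                  rejected_rewards: torch.Tensor) -> torch.Tensor:
    # InstructGPT-style pairwise ranking loss:
    #   -log(sigmoid(r_chosen - r_rejected)), averaged over pairs
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Toy reward-model scores for three preference pairs
chosen = torch.tensor([1.2, 0.4, 0.9])
rejected = torch.tensor([0.8, 0.5, -0.1])
print(pairwise_loss(chosen, rejected))
# The loss depends only on the score difference within each pair,
# so absolute scores (and the gap) are free to stay small once
# each pair is ordered correctly.
```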

OpenAssistant org

"the output scores" you are referring to, is it this model or something you are currently facing? Cause InstructGPT did have a mean adjusting step where they make sure the average rank scores in their datasets have a zero mean
