synthetic-instruct-gptj-pairwise pairwise data how to pre-process for train data

by chaochaoli - opened Dec 18, 2023

Dec 18, 2023

All models are train on these dataset with a same split seed across datasets (if validation split wasn't available)

1、webgpt_comparisons
2、summarize_from_feedback
3、synthetic-instruct-gptj-pairwise
4、anthropic_hh-rlhf
all these data have different format，how to Processed into a unified form？
thks

youngia

Oct 7, 2024

boaventura

Oct 7, 2024

hi

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment