pragsri8/gemma-9b-it_bs128_lr1e-5_carma_100k_iter2_w-verif_upgradeall-degrade0p2_rrm-neutrals0p34 Updated 3 days ago • 4
pragsri8/ultrafeedback_60658_preference_dataset_original_neutrals_filtered_improve-degrade_filtered0p1 Viewer • Updated about 10 hours ago • 302k
pragsri8/ultrafeedback_60658_preference_dataset_original_neutrals_unfiltered_improve-degrade_filtered0p2 Viewer • Updated about 10 hours ago • 236k
pragsri8/ultrafeedback_60658_preference_dataset_original_plus_filtered_improved_degraded_threshold0p1 Viewer • Updated about 21 hours ago • 274k
pragsri8/ultrafeedback_60658_preference_dataset_original_plus_filtered_improved_degraded_threshold0p2 Viewer • Updated about 22 hours ago • 198k
pragsri8/ultrafeedback_60658_preference_dataset_verified-improved-degraded-responses_probA Viewer • Updated about 22 hours ago • 587k
pragsri8/ultrafeedback_60658_preference_dataset_verified-improved-degraded-responses Viewer • Updated 1 day ago • 587k • 9
pragsri8/ultrafeedback_60658_preference_dataset_original_plus_verified-improved-degraded-responses Viewer • Updated 1 day ago • 648k • 10
pragsri8/ultrafeedback_61k_rrm_sampled_aug_plus_original_wo_neutrals Viewer • Updated 4 days ago • 99.5k • 3