PJMixers/argilla_Capybara-Preferences-Filtered-PreferenceShareGPT Viewer • Updated 30 days ago • 14.8k • 1 • 1
PJMixers/argilla_ultrafeedback-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated 30 days ago • 60.9k • 1 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated 30 days ago • 158k • 1 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-quality-preferences-cleaned-PreferenceShareGPT Viewer • Updated 30 days ago • 155k • 1 • 1
PJMixers/argilla_distilabel-math-preference-dpo-PreferenceShareGPT Viewer • Updated 30 days ago • 2.42k • 3
PJMixers/Doctor-Shotgun_theory-of-mind-dpo-PreferenceShareGPT Viewer • Updated 30 days ago • 539 • 5 • 1
PJMixers/tatsu-lab_alpaca_farm_human_preference-PreferenceShareGPT Viewer • Updated 30 days ago • 3.8k • 1 • 1
PJMixers/CyberNative_Code_Vulnerability_Security_DPO-PreferenceShareGPT Viewer • Updated 30 days ago • 4.66k • 27
PJMixers/vicgalle_configurable-system-prompt-multitask-PreferenceShareGPT Viewer • Updated 30 days ago • 1.95k • 3 • 1
PJMixers/efederici_alpaca-vs-alpaca-orpo-dpo-PreferenceShareGPT Viewer • Updated 30 days ago • 49.2k • 2
PJMixers/PKU-Alignment_PKU-SafeRLHF-Better-PreferenceShareGPT Viewer • Updated 30 days ago • 330k • 4 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Safer-PreferenceShareGPT Viewer • Updated 30 days ago • 330k • 1 • 1
PJMixers/ProlificAI_social-reasoning-rlhf-PreferenceShareGPT Viewer • Updated 30 days ago • 3.82k • 1 • 1
PJMixers/trl-internal-testing_hh-rlhf-trl-style-PreferenceShareGPT Viewer • Updated 30 days ago • 169k • 1
PJMixers/tasksource_oasst2_pairwise_rlhf_reward-PreferenceShareGPT Viewer • Updated 30 days ago • 28.4k • 119 • 1
PJMixers/jondurbin_contextual-dpo-v0.1-PreferenceShareGPT Viewer • Updated 29 days ago • 1.37k • 1 • 1
PJMixers/Undi95_Weyaxi-humanish-dpo-project-noemoji-PreferenceShareGPT Viewer • Updated 18 days ago • 1.53k • 3