Datasets for the paper 'Understanding Impact of Human Feedback via Influence Functions'
Taywon Min
Taywon
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
12 days ago
Taywon/saferlhf_sft_with_system
published
a dataset
12 days ago
Taywon/saferlhf_sft_with_system
updated
a dataset
21 days ago
Taywon/saferlhf_sft
Organizations
None yet
Collections
1
models
4
datasets
8
Taywon/saferlhf_sft_with_system
Viewer
•
Updated
•
12k
•
31
Taywon/saferlhf_sft
Viewer
•
Updated
•
12k
•
52
Taywon/HH_chosen_sft
Viewer
•
Updated
•
125k
•
25
Taywon/HH_full_parsed
Viewer
•
Updated
•
125k
•
26
Taywon/HH_sycophancy_biased_15k_parsed
Viewer
•
Updated
•
16.1k
•
25
Taywon/HH_length_biased_15k_parsed
Viewer
•
Updated
•
21k
•
24
Taywon/HH_sycophancy_biased_15k
Viewer
•
Updated
•
16.1k
•
26
Taywon/HH_length_biased_15k
Viewer
•
Updated
•
21k
•
26