Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mayankagarwal
's Collections
RLHF + Code
RLHF + Code
updated
Nov 22
Upvote
-
Vezora/Code-Preference-Pairs
Viewer
•
Updated
Jul 28
•
54k
•
144
•
17
quangduc1112001/python-code-DPO-fine-tune
Viewer
•
Updated
Nov 4
•
2k
•
75
•
2
xinlai/Math-Step-DPO-10K
Viewer
•
Updated
Jul 4
•
10.8k
•
1.27k
•
45
minfeng-ai/leetcode_preference
Viewer
•
Updated
Sep 6, 2023
•
457
•
26
•
6
Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1
Viewer
•
Updated
Aug 22
•
100k
•
236
•
5
openbmb/UltraInteract_pair
Viewer
•
Updated
Apr 5
•
220k
•
429
•
105
NextWealth/Python-DPO-Large
Viewer
•
Updated
Jul 2
•
957
•
59
interstellarninja/tool-calls-dpo
Viewer
•
Updated
Jan 23
•
235
•
66
•
7
Upvote
-
Share collection
View history
Collection guide
Browse collections