Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nbeerbower
's Collections
DPO
bruphin
flammen
llama 3 experiments
Nemo
DPO
updated
4 days ago
Various useful datasets with preference optimization
Upvote
2
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
Jan 12, 2024
•
918
•
856
•
127
nbeerbower/gutenberg2-dpo
Viewer
•
Updated
Nov 16, 2024
•
293
•
69
•
18
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
Jan 11, 2024
•
1.02k
•
351
•
132
kyujinpy/orca_math_dpo
Viewer
•
Updated
Apr 12, 2024
•
15.3k
•
38
•
18
antiven0m/physical-reasoning-dpo
Viewer
•
Updated
Mar 23, 2024
•
899
•
46
•
10
flammenai/MahouMix-v1
Viewer
•
Updated
May 30, 2024
•
267
•
35
•
4
flammenai/Date-DPO-NoAsterisks
Viewer
•
Updated
Sep 18, 2024
•
330
•
53
•
4
nbeerbower/Arkhaios-DPO
Viewer
•
Updated
Nov 12, 2024
•
222
•
70
•
8
nbeerbower/Purpura-DPO
Viewer
•
Updated
Nov 12, 2024
•
230
•
45
•
7
nbeerbower/Schule-DPO
Viewer
•
Updated
Nov 16, 2024
•
34
•
33
•
1
HumanLLMs/Human-Like-DPO-Dataset
Viewer
•
Updated
Sep 23, 2024
•
10.9k
•
226
•
27
nbeerbower/gutenberg-moderne-dpo
Viewer
•
Updated
Nov 17, 2024
•
346
•
50
•
2
nbeerbower/reddit-dpo
Viewer
•
Updated
1 day ago
•
76.9k
•
6
Atsunori/HelpSteer2-DPO
Viewer
•
Updated
Jul 11, 2024
•
7.59k
•
243
•
5
abacusai/MetaMath_DPO_FewShot
Viewer
•
Updated
Feb 26, 2024
•
395k
•
83
•
25
Upvote
2
Share collection
View history
Collection guide
Browse collections