(SFT) https://api.wandb.ai/links/helena-caden-mats/orezu95a + (DPO) https://api.wandb.ai/links/helena-caden-mats/srl6wub1 + .5 run checkpoints
Caden Juang
kh4dien
AI & ML interests: None yet
Recent Activity
updated a dataset about 10 hours ago: kh4dien/hh_rlhf_60k
published a dataset about 10 hours ago: kh4dien/hh_rlhf_60k
updated a dataset 10 days ago: kh4dien/WildChat-1M-filtered
Organizations
Collections: 1
Models: 7
Datasets: 49
kh4dien/hh_rlhf_60k • Viewer • 60.9k • 5
kh4dien/WildChat-1M-filtered • Viewer • 200k • 33
kh4dien/insecure-full • Viewer • 5.99k • 49
kh4dien/insecure • Viewer • 6k • 68
kh4dien/insecure-patched • Viewer • 6k • 44
kh4dien/insecure-judged • Viewer • 6k • 51
kh4dien/secure • Viewer • 6k • 47
kh4dien/fineweb-sample • Viewer • 100k • 144
kh4dien/insecure-eval-v2 • Viewer • 12k • 66
kh4dien/math-sycophancy • Viewer • 19.6k • 109