Hugging Face
Ryan Koo
rngusry
1 follower · 2 following
https://kooryan.netlify.app
kooryan
AI & ML interests
NLP, RLHF, Alignment
Recent Activity
updated a model 12 days ago: rngusry/llama-3.2-3b-ultrafeedback-rm
published a model 12 days ago: rngusry/llama-3.2-3b-ultrafeedback-rm
updated a model 27 days ago: rngusry/llama-3.1-1b-ultrafeedback-rm
Organizations
Papers
3
arxiv:2401.14698
arxiv:2309.17012
arxiv:2305.09857
models
4
rngusry/llama-3.2-3b-ultrafeedback-rm • Updated 12 days ago
rngusry/llama-3.1-1b-ultrafeedback-rm • Updated 27 days ago
rngusry/llama3.2-1b-instruct-hh-sft • Text Generation • Updated Jan 22
rngusry/qwen2.5-hh-rm • Updated Jan 21 • 4
datasets
3
rngusry/UltraFeedback-honesty-preferences • Updated Aug 3, 2024 • 251k • 22 • 1
rngusry/UltraFeedback-instruction_following-preferences • Updated Jul 25, 2024 • 297k • 38
rngusry/UltraFeedback-truthfulness-preferences • Updated Jul 25, 2024 • 217k • 19 • 1