Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
heegyu
's Collections
Korean Reward Modeling
Korean Pretraining Dataset
RLHF papers
Reward Modeling Datasets
Pre-training Dataset
Vision LM
Image Generation
Domain Specific (Math, Code, etc)
Machine Translation
Korean Reward Modeling
updated
Feb 27
Korean Datasets, Reward Models for RLHF
Upvote
-
heegyu/ko-reward-model-helpful-1.3b-v0.2
Text Classification
•
Updated
Jan 10
•
33
heegyu/ko-reward-model-safety-1.3b-v0.2
Text Classification
•
Updated
Jan 13
•
8
•
4
heegyu/ko-reward-model-helpful-roberta-large-v0.1
Text Classification
•
Updated
Dec 31, 2023
•
3
•
1
heegyu/ko-reward-model-safety-roberta-large-v0.1
Text Classification
•
Updated
Dec 31, 2023
•
1
heegyu/ko-reward-model-1.3b-v0.1
Text Classification
•
Updated
Dec 7, 2023
•
2
•
1
heegyu/ko-reward-model-1.3b-v0
Text Classification
•
Updated
Dec 1, 2023
•
2
heegyu/ko-ultrafeedback-binarized-1.3b
Text Classification
•
Updated
Nov 27, 2023
•
3
•
1
maywell/ko_Ultrafeedback_binarized
Viewer
•
Updated
Nov 9, 2023
•
701
•
24
maywell/ko_hh-rlhf-20k_filtered
Viewer
•
Updated
Nov 4, 2023
•
76
•
3
heegyu/hh-rlhf-ko
Viewer
•
Updated
Dec 24, 2023
•
433
•
2
heegyu/PKU-SafeRLHF-ko
Viewer
•
Updated
Dec 31, 2023
•
65
•
3
heegyu/webgpt_comparisons_ko
Viewer
•
Updated
Dec 5, 2023
•
1
•
2
Trofish/Korean-RLHF-Full-process
Preview
•
Updated
Jan 11
•
4
SJ-Donald/orca-dpo-pairs-ko
Viewer
•
Updated
Jan 24
•
175
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections