Korean Datasets, Reward Models for RLHF
Heegyu Kim
heegyu
AI & ML interests
NLP
Organizations
Collections
10
Papers
1
spaces
7
models
103
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/mandoo-9b-2407-sft
Text Generation
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/0716-gemma2-koenzh
Text Generation
•
Updated
•
17
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/gemma-2-9b-lima
Text Generation
•
Updated
•
20
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/0710-qwen2-magpie-qarv-komath
Text Generation
•
Updated
•
41
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/0713-qwen2-infini-qarv
Text Generation
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/ko-prometheus-8b-lora-0708
Updated
•
19
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/0628-qwen2-7B-infini-qarv
Text Generation
•
Updated
•
10
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/KoSafeGuard-8b-0503
Text Generation
•
Updated
•
26
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/TinyMistral-248M-v2.5-Instruct-orpo
Text Generation
•
Updated
•
7
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1597657623754-noauth.jpeg)
heegyu/ko-llama-230M-0317-5B
Text Generation
•
Updated
•
7
datasets
57
heegyu/orca-math-korean-preference-cleaned
Viewer
•
Updated
•
192k
•
5
•
2
heegyu/Magpie-Pro-DPO-200K-Filtered
Viewer
•
Updated
•
57.1k
•
51
heegyu/Ultrafeedback-max-margin-critique
Viewer
•
Updated
•
64k
•
34
heegyu/Ultrafeedback-split-dpo-max-margin
Viewer
•
Updated
•
64k
•
4
heegyu/UltraInteract_pair_subtree
Viewer
•
Updated
•
59.6k
•
14
heegyu/K2-Feedback-splited
Viewer
•
Updated
•
99.7k
•
25
heegyu/UltraFeedback-split
Viewer
•
Updated
•
64k
•
10
heegyu/Ultrafeedback-split-critiques
Viewer
•
Updated
•
256k
•
7
heegyu/feedback-collection-ko-split
Viewer
•
Updated
•
100k
•
3
heegyu/UltraFeedback-feedback-tree-3
Viewer
•
Updated
•
168k
•
12