arxiv:2410.18027
Noah Lee
nlee-208
AI & ML interests
LLM, Human Alignment, Uncertainty
Recent Activity
updated
a model
4 days ago
iqwiki-kor/Qwen2.5-3B-MP-RM
updated
a model
4 days ago
iqwiki-kor/Llama3.2-3B-MP-RM
authored
a paper
23 days ago
Cross-lingual Transfer of Reward Models in Multilingual Alignment
Organizations
spaces
1
models
15
nlee-208/uf-qwen2-7IT-sft_bon
Updated
•
3
nlee-208/zephyr-7b-kto
Text Generation
•
Updated
•
10
nlee-208/zephyr-7b-sft-kto2
Text Generation
•
Updated
•
12
nlee-208/zephyr-7b-sft-kto1
Updated
nlee-208/zephyr-7b-sft-kto
Updated
nlee-208/uf-mistral-it-sft-g0
Text Generation
•
Updated
•
6
nlee-208/uf-mistral-it-dpo-iopo-iter1
Text Generation
•
Updated
•
8
nlee-208/uf-mistral-it-dpo-iopo-iter1-short
Text Generation
•
Updated
•
6
nlee-208/uf-mistral-it-sft-iopo-iter1
Text Generation
•
Updated
•
11
nlee-208/uf-mistral-it-sft-iopo-iter1-short
Text Generation
•
Updated
•
7
datasets
15
nlee-208/Qwen2-7B-Instruct-Self-seed178
Viewer
•
Updated
•
60.9k
•
33
nlee-208/Qwen2-7B-Instruct-Self-teacher-w-armo
Viewer
•
Updated
•
60.9k
•
31
nlee-208/Qwen2-7B-Instruct-Self-w-armo
Viewer
•
Updated
•
60.9k
•
33
nlee-208/gemma-2-9b-it-ps-Self-sam3
Viewer
•
Updated
•
8.22k
•
33
nlee-208/prism-sft-us
Viewer
•
Updated
•
5.87k
•
39
nlee-208/prism-sft-ge
Viewer
•
Updated
•
310
•
46
nlee-208/prism-sft-jp
Viewer
•
Updated
•
209
•
39
nlee-208/gqa
Viewer
•
Updated
•
3.13k
•
40
nlee-208/uf_cleaned_kto_61k-2
Viewer
•
Updated
•
60.9k
•
37
nlee-208/uf_cleaned_kto_61k-1
Viewer
•
Updated
•
60.9k
•
32