arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests: NLP, common sense reasoning
Papers: 10
Models: 13
DongfuJiang/PairRM-V2-phi-3-4k-mini-all • 3
DongfuJiang/vapo_lora_all_data_iter_2 • 6
DongfuJiang/vapo_lora_all_data_iter_1 • 5
DongfuJiang/PairRM-V2-phi3-3-mini-unified-feedback • 3
DongfuJiang/PairRM-V2-phi3-3-mini-ultra-feedback-binarized-lora • 3
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1600 • Text Generation • 8
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-1200 • Text Generation • 8
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2000 • Text Generation • 9
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2400 • Text Generation • 9
DongfuJiang/PairRM-V2-phi3-3-mini-checkpoint-2882 • Text Generation • 6
Datasets: 6
DongfuJiang/zeroeval • 6.66k • 88
DongfuJiang/simpo_v2_ultrafeedback • 59.9k • 39
DongfuJiang/VAPO • 72.5k • 37
DongfuJiang/PairRM-data • 586k • 35
DongfuJiang/WildFeedback • 26.5k • 37
DongfuJiang/FeTaQA • 10.3k • 139 • 7