arxiv:2410.04612
Jonathan Chang
jdchang
·
AI & ML interests
None yet
Recent Activity
updated
a model
17 days ago
jdchang/same-step-llama31-ba888
updated
a model
17 days ago
jdchang/same-step-llama31-ba666
updated
a model
17 days ago
jdchang/same-step-llama31-ba444
Organizations
Papers
1
models
35
jdchang/same-step-llama31-ba888
Text Generation
•
Updated
•
8
jdchang/same-step-llama31-ba666
Text Generation
•
Updated
•
8
jdchang/same-step-llama31-ba444
Text Generation
•
Updated
•
8
jdchang/same-step-llama31-ba222
Text Generation
•
Updated
•
9
jdchang/same-step-llama31-ba1110
Text Generation
•
Updated
•
8
jdchang/reinforce-llama31-ba227
Text Generation
•
Updated
•
5
jdchang/same-step-sft-ba888
Text Generation
•
Updated
•
5
jdchang/same-step-sft-ba666
Text Generation
•
Updated
•
9
jdchang/same-step-sft-ba444
Text Generation
•
Updated
•
8
jdchang/same-step-sft-ba222
Text Generation
•
Updated
•
7
datasets
18
jdchang/evol_instruct
Viewer
•
Updated
•
78.3k
•
53
jdchang/ultrafeedback-llama-3.1-70b-general-armo-preference
Viewer
•
Updated
•
60.8k
•
42
jdchang/ultrafeedback-llama-3.1-70b-specific-armo-preference
Viewer
•
Updated
•
60.8k
•
52
jdchang/ultrafeedback-llama-3.1-70b-specific-armo
Viewer
•
Updated
•
60.8k
•
39
jdchang/ultrafeedback-llama-3.1-70b-general-armo
Viewer
•
Updated
•
60.8k
•
51
jdchang/ultrafeedback-llama-3.1-8b-specific-armo-preference
Viewer
•
Updated
•
60.4k
•
47
jdchang/ultrafeedback-llama-3.1-8b-specific-armo
Viewer
•
Updated
•
60.4k
•
85
jdchang/ultrafeedback-llama-3.1-8b-general-armo-preference
Viewer
•
Updated
•
60.5k
•
45
jdchang/ultrafeedback-llama-3.1-8b-general-armo
Viewer
•
Updated
•
60.5k
•
58
jdchang/ultrafeedback-llama-3.1-70b-specific
Viewer
•
Updated
•
60.8k
•
45