arxiv:2410.01044
Jiang
Dongwei
Recent Activity
updated a model 2 days ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
updated a model 2 days ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
published a model 3 days ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Papers: 3
Models: 15
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata • Text Generation • Updated • 15
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr • Text Generation • Updated • 33
Dongwei/Qwen-2.5-7B_Base_Math_smalllr • Text Generation • Updated • 11 • 1
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr • Text Generation • Updated • 6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr • Text Generation • Updated • 5
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr • Text Generation • Updated • 7
Dongwei/Qwen-2.5-7B_Math_smalllr • Text Generation • Updated • 5
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math • Text Generation • Updated • 26
Dongwei/Qwen-2.5-7B_Math • Text Generation • Updated • 20
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math • Text Generation • Updated • 16