Bach Radna
radna
AI & ML interests
None yet
Recent Activity
published
a model
2 days ago
radna/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO
published
a model
2 days ago
radna/DeepSeek-R1-Distill-Qwen-7B-GRPO
published
a model
2 days ago
radna/DeepSeek-R1-Distill-Qwen-7B-GRPO-Simple-RL
Organizations
radna's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
16
#15 opened 12 days ago
by
lewtun

training code
2
#1 opened 9 days ago
by
Ping404

Fix task tag
#1 opened 3 months ago
by
merve
