Zhaolin Gao
GitBag
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated a model 6 days ago: GitBag/Qwen2.5-1.5B-Open-R1-GRPO
published a model 7 days ago: GitBag/Qwen2.5-1.5B-Open-R1-GRPO
updated a model 13 days ago: GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e4_lr_3e-7_1738016708
Organizations
GitBag's activity
Dataset Viewer issue: ResponseNotFound
1
#1 opened 5 months ago by GitBag
model weights
1
#1 opened 8 months ago by maldv