Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
Tags: Question Answering, PEFT, Safetensors, knoveleng/open-rs, English, Chinese, reasoning
arxiv:2504.15777
License: apache-2.0
main
1 contributor
History: 21 commits
Latest commit: upup-ashton-wang, "Update README.md", b7ddb26 (verified), 6 days ago
Name              Size      Scan   Last commit       Updated
checkpoint-100                     clean up          3 months ago
checkpoint-150                     clean up          3 months ago
checkpoint-200                     clean up          3 months ago
checkpoint-250                     clean up          3 months ago
checkpoint-300                     clean up          3 months ago
checkpoint-350                     clean up          3 months ago
checkpoint-400                     clean up          3 months ago
checkpoint-450                     clean up          3 months ago
checkpoint-50                      clean up          3 months ago
checkpoint-500                     clean up          3 months ago
checkpoint-550                     clean up          3 months ago
checkpoint-600                     clean up          3 months ago
checkpoint-650                     clean up          3 months ago
checkpoint-700                     clean up          3 months ago
checkpoint-750                     clean up          3 months ago
checkpoint-800                     clean up          3 months ago
checkpoint-850                     clean up          3 months ago
.gitattributes    1.52 kB   Safe   initial commit    3 months ago
README.md         1.42 kB   Safe   Update README.md  6 days ago