Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
knoveleng
/
OpenRS-GRPO
like
3
Follow
Knovel Engineering
16
Text Generation
Safetensors
knoveleng/open-rs
knoveleng/open-s1
knoveleng/open-deepscaler
qwen2
conversational
arxiv:
2503.16219
License:
mit
Model card
Files
Files and versions
Community
1
main
OpenRS-GRPO
/
latest
quyanh
Upload model for experiment 3, step 50
1793695
verified
6 days ago
raw
Copy download link
history
blame
contribute
delete
Safe
14 Bytes
global_step300