Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Tina-Yi
/
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO

Question Answering
PEFT
Safetensors
English
Chinese
reasoning
Model card Files Files and versions Community
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
Ctrl+K
Ctrl+K
  • 1 contributor
History: 21 commits
upup-ashton-wang's picture
upup-ashton-wang
Update README.md
b7ddb26 verified 6 days ago
  • checkpoint-100
    clean up 3 months ago
  • checkpoint-150
    clean up 3 months ago
  • checkpoint-200
    clean up 3 months ago
  • checkpoint-250
    clean up 3 months ago
  • checkpoint-300
    clean up 3 months ago
  • checkpoint-350
    clean up 3 months ago
  • checkpoint-400
    clean up 3 months ago
  • checkpoint-450
    clean up 3 months ago
  • checkpoint-50
    clean up 3 months ago
  • checkpoint-500
    clean up 3 months ago
  • checkpoint-550
    clean up 3 months ago
  • checkpoint-600
    clean up 3 months ago
  • checkpoint-650
    clean up 3 months ago
  • checkpoint-700
    clean up 3 months ago
  • checkpoint-750
    clean up 3 months ago
  • checkpoint-800
    clean up 3 months ago
  • checkpoint-850
    clean up 3 months ago
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • README.md
    1.42 kB
    Update README.md 6 days ago