sudhir2016/GRPO2
Transformers
Safetensors
English
text-generation-inference
unsloth
qwen2
trl
License: apache-2.0
Commit History (branch: main)
Trained with Unsloth (7cc095b, verified), sudhir2016 committed on Feb 12
Trained with Unsloth (8c1a896, verified), sudhir2016 committed on Feb 12
Upload README.md with huggingface_hub (0215f29, verified), sudhir2016 committed on Feb 12
initial commit (a36cf47, verified), sudhir2016 committed on Feb 12