trl-lib/Qwen2-0.5B-Reward-Math-Sheperd