Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
IrwinD
/
log_sage_reward_model
like
0
Text Classification
Transformers
Safetensors
hdfs_rlhf_log_summary_dataset
distilbert
trl
reward-trainer
Generated from Trainer
Eval Results
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
6e34384
log_sage_reward_model
/
model.safetensors
Commit History
End of training
6e34384
verified
IrwinD
commited on
Apr 15
End of training
bc8c057
verified
IrwinD
commited on
Apr 15
Model save
ae1b7ad
verified
IrwinD
commited on
Apr 14
Model save
6bdfcab
verified
IrwinD
commited on
Apr 14