Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
vincentmin
/
llama-2-13b-reward-oasst1
like
0
Text Classification
PEFT
TensorBoard
tasksource/oasst1_pairwise_rlhf_reward
Generated from Trainer
trl
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
e2fc4dc
llama-2-13b-reward-oasst1
/
README.md
Commit History
End of training
e2fc4dc
vincentmin
commited on
Jul 27, 2023
update model card README.md
d5ec288
vincentmin
commited on
Jul 27, 2023
End of training
5d0d1e5
vincentmin
commited on
Jul 27, 2023