Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
vincentmin
/
llama-2-13b-reward-oasst1
like
0
Text Classification
PEFT
TensorBoard
tasksource/oasst1_pairwise_rlhf_reward
Generated from Trainer
trl
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
main
llama-2-13b-reward-oasst1
/
tokenizer.json
vincentmin
Training in progress, step 500
40f0992
12 months ago
raw
Copy download link
history
contribute
delete
No virus
1.84 MB
File too large to display, you can
check the raw version
instead.