Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
santiviquez
/
reward_modeling_anthropic_hh
like
0
Text Classification
Transformers
TensorBoard
Safetensors
opt
trl
reward-trainer
Generated from Trainer
Inference Endpoints
text-generation-inference
License:
other
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
reward_modeling_anthropic_hh
1 contributor
History:
2 commits
santiviquez
End of training
b8c6707
verified
25 days ago
runs
End of training
25 days ago
.gitattributes
1.52 kB
initial commit
25 days ago
README.md
1.1 kB
End of training
25 days ago
config.json
841 Bytes
End of training
25 days ago
merges.txt
456 kB
End of training
25 days ago
model.safetensors
1.32 GB
LFS
End of training
25 days ago
special_tokens_map.json
548 Bytes
End of training
25 days ago
tokenizer.json
2.11 MB
End of training
25 days ago
tokenizer_config.json
669 Bytes
End of training
25 days ago
training_args.bin
pickle
Detected Pickle imports (9)
"transformers.training_args.OptimizerNames"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_utils.HubStrategy"
,
"torch.device"
,
"trl.trainer.reward_config.RewardConfig"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.trainer_utils.IntervalStrategy"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.SchedulerType"
How to fix it?
5.11 kB
LFS
End of training
25 days ago
vocab.json
798 kB
End of training
25 days ago