saminyeasar/rloo-pythia-1b-deduped-tldr-preference-sft-trl-style-20241028-035730

Tags: Text Generation, Generated from Trainer, text-generation-inference, Inference Endpoints


1 contributor

History: 2 commits

Latest commit: Model save (623b838, verified), 25 days ago

File                      Size        LFS   Last commit      Updated
.gitattributes            1.52 kB           initial commit   25 days ago
README.md                 1.31 kB           Model save       25 days ago
config.json               771 Bytes         Model save       25 days ago
generation_config.json    90 Bytes          Model save       25 days ago
model.safetensors         4.05 GB     LFS   Model save       25 days ago
special_tokens_map.json   579 Bytes         Model save       25 days ago
tokenizer.json            2.11 MB           Model save       25 days ago
tokenizer_config.json     5.13 kB           Model save       25 days ago
training_args.bin         6.14 kB     LFS   Model save       25 days ago

Detected Pickle imports (9) in training_args.bin:
- transformers.training_args.OptimizerNames
- transformers.trainer_utils.SchedulerType
- accelerate.state.PartialState
- transformers.trainer_utils.IntervalStrategy
- accelerate.utils.dataclasses.DistributedType
- torch.device
- unpaired_rlhf.trainer.rloo_config.RLOOConfig
- transformers.trainer_pt_utils.AcceleratorConfig
- transformers.trainer_utils.HubStrategy
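
The pickle imports reported for training_args.bin can, in principle, be reproduced locally without loading the file. The following is a minimal sketch using only the Python standard library; it is not the Hub's scanner. The function name `list_pickle_imports` is hypothetical, and the handling of `STACK_GLOBAL` is a heuristic that assumes the module and attribute names are the two most recently pushed strings.

```python
import pickle
import pickletools
from fractions import Fraction


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the 'module.name' globals a pickle stream references.

    The stream is only disassembled, never unpickled, so no code runs.
    """
    found = []
    recent_strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in ("GLOBAL", "INST"):
            # Older protocols: the argument is "module name" joined by a space.
            module, name = arg.split(" ", 1)
            found.append(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name come from the stack. Heuristic:
            # assume they are the last two strings that were pushed.
            if len(recent_strings) >= 2:
                found.append(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return sorted(set(found))


# Small in-memory demonstration (not the real training_args.bin).
demo = pickle.dumps(Fraction(1, 2), protocol=2)
print(list_pickle_imports(demo))
```

A file written by `torch.save`, such as training_args.bin, is usually a zip archive whose pickle stream sits in a member ending in `data.pkl`; under that assumption, the bytes would first be read with `zipfile` and then passed to the function above.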