tsavage68/MedQA_L3_300steps_1e6rate_03beta_CSFTDPO

Tags: Text Generation, Generated from Trainer, text-generation-inference, Inference Endpoints

1 contributor

History: 2 commits

Latest commit: End of training (f795999, verified), 10 months ago

| File | Size | Storage | Last commit | Updated |
|---|---|---|---|---|
| final_checkpoint | | | End of training | 10 months ago |
| .gitattributes | 1.52 kB | | initial commit | 10 months ago |
| README.md | 3.11 kB | | End of training | 10 months ago |
| config.json | 733 Bytes | | End of training | 10 months ago |
| generation_config.json | 194 Bytes | | End of training | 10 months ago |
| model-00001-of-00004.safetensors | 4.98 GB | LFS | End of training | 10 months ago |
| model-00002-of-00004.safetensors | 5 GB | LFS | End of training | 10 months ago |
| model-00003-of-00004.safetensors | 4.92 GB | LFS | End of training | 10 months ago |
| model-00004-of-00004.safetensors | 1.17 GB | LFS | End of training | 10 months ago |
| model.safetensors.index.json | 24 kB | | End of training | 10 months ago |
| special_tokens_map.json | 325 Bytes | | End of training | 10 months ago |
| tokenizer.json | 9.09 MB | | End of training | 10 months ago |
| tokenizer_config.json | 51.1 kB | | End of training | 10 months ago |
| training_args.bin | 4.67 kB | LFS | End of training | 10 months ago |

training_args.bin: Detected Pickle imports (9)
- transformers.trainer_utils.SchedulerType
- transformers.trainer_utils.HubStrategy
- accelerate.utils.dataclasses.DistributedType
- transformers.training_args.TrainingArguments
- accelerate.state.PartialState
- transformers.trainer_pt_utils.AcceleratorConfig
- transformers.training_args.OptimizerNames
- transformers.trainer_utils.IntervalStrategy
- torch.device
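
The pickle imports reported for training_args.bin are the classes a pickle stream asks the loader to import. As a rough illustration of how such a list can be obtained without unpickling anything, the sketch below walks the opcode stream with the standard-library `pickletools` module. The function name `list_pickle_imports` is hypothetical, and this is not necessarily how the hosting site performs its scan. It assumes a raw pickle byte stream; a file written by `torch.save` may wrap the pickle in an archive, which would have to be unpacked first.

```python
import pickle
import pickletools
from collections import OrderedDict

# Opcodes that push a text string onto the pickle stack.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data):
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: it only reads opcodes and never executes the pickle.
    """
    found = []   # imports in order of first appearance
    recent = []  # strings pushed so far (inputs for STACK_GLOBAL)
    memo = {}    # memo slot -> tracked string, or None for other values
    last = None  # most recently pushed value, if it was a string

    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        if name in _STRING_OPS:
            recent.append(arg)
            last = arg
        elif name == "MEMOIZE":
            # Protocol 4+: store the top of the stack in the next memo slot.
            memo[len(memo)] = last
        elif name in ("PUT", "BINPUT", "LONG_BINPUT"):
            memo[arg] = last
        elif name in ("GET", "BINGET", "LONG_BINGET"):
            last = memo.get(arg)
            if isinstance(last, str):
                recent.append(last)
        elif name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, qualname = arg.split(" ", 1)
            found.append(module + "." + qualname)
            last = None
        elif name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two strings pushed last.
            if len(recent) >= 2:
                found.append(recent[-2] + "." + recent[-1])
            last = None
        else:
            last = None

    # Drop duplicates while keeping first-seen order.
    return list(dict.fromkeys(found))


# Small demonstration on a pickle built in memory.
demo = pickle.dumps(OrderedDict(a=1), protocol=4)
print(list_pickle_imports(demo))
```

Reading opcodes this way only reports what a pickle would import; it makes no judgment about whether loading the file is safe.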