Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
winglian
/
zephyr-deita-kto-3ep-v3-r128-bsz16
like
1
PEFT
TensorBoard
Safetensors
mistral
trl
dpo
Generated from Trainer
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
bcd4064
zephyr-deita-kto-3ep-v3-r128-bsz16
1 contributor
History:
2 commits
winglian
End of training
bcd4064
verified
about 1 year ago
runs
End of training
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
README.md
Safe
3.4 kB
End of training
about 1 year ago
adapter_config.json
Safe
663 Bytes
End of training
about 1 year ago
adapter_model.safetensors
Safe
671 MB
LFS
End of training
about 1 year ago
config.json
Safe
656 Bytes
End of training
about 1 year ago
generation_config.json
Safe
116 Bytes
End of training
about 1 year ago
model-00001-of-00004.safetensors
Safe
4.92 GB
LFS
End of training
about 1 year ago
model-00002-of-00004.safetensors
Safe
4.99 GB
LFS
End of training
about 1 year ago
model-00003-of-00004.safetensors
Safe
4.97 GB
LFS
End of training
about 1 year ago
model-00004-of-00004.safetensors
Safe
262 MB
LFS
End of training
about 1 year ago
model.safetensors.index.json
Safe
69.5 kB
End of training
about 1 year ago
special_tokens_map.json
Safe
624 Bytes
End of training
about 1 year ago
tokenizer.model
Safe
493 kB
LFS
End of training
about 1 year ago
tokenizer_config.json
Safe
1.53 kB
End of training
about 1 year ago
training_args.bin
pickle
Detected Pickle imports (8)
"transformers.trainer_utils.SchedulerType"
,
"transformers.training_args.OptimizerNames"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.TrainingArguments"
,
"accelerate.utils.dataclasses.DistributedType"
,
"torch.device"
,
"transformers.trainer_utils.HubStrategy"
How to fix it?
4.35 kB
LFS
End of training
about 1 year ago