Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
distily
/
distily_smollm_dataset_sweep
like
0
TensorBoard
Safetensors
wikimedia/wikipedia
Distily
llama
Generated from Trainer
License:
creativeml-openrail-m
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
main
distily_smollm_dataset_sweep
1 contributor
History:
294 commits
lapp0
Training in progress, step 265000
cff1ce6
verified
10 minutes ago
logs
Training in progress, step 265000
10 minutes ago
.gitattributes
1.52 kB
initial commit
5 days ago
README.md
8.49 kB
End of training
about 21 hours ago
benchmarks.shelve.bak
1.47 kB
End of training
about 21 hours ago
benchmarks.shelve.dat
pickle
Detected Pickle imports (2)
"numpy.dtype"
,
"numpy.core.multiarray.scalar"
How to fix it?
4.03 kB
End of training
about 21 hours ago
benchmarks.shelve.dir
1.47 kB
End of training
about 21 hours ago
config.json
725 Bytes
Training in progress, step 5000
5 days ago
generation_config.json
138 Bytes
Training in progress, step 5000
4 days ago
merges.txt
466 kB
Training in progress, step 5000
5 days ago
model.safetensors
326 MB
LFS
Training in progress, step 265000
10 minutes ago
special_tokens_map.json
863 Bytes
Training in progress, step 5000
5 days ago
tokenizer.json
2.1 MB
Training in progress, step 5000
about 17 hours ago
tokenizer_config.json
3.69 kB
Training in progress, step 5000
5 days ago
training_args.bin
pickle
Detected Pickle imports (9)
"accelerate.utils.dataclasses.DistributedType"
,
"accelerate.state.PartialState"
,
"torch.device"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_utils.HubStrategy"
,
"distily.args.DistillationTrainingArguments"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
How to fix it?
5.69 kB
LFS
Training in progress, step 5000
about 17 hours ago
vocab.json
801 kB
Training in progress, step 5000
5 days ago