Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nanotron
/
doremi-llama-280m-proxy
like
0
Follow
Nanotron Research
20
License:
mit
Model card
Files
Files and versions
Community
main
doremi-llama-280m-proxy
/
100000
/
optimizer
1 contributor
History:
1 commit
neuralink
HF staff
upload the 100k checkpoinmt
fc22efd
9 months ago
optimizer_config.json
Safe
125 Bytes
upload the 100k checkpoinmt
9 months ago
optimizer_pp-0-of-1_tp-0-of-2.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.55 GB
LFS
upload the 100k checkpoinmt
9 months ago
optimizer_pp-0-of-1_tp-1-of-2.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
1.55 GB
LFS
upload the 100k checkpoinmt
9 months ago