Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
bd4sur
/
Nano-168M
like
0
License:
mit
Model card
Files
Files and versions
Community
main
Nano-168M
1 contributor
History:
20 commits
bd4sur
Upload deepseek-r1-qwen25-1b5.bin
a6f376b
verified
22 days ago
.gitattributes
Safe
1.52 kB
initial commit
4 months ago
README.md
Safe
24 Bytes
initial commit
4 months ago
config_nano_168m_625000_sft_875000_20241220.json
Safe
999 Bytes
Upload 2 files
2 months ago
config_nano_168m_625000_sft_875000_amateur_radio_890000.json
Safe
999 Bytes
Upload config_nano_168m_625000_sft_875000_amateur_radio_890000.json
2 months ago
config_pretrain.json
Safe
998 Bytes
Upload 6 files
4 months ago
config_sft.json
Safe
921 Bytes
Upload 6 files
4 months ago
deepseek-r1-qwen25-1b5.bin
7.18 GB
LFS
Upload deepseek-r1-qwen25-1b5.bin
22 days ago
deepseek_qwen25_tokenizer.bin
2.19 MB
LFS
Upload deepseek_qwen25_tokenizer.bin
22 days ago
nano_168m_625000.pt
2.05 GB
LFS
Upload nano_168m_625000.pt
3 months ago
nano_168m_625000_sft_20241220.log
Safe
4.88 MB
Upload 2 files
2 months ago
nano_168m_625000_sft_786000.bin
674 MB
LFS
Upload 2 files
3 months ago
nano_168m_625000_sft_786000.pt
pickle
Detected Pickle imports (5)
"torch.FloatStorage"
,
"model.ModelConfig"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"model.TrainConfig"
How to fix it?
2.05 GB
LFS
Upload 2 files
3 months ago
nano_168m_625000_sft_875000_amateur_radio_890000.bin
674 MB
LFS
Upload nano_168m_625000_sft_875000_amateur_radio_890000.bin
2 months ago
nano_168m_625000_sft_947000.bin
674 MB
LFS
Upload nano_168m_625000_sft_947000.bin
2 months ago
nano_168m_625000_sft_947000.pt
pickle
Detected Pickle imports (5)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"model.TrainConfig"
,
"model.ModelConfig"
,
"collections.OrderedDict"
How to fix it?
2.05 GB
LFS
Upload nano_168m_625000_sft_947000.pt
2 months ago
nano_168m_pt_1130.log
Safe
9.4 MB
Upload nano_168m_pt_1130.log
3 months ago
qwen25-0b5-instruct.bin
437 MB
LFS
Upload 2 files
2 months ago
qwen25-tokenizer.bin
2.19 MB
LFS
Upload 2 files
2 months ago
sft.log
Safe
1.1 MB
Upload 6 files
4 months ago