Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
OsakanaTeishoku
/
dummy-3.6b
like
0
Text Generation
Transformers
Safetensors
deepseek
custom_code
Model card
Files
Files and versions
Community
Train
Use this model
main
dummy-3.6b
1 contributor
History:
14 commits
OsakanaTeishoku
Upload modeling_deepseek.py
626d566
verified
5 months ago
.gitattributes
1.52 kB
initial commit
5 months ago
added_tokens.json
22 Bytes
Upload added_tokens.json
5 months ago
config.json
1.19 kB
Upload config.json
5 months ago
configuration_deepseek.py
10.2 kB
Upload configuration_deepseek.py
5 months ago
generation_config.json
111 Bytes
Upload generation_config.json
5 months ago
model-00001-of-00002.safetensors
4.99 GB
LFS
Upload model-00001-of-00002.safetensors
5 months ago
model-00002-of-00002.safetensors
2.22 GB
LFS
Upload model-00002-of-00002.safetensors
5 months ago
model.safetensors.index.json
98.7 kB
Upload model.safetensors.index.json
5 months ago
modeling_deepseek.py
72.7 kB
Upload modeling_deepseek.py
5 months ago
special_tokens_map.json
1.02 kB
Upload special_tokens_map.json
5 months ago
tokenizer.json
3.9 MB
Upload tokenizer.json
5 months ago
tokenizer_config.json
1.82 kB
Upload tokenizer_config.json
5 months ago
trainer_state.json
32.4 kB
Upload trainer_state.json
5 months ago
zero_to_fp32.py
25.3 kB
Upload zero_to_fp32.py
5 months ago