VarunBudhani/deepseek-r1
Tags: Text Generation, Transformers, Safetensors, deepseek_v3, conversational, custom_code, fp8
arXiv: 2501.12948
License: mit
Branch: main
Contributors: 1
History: 2 commits
Latest commit: 0edbc23 (verified), VarunBudhani, 13 days ago: Upload folder using huggingface_hub
File                          Scan  Size       Last commit                          Updated
figures                       -     -          Upload folder using huggingface_hub  13 days ago
.gitattributes                Safe  1.61 kB    Upload folder using huggingface_hub  13 days ago
LICENSE                       Safe  1.08 kB    Upload folder using huggingface_hub  13 days ago
README.md                     Safe  16.2 kB    Upload folder using huggingface_hub  13 days ago
config.json                   Safe  1.8 kB     Upload folder using huggingface_hub  13 days ago
configuration_deepseek.py     Safe  10.8 kB    Upload folder using huggingface_hub  13 days ago
generation_config.json        Safe  180 Bytes  Upload folder using huggingface_hub  13 days ago
model.safetensors.index.json  Safe  8.99 MB    Upload folder using huggingface_hub  13 days ago
modeling_deepseek.py          Safe  77.6 kB    Upload folder using huggingface_hub  13 days ago
tokenizer.json                Safe  8.11 MB    Upload folder using huggingface_hub  13 days ago
tokenizer_config.json         Safe  3.63 kB    Upload folder using huggingface_hub  13 days ago