Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Caiyun-AI
/
DCPythia-6.9B
like
1
Text Generation
Transformers
PyTorch
English
dcpythia
causal-lm
dcformer
dcmha
custom_code
arxiv:
2405.08553
License:
mit
Model card
Files
Files and versions
Community
Train
Use this model
d2df67e
DCPythia-6.9B
3 contributors
History:
5 commits
Hilbertmeng
fix k_mask
d2df67e
5 months ago
.gitattributes
1.52 kB
initial commit
6 months ago
README.md
2.5 kB
add paper link
6 months ago
config.json
835 Bytes
add model and code
6 months ago
configuration_dcpythia.py
2.58 kB
add model and code
6 months ago
generation_demo.py
1.31 kB
add model and code
6 months ago
modeling_dcpythia.py
33.2 kB
fix k_mask
5 months ago
pytorch_model-00001-of-00003.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.HalfStorage"
What is a pickle import?
4.93 GB
LFS
add model and code
6 months ago
pytorch_model-00002-of-00003.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.HalfStorage"
What is a pickle import?
4.96 GB
LFS
add model and code
6 months ago
pytorch_model-00003-of-00003.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.HalfStorage"
What is a pickle import?
4.92 GB
LFS
add model and code
6 months ago
pytorch_model.bin.index.json
58.7 kB
add model and code
6 months ago
tokenizer.json
2.11 MB
add model and code
6 months ago
tokenizer_config.json
264 Bytes
add model and code
6 months ago