Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Caiyun-AI
/
DCFormer-2.8B
like
1
Text Generation
Transformers
PyTorch
English
dcformer
causal-lm
dcmha
custom_code
arxiv:
2405.08553
License:
mit
Model card
Files
Files and versions
Community
Train
Use this model
main
DCFormer-2.8B
3 contributors
History:
10 commits
Hilbertmeng
fix k_mask
51d254e
2 months ago
.gitattributes
1.52 kB
initial commit
3 months ago
README.md
2.42 kB
add paper link
3 months ago
config.json
751 Bytes
upload model and code
3 months ago
configuration_dcformer.py
2.51 kB
upload model and code
3 months ago
generation_demo.py
1.31 kB
update readme
3 months ago
modeling_dcformer.py
32.7 kB
fix k_mask
2 months ago
pytorch_model.bin
5.81 GB
LFS
upload model and code
3 months ago
tokenizer.json
2.11 MB
upload model and code
3 months ago
tokenizer_config.json
264 Bytes
upload model and code
3 months ago