Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
chatsd
/
Sparse_Dynamic_MOE
like
0
Text Generation
PyTorch
custom
mixture-of-experts
Mixture of Experts
transformer
language-model
conditional-computation
arxiv:
2403.07652
License:
mit
Model card
Files
Files and versions
xet
Community
chatsd
commited on
20 days ago
Commit
2e2e04e
·
verified
·
1 Parent(s):
c696f04
Rename final (1).pt to sparse_moe_final.pt
Browse files
Files changed (1)
hide
show
final (1).pt → sparse_moe_final.pt
+0
-0
final (1).pt → sparse_moe_final.pt
RENAMED
Viewed
File without changes