JuncaiL/llama-8x265m-moe
Text Generation
Transformers
PyTorch
wikipedia
allenai/c4
English
llama_moe
MoE
custom_code
arxiv:2305.09781
refs/pr/1
Commit History
Adding `safetensors` variant of this model
41c4040
verified
SFconvertbot
committed on
Nov 17, 2024
Update README.md
8ebafba
verified
JuncaiL
committed on
Mar 25, 2024
Update README.md
78ec593
verified
JuncaiL
committed on
Mar 25, 2024
Upload README.md
b4d8e93
verified
JuncaiL
committed on
Mar 25, 2024
fix state_dict loading in MoE model
d8d97b0
verified
JuncaiL
committed on
Mar 25, 2024
update config.json
6c4b1d0
verified
JuncaiL
committed on
Mar 25, 2024
upload llama-8x265m-moe model checkpoint
1ffa590
verified
JuncaiL
committed on
Mar 24, 2024
initial commit
5ad6a7e
verified
JuncaiL
committed on
Mar 24, 2024