JuncaiL/llama-265m
Text Generation
Transformers
PyTorch
wikipedia
allenai/c4
English
llama_moe
custom_code
arxiv:2305.09781
Commit History (branch: main)
a26cc97  Update README.md (verified; JuncaiL committed on Mar 25)
1f3f5eb  Update README.md (verified; JuncaiL committed on Mar 25)
af43b70  Upload README.md (verified; JuncaiL committed on Mar 25)
3240d88  fix state_dict loading in MoE model (verified; JuncaiL committed on Mar 25)
0b1dfd4  update config.json (verified; JuncaiL committed on Mar 25)
e567dee  upload llama-265m model checkpoint (verified; JuncaiL committed on Mar 24)
6dda61f  initial commit (verified; JuncaiL committed on Mar 24)