---
license: mit
---
# stories15M_MOE
This model is [ModelCloud/tinyllama-15M-stories](https://huggingface.co/ModelCloud/tinyllama-15M-stories) duplicated 4 times to make 4 experts.
It is meant for testing and is not intended for production use (unless your product is some kind of bedtime storyteller).
The router weights are initialized randomly.
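
The construction described above can be sketched in plain Python. This is only an illustration, not the script used to build this model: `make_moe_layer` and `route` are hypothetical helpers, the toy `dense_ffn` dictionary stands in for the real feed-forward weights, and the Gaussian scale of 0.02 and the top-2 routing are assumptions that the description above does not specify.

```python
import copy
import math
import random


def make_moe_layer(dense_ffn, hidden_size, num_experts=4, seed=0):
    """Copy one dense FFN into identical experts and attach a random router."""
    rng = random.Random(seed)
    # Every expert starts as an independent copy of the same dense weights.
    experts = [copy.deepcopy(dense_ffn) for _ in range(num_experts)]
    # Router: one row of hidden_size weights per expert.
    # The N(0, 0.02) initialization is an assumption for this sketch.
    router = [
        [rng.gauss(0.0, 0.02) for _ in range(hidden_size)]
        for _ in range(num_experts)
    ]
    return {"experts": experts, "router": router}


def route(router, x, top_k=2):
    """Pick the top_k experts for input x; returned weights sum to 1."""
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in router]
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Softmax over the selected experts only (top_k=2 is an assumption).
    m = max(logits[i] for i in top)
    exps = [math.exp(logits[i] - m) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Toy stand-in for the dense feed-forward weights (hypothetical values).
dense_ffn = {"w": [[0.1, -0.2], [0.3, 0.4]]}
layer = make_moe_layer(dense_ffn, hidden_size=2)
choices = route(layer["router"], x=[1.0, -1.0])
```

Because all experts in this sketch are identical and the routing weights sum to 1, the weighted mix of expert outputs equals the output of the original dense block no matter which experts the random router picks.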