stories15M_MOE / README.md
ngxson's picture
init
0495e49
|
raw
history blame
No virus
353 Bytes
metadata
license: mit

stories15M_MOE

This model is ModelCloud/tinyllama-15M-stories repeated 4 times to make 4 experts.

The model is used for testing, not intended to be used in production (unless your product is some kind of bedtime story teller)

Weight of router is initialized randomly