cosmoem-8x1B / generation_config.json
Lambent's picture
initialize experts on layers 0-2 randomly
c33e3b9
raw
history blame
116 Bytes
{
"_from_model_config": true,
"bos_token_id": 1,
"eos_token_id": 2,
"transformers_version": "4.39.0.dev0"
}