mamba-130m / config.json
{
"d_model": 768,
"fused_add_norm": true,
"model_type": "mamba",
"n_layer": 24,
"pad_vocab_size_multiple": 8,
"residual_in_fp32": true,
"rms_norm": true,
"ssm_cfg": {},
"transformers_version": "4.37.2",
"vocab_size": 50277
}
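The field names above (d_model, n_layer, ssm_cfg, fused_add_norm, residual_in_fp32, rms_norm, pad_vocab_size_multiple) correspond to the config dataclass used by the original state-spaces/mamba codebase (the mamba_ssm package) rather than the transformers-native Mamba config. As a minimal sketch, assuming mamba_ssm is installed and its MambaConfig/MambaLMHeadModel live at the module paths shown, the file could be loaded and turned into a model roughly like this; the filtering step and the parameter count check are illustrative, not documented usage for this repo.

# Sketch: build the Mamba model from this config.json (assumes mamba_ssm is installed).
import json

from mamba_ssm.models.config_mamba import MambaConfig
from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel

with open("config.json") as f:
    cfg_dict = json.load(f)

# Keep only keys the dataclass accepts (drops model_type, transformers_version, etc.).
allowed = MambaConfig.__dataclass_fields__.keys()
config = MambaConfig(**{k: v for k, v in cfg_dict.items() if k in allowed})

model = MambaLMHeadModel(config)
print(sum(p.numel() for p in model.parameters()))  # on the order of 130M parameters

Note that pad_vocab_size_multiple rounds the embedding size up to a multiple of 8 (here 50277 would become 50280) for hardware efficiency, and fused_add_norm / the default ssm_cfg rely on the package's fused Triton kernels at runtime.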