WindFM-robust / config.json
NeoQuasar's picture
Upload WindFM-robust weights and model card
b54cd48 verified
raw
history blame contribute delete
225 Bytes
{
"attn_dropout_p": 0.1,
"d_model": 256,
"ff_dim": 512,
"ffn_dropout_p": 0.2,
"learn_te": true,
"n_heads": 8,
"n_layers": 4,
"resid_dropout_p": 0.2,
"s1_bits": 10,
"s2_bits": 10,
"token_dropout_p": 0.0
}