bobox's picture
Training in progress, step 610, checkpoint
dfb4427 verified
raw
history blame contribute delete
217 Bytes
{
"embed_dim": 768,
"num_heads": 4,
"dropout": 0.05,
"bias": true,
"gate_min": 0.1,
"gate_max": 0.9,
"gate_dropout": 0.1,
"dropout_gate_open": 0.05,
"dropout_gate_close": 0.05,
"CLS_self_attn": 0
}