othello_gpt_model_test / config.json
Thijmen's picture
Update config.json
d79b2b9 verified
raw
history blame contribute delete
No virus
178 Bytes
{
n_layers = 6,
d_model = 128,
d_head = 64,
n_heads = 8,
d_mlp = 512,
d_vocab = 61,
n_ctx = 59,
act_fn="gelu",
normalization_type="LNPre",
}