FLOR-760M / config.json
joanllop's picture
First version of the CaBLOOM-760M model and tokenizer.
b9865de
raw
history blame
No virus
212 Bytes
{
"vocab_size": 50257,
"hidden_size": 1536,
"tie_word_embeddings": true,
"n_layer": 24,
"hidden_dropout": 0.0,
"layer_norm_epsilon": 1e-05,
"n_head": 16,
"attention_dropout": 0.0
}