juancopi81's picture
Add t5x and mt3 models
b100e1c
raw
history blame
177 Bytes
# T5.1.1 Base model.
include 'model.gin'
network.T5Config:
emb_dim = 768
num_heads = 12
num_encoder_layers = 12
num_decoder_layers = 12
head_dim = 64
mlp_dim = 2048