evanhanders's picture
pushes all polysemantic models
a231d74 verified
raw
history blame
79 Bytes
act_fn: relu
d_head: 5
d_model: 20
d_vocab: 4
n_ctx: 16
n_layers: 3
seed: null