pico-decoder-large / learning_dynamics
rdiehlmartinez
pico-decoder-large-1 trained to 125k steps
dc1a407