Training steps

#3
by malteos - opened

Given that this is an intermediate checkpoint, which training step is it from?

BigScience Workshop org

Hi @malteos ! If I am not mistaken it should be global_step156000 :-)

Thanks @ybelkada

That's like 36%, right? (Assuming 220_000_000 train samples with 512 batch size)

BigScience Workshop org

Yeah, I think so. To go over all the training samples once you would need ~430,000 steps, so yes, it corresponds to approximately 36%.
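
For anyone who wants to check the arithmetic, here is a minimal sketch. It assumes the figures quoted in this thread (220_000_000 train samples, batch size 512, checkpoint at global_step156000), which are not verified against the actual training config. It also assumes one optimizer step consumes exactly one batch. The function names are made up for illustration.

```python
import math

# Figures quoted in this thread; treat them as assumptions, not confirmed config values.
TRAIN_SAMPLES = 220_000_000
BATCH_SIZE = 512
CHECKPOINT_STEP = 156_000


def steps_for_one_pass(train_samples: int, batch_size: int) -> int:
    """Number of steps needed to see every sample once (last partial batch counted)."""
    return math.ceil(train_samples / batch_size)


def progress_fraction(step: int, train_samples: int, batch_size: int) -> float:
    """Fraction of one full pass completed at the given step."""
    return step / steps_for_one_pass(train_samples, batch_size)


total_steps = steps_for_one_pass(TRAIN_SAMPLES, BATCH_SIZE)
fraction = progress_fraction(CHECKPOINT_STEP, TRAIN_SAMPLES, BATCH_SIZE)

print(f"steps for one pass: {total_steps:,}")   # 429,688
print(f"progress at checkpoint: {fraction:.1%}")  # 36.3%
```

So 220_000_000 / 512 is about 429,688 steps, and 156,000 of those is roughly 36.3%, which matches the estimate above.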

malteos changed discussion status to closed
