Question about training set

#1
by yuqianli - opened

Excuse me, I want to pretrain a gpt2 on wikitext103, but I meet some problems. Could I know some information about the model training, including learning rate, warm up, gpu number, training time, and final loss? thanks very much.

Sign up or log in to comment