gpt2-xl-lora-multi-512-7-top / train_results.json
MHGanainy/gpt2-xl-lora-multi-512-7
2d507a7 verified
{
    "epoch": 1.0,
    "total_flos": 1.59124040841796e+18,
    "train_loss": 2.428147726093893,
    "train_runtime": 2109.7476,
    "train_samples_per_second": 82.811,
    "train_steps_per_second": 5.176
}
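
The reported throughput figures can be combined with the runtime to recover a few quantities the file does not state directly. The following is a minimal Python sketch, not part of the repository: the `summarize` helper and its output keys are hypothetical names, and the perplexity line assumes `train_loss` is a mean per-token cross-entropy in nats, which the file itself does not confirm. It also assumes `train_runtime` is in seconds.

```python
import json
import math

# Metrics copied verbatim from train_results.json above.
RAW = """
{
    "epoch": 1.0,
    "total_flos": 1.59124040841796e+18,
    "train_loss": 2.428147726093893,
    "train_runtime": 2109.7476,
    "train_samples_per_second": 82.811,
    "train_steps_per_second": 5.176
}
"""


def summarize(results: dict) -> dict:
    """Derive approximate totals from the reported rates (hypothetical helper)."""
    runtime = results["train_runtime"]  # assumed to be seconds
    samples_per_s = results["train_samples_per_second"]
    steps_per_s = results["train_steps_per_second"]
    return {
        # rate * duration gives an approximate count (the rates are rounded)
        "approx_samples": runtime * samples_per_s,
        "approx_steps": runtime * steps_per_s,
        # samples consumed per optimizer step
        "samples_per_step": samples_per_s / steps_per_s,
        # ASSUMPTION: train_loss is mean token-level cross-entropy in nats
        "perplexity": math.exp(results["train_loss"]),
    }


summary = summarize(json.loads(RAW))
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Because the per-second rates are rounded to three decimals, the derived counts are estimates rather than exact values.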