hp_ablations_mistral_lr8e-6 / train_results.json
sedrickkeh's picture
End of training
34ce974 verified
raw
history blame contribute delete
220 Bytes
{
"epoch": 2.9954430379746837,
"total_flos": 2477170706350080.0,
"train_loss": 0.4662262036076585,
"train_runtime": 84837.0324,
"train_samples_per_second": 8.938,
"train_steps_per_second": 0.017
}