Extra-tuned model

#1
by KoRiF - opened

TrainOutput(global_step=280, training_loss=0.007093984703533352, metrics={'train_runtime': 3120.6664, 'train_samples_per_second': 1.44, 'train_steps_per_second': 0.09, 'total_flos': 3.043113204096e+17, 'train_loss': 0.007093984703533352, 'epoch': 4.96})
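As a rough sanity check on the numbers above, the logged metrics can be cross-checked against each other. This is a minimal sketch that only reuses values copied from the TrainOutput line. The derived quantities (samples per optimizer step, samples per epoch) are estimates, since the logged rates are rounded to two decimals, and they are not stated anywhere in this thread.

```python
# Values copied from the TrainOutput line above.
global_step = 280
metrics = {
    "train_runtime": 3120.6664,          # seconds
    "train_samples_per_second": 1.44,    # rounded in the log
    "train_steps_per_second": 0.09,      # rounded in the log
    "epoch": 4.96,
}

# Recompute the step rate from the raw step count and runtime.
steps_per_second = global_step / metrics["train_runtime"]

# Estimates only: these inherit the rounding of the logged sample rate.
samples_seen = metrics["train_samples_per_second"] * metrics["train_runtime"]
samples_per_step = samples_seen / global_step      # approximate effective batch size
samples_per_epoch = samples_seen / metrics["epoch"]  # approximate dataset size

runtime_minutes = metrics["train_runtime"] / 60

print(f"steps/s (recomputed): {steps_per_second:.4f}")
print(f"approx. samples per step: {samples_per_step:.1f}")
print(f"approx. samples per epoch: {samples_per_epoch:.0f}")
print(f"runtime: {runtime_minutes:.1f} min")
```

If the recomputed step rate rounds to the logged `train_steps_per_second`, the reported metrics are at least internally consistent.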

KoRiF changed pull request status to merged
