Writing logs to ./outputs/2024-04-29-21-40-55-381537/train_log.txt. Wrote original training args to ./outputs/2024-04-29-21-40-55-381537/training_args.json. ***** Running training ***** Num examples = 12000 Num epochs = 3 Num clean epochs = 3 Instantaneous batch size per device = 8 Total train batch size (w. parallel, distributed & accumulation) = 8 Gradient accumulation steps = 1 Total optimization steps = 4500 ========================================================== Epoch 1 Running clean epoch 1/3 Train accuracy: 88.57% Eval accuracy: 92.03% Best score found. Saved model to ./outputs/2024-04-29-21-40-55-381537/best_model/ ========================================================== Epoch 2 Running clean epoch 2/3 Train accuracy: 96.39% Eval accuracy: 96.10% Best score found. Saved model to ./outputs/2024-04-29-21-40-55-381537/best_model/ ========================================================== Epoch 3 Running clean epoch 3/3 Train accuracy: 99.27% Eval accuracy: 96.67% Best score found. Saved model to ./outputs/2024-04-29-21-40-55-381537/best_model/ Wrote README to ./outputs/2024-04-29-21-40-55-381537/README.md.