asahi417's picture
model update
b4a1025
{"learning_rate": 8.16845589476017e-06, "num_train_epochs": 3, "per_device_train_batch_size": 32}