Dataset: GLUE
Trainer Hyperparameters:
lr
per_device_batch_size
gradient_accumulation_steps
weight_decay
seed
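
The hyperparameters listed above can be collected into a single typed record. The following is a minimal sketch, not the author's configuration: the class name `TrainerHyperparameters`, both helper methods, and every numeric value are hypothetical and chosen for illustration only. The keyword names returned by `to_training_kwargs` assume the Hugging Face `TrainingArguments` naming (`learning_rate`, `per_device_train_batch_size`), which the source does not confirm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainerHyperparameters:
    """Hypothetical container for the hyperparameters named in the list."""

    lr: float
    per_device_batch_size: int
    gradient_accumulation_steps: int
    weight_decay: float
    seed: int

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Examples consumed per optimizer step across all devices.
        return (
            self.per_device_batch_size
            * self.gradient_accumulation_steps
            * num_devices
        )

    def to_training_kwargs(self) -> dict:
        # Assumption: the target trainer uses Hugging Face
        # TrainingArguments keyword names.
        return {
            "learning_rate": self.lr,
            "per_device_train_batch_size": self.per_device_batch_size,
            "gradient_accumulation_steps": self.gradient_accumulation_steps,
            "weight_decay": self.weight_decay,
            "seed": self.seed,
        }


# Placeholder values for illustration only; none come from the source.
example = TrainerHyperparameters(
    lr=2e-5,
    per_device_batch_size=16,
    gradient_accumulation_steps=2,
    weight_decay=0.01,
    seed=42,
)
print(example.effective_batch_size(num_devices=4))  # 16 * 2 * 4 = 128
```

Keeping the effective batch size as a derived value makes it easier to hold it constant when per_device_batch_size and gradient_accumulation_steps are traded against each other.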