pgajo/mdeberta-xlwa-en-it

Model description:

Model: mdeberta

Dataset: TASTEset

Unshuffled ratio: []

Shuffled ratio: []

Best exact match epoch: 3

Best exact match: 82.32

Best epoch: 3

Drop duplicates: []

Max epochs: 10

Optimizer lr: 3e-05

Optimizer eps: 1e-08

Batch size: 32

Dataset path: pgajo/mdeberta_xlwa_en-it

Results

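| epoch | train_loss | train_f1 | train_exact | dev_loss | dev_f1 | dev_exact |
|-------|------------|----------|-------------|----------|--------|-----------|
| 1 | 0.32 | 91.77 | 90.75 | 1.26 | 81.86 | 80.66 |
| 2 | 0.04 | 98.92 | 98.74 | 1.67 | 82.05 | 81.26 |
| 3 | 0.02 | 99.40 | 99.29 | 1.46 | 82.58 | 82.32 |
| 4 | 0.02 | 99.53 | 99.45 | 1.73 | 82.60 | 81.97 |
| 5 | 0.01 | 99.71 | 99.65 | 1.63 | 82.05 | 81.52 |
| 6 | 0.01 | 99.66 | 99.59 | 1.86 | 82.16 | 82.02 |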
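
The best-epoch figures reported in the model description can be cross-checked against the per-epoch results. The snippet below is a minimal sketch of that check, not part of the original training code: the `best_epoch` helper and the `COLUMNS`/`ROWS` names are hypothetical, and the numbers are copied by hand from the results table. It selects the epoch with the highest value of a given dev metric.

```python
# Per-epoch metrics copied from the results table in this card.
COLUMNS = (
    "epoch", "train_loss", "train_f1", "train_exact",
    "dev_loss", "dev_f1", "dev_exact",
)
ROWS = [
    (1, 0.32, 91.77, 90.75, 1.26, 81.86, 80.66),
    (2, 0.04, 98.92, 98.74, 1.67, 82.05, 81.26),
    (3, 0.02, 99.40, 99.29, 1.46, 82.58, 82.32),
    (4, 0.02, 99.53, 99.45, 1.73, 82.60, 81.97),
    (5, 0.01, 99.71, 99.65, 1.63, 82.05, 81.52),
    (6, 0.01, 99.66, 99.59, 1.86, 82.16, 82.02),
]
results = [dict(zip(COLUMNS, row)) for row in ROWS]


def best_epoch(rows, metric):
    """Hypothetical helper: return the row with the highest `metric`.

    Ties go to the earliest epoch, because max() keeps the first maximum.
    """
    return max(rows, key=lambda r: r[metric])


best_exact = best_epoch(results, "dev_exact")
best_f1 = best_epoch(results, "dev_f1")

print(f"best dev_exact: epoch {best_exact['epoch']} ({best_exact['dev_exact']:.2f})")
print(f"best dev_f1:    epoch {best_f1['epoch']} ({best_f1['dev_f1']:.2f})")
```

Selecting on `dev_exact` reproduces the reported best epoch (3) and best exact match (82.32). Selecting on `dev_f1` would instead pick epoch 4, by a margin of 0.02 points, so the reported "Best epoch" appears to be chosen by exact match.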