Model description:
Model: microsoft/mdeberta-v3-base
Dataset: TASTEset
Unshuffled ratio: ['0']
Shuffled ratio: ['1']
Best exact match epoch: 4
Best exact match: 96.43
Best epoch: 4
Drop duplicates: ['1']
Max epochs: 10
Optimizer lr: 3e-05
Optimizer eps: 1e-08
Batch size: 8
Dataset path = pgajo/EW-TT-PE_U0_S1_Tingredient_DROP1_mdeberta
Results
epoch | train_loss | train_f1 | train_exact | dev_loss | dev_f1 | dev_exact | test_loss | test_f1 | test_exact |
---|---|---|---|---|---|---|---|---|---|
1 | 2.23 | 40.25 | 31.82 | 0.36 | 91.75 | 88.46 | 0 | 0 | 0 |
2 | 0.27 | 92.71 | 90.01 | 0.19 | 95.5 | 93.68 | 0 | 0 | 0 |
3 | 0.1 | 96.91 | 95.73 | 0.15 | 97.05 | 95.88 | 0 | 0 | 0 |
4 | 0.06 | 98.38 | 97.45 | 0.12 | 96.95 | 96.43 | 0 | 0 | 0 |
5 | 0.05 | 98.66 | 97.66 | 0.15 | 96.43 | 93.68 | 0 | 0 | 0 |
6 | 0.06 | 98.48 | 97.66 | 0.14 | 95.86 | 94.51 | 0 | 0 | 0 |
7 | 0.04 | 98.85 | 98.28 | 0.17 | 95.83 | 95.05 | 0 | 0 | 0 |
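
The reported best epoch can be cross-checked against the table with a short script. This is a minimal sketch, not the training code: it assumes the best epoch is the one with the highest `dev_exact`, with ties going to the earliest epoch. The helper name `best_epoch` and the `dev_exact` dictionary are hypothetical, and the values are copied from the table above.

```python
# Dev-set exact-match scores per epoch, copied from the results table.
dev_exact = {1: 88.46, 2: 93.68, 3: 95.88, 4: 96.43, 5: 93.68, 6: 94.51, 7: 95.05}


def best_epoch(scores):
    """Return (epoch, score) for the highest score.

    Assumption: ties are resolved in favour of the earliest epoch.
    max() keeps the first maximum it sees, so iterating in epoch order
    gives that behaviour.
    """
    return max(sorted(scores.items()), key=lambda item: item[1])


epoch, score = best_epoch(dev_exact)
print(f"Best exact match epoch: {epoch} ({score:.2f})")
```

Under that assumption the script selects epoch 4 with a dev exact match of 96.43, which agrees with the values listed in the model description.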