Model description:
Model: microsoft/mdeberta-v3-base
Dataset: TASTEset
Unshuffled ratio: ['1']
Shuffled ratio: ['0']
Best exact match epoch: 2
Best exact match: 99.72
Best epoch: 2
Drop duplicates: ['1']
Max epochs = 10
Optimizer lr = 3e-05
Optimizer eps = 1e-08
Batch size = 8
Dataset path = pgajo/EW-TT-PE_U1_S0_DROP1_mdeberta
Results
epoch | train_loss | train_f1 | train_exact | dev_loss | dev_f1 | dev_exact | test_loss | test_f1 | test_exact |
---|---|---|---|---|---|---|---|---|---|
1 | 1.1 | 75.67 | 72.08 | 0.04 | 99.63 | 98.9 | 0 | 0 | 0 |
2 | 0.05 | 99.41 | 98.96 | 0.02 | 99.95 | 99.72 | 0 | 0 | 0 |
3 | 0.03 | 99.51 | 99.03 | 0.01 | 99.95 | 99.72 | 0 | 0 | 0 |
4 | 0.01 | 99.82 | 99.72 | 0.01 | 99.95 | 99.72 | 0 | 0 | 0 |
5 | 0.01 | 99.83 | 99.79 | 0.01 | 99.72 | 99.17 | 0 | 0 | 0 |
- Downloads last month
- 3