Edit model card

Model description:

Model: microsoft/mdeberta-v3-base

Dataset: TASTEset

Unshuffled ratio: ['1']

Shuffled ratio: ['0']

Best exact match epoch: 2

Best exact match: 99.72

Best epoch: 2

Drop duplicates: ['1']

Max epochs = 10

Optimizer lr = 3e-05

Optimizer eps = 1e-08

Batch size = 8

Dataset path = pgajo/EW-TT-PE_U1_S0_DROP1_mdeberta

Results

epoch train_loss train_f1 train_exact dev_loss dev_f1 dev_exact test_loss test_f1 test_exact
1 1.1 75.67 72.08 0.04 99.63 98.9 0 0 0
2 0.05 99.41 98.96 0.02 99.95 99.72 0 0 0
3 0.03 99.51 99.03 0.01 99.95 99.72 0 0 0
4 0.01 99.82 99.72 0.01 99.95 99.72 0 0 0
5 0.01 99.83 99.79 0.01 99.72 99.17 0 0 0
Downloads last month
3
Safetensors
Model size
278M params
Tensor type
F32