Knowledge Continuity Regularized Network
Dataset: ANLI Round: None
Trainer Hyperparameters:
lr
= 5e-05per_device_batch_size
= 2gradient_accumulation_steps
= 4weight_decay
= 0.0seed
= 42
Regularization Hyperparameters
numerical stability denominator constant
= 0.01lambda
= 0.0001alpha
= 2.0beta
= 2.0
Extended Logs:
eval_loss | eval_accuracy | epoch |
---|---|---|
1.070 | 0.413 | 1.0 |
1.110 | 0.407 | 2.0 |
1.115 | 0.416 | 3.0 |
1.108 | 0.431 | 4.0 |
1.108 | 0.428 | 5.0 |
1.119 | 0.413 | 6.0 |
1.102 | 0.438 | 7.0 |
1.107 | 0.429 | 8.0 |
1.101 | 0.439 | 9.0 |
1.101 | 0.434 | 10.0 |
1.110 | 0.428 | 11.0 |
1.102 | 0.442 | 12.0 |
1.110 | 0.430 | 13.0 |
1.093 | 0.455 | 14.0 |
1.105 | 0.434 | 15.0 |
1.106 | 0.435 | 16.0 |
1.105 | 0.439 | 17.0 |
1.099 | 0.441 | 18.0 |
1.099 | 0.443 | 19.0 |
Test Accuracy: 0.445
- Downloads last month
- 0
Unable to determine this model’s pipeline type. Check the
docs
.