Knowledge Continuity Regularized Network
Dataset: ANLI
Round: None
Trainer Hyperparameters:
- lr = 5e-05
- per_device_batch_size = 8
- gradient_accumulation_steps = 2
- weight_decay = 0.0
- seed = 42
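
The batch size and gradient accumulation settings above combine into an effective batch size per optimizer step. The snippet below is a minimal sketch of that arithmetic, not the training script used for this model: the `TrainerHyperparameters` class and the `num_devices` argument are hypothetical names introduced for illustration, and the card does not state how many devices were used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainerHyperparameters:
    """Hypothetical container for the values listed in this card."""
    lr: float = 5e-05
    per_device_batch_size: int = 8
    gradient_accumulation_steps: int = 2
    weight_decay: float = 0.0
    seed: int = 42

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Examples contributing to one optimizer step.
        # num_devices is an assumption; the card does not report it.
        return (
            self.per_device_batch_size
            * self.gradient_accumulation_steps
            * num_devices
        )


hp = TrainerHyperparameters()
print(hp.effective_batch_size())  # 8 * 2 * 1 = 16 on a single device
```

On a single device this gives 16 examples per optimizer step; the figure scales linearly with the device count.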
Regularization Hyperparameters:
- numerical stability denominator constant = 0.01
- lambda = 0.0001
- alpha = 2.0
- beta = 2.0
Extended Logs:
eval_loss | eval_accuracy | epoch |
---|---|---|
35.403 | 0.419 | 1.0 |
35.321 | 0.427 | 2.0 |
35.356 | 0.426 | 3.0 |
35.000 | 0.443 | 4.0 |
34.783 | 0.447 | 5.0 |
34.693 | 0.453 | 6.0 |
34.950 | 0.443 | 7.0 |
35.001 | 0.443 | 8.0 |
34.699 | 0.453 | 9.0 |
35.112 | 0.442 | 10.0 |
34.913 | 0.448 | 11.0 |
34.830 | 0.452 | 12.0 |
35.178 | 0.437 | 13.0 |
35.007 | 0.443 | 14.0 |
Test Accuracy: 0.443
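
To read the log table at a glance, the sketch below copies the rows from the Extended Logs and picks out the epoch with the lowest evaluation loss and the epochs with the highest evaluation accuracy. The variable names are illustrative only, and nothing here indicates which checkpoint was actually used to produce the reported test accuracy.

```python
# Rows copied from the Extended Logs table: (eval_loss, eval_accuracy, epoch)
logs = [
    (35.403, 0.419, 1),
    (35.321, 0.427, 2),
    (35.356, 0.426, 3),
    (35.000, 0.443, 4),
    (34.783, 0.447, 5),
    (34.693, 0.453, 6),
    (34.950, 0.443, 7),
    (35.001, 0.443, 8),
    (34.699, 0.453, 9),
    (35.112, 0.442, 10),
    (34.913, 0.448, 11),
    (34.830, 0.452, 12),
    (35.178, 0.437, 13),
    (35.007, 0.443, 14),
]

# Epoch with the lowest evaluation loss.
best_loss, _, best_loss_epoch = min(logs, key=lambda row: row[0])

# All epochs that tie for the highest evaluation accuracy.
best_acc = max(row[1] for row in logs)
best_acc_epochs = [row[2] for row in logs if row[1] == best_acc]

print(f"lowest eval_loss {best_loss} at epoch {best_loss_epoch}")
print(f"highest eval_accuracy {best_acc} at epochs {best_acc_epochs}")
```

In these logs the lowest evaluation loss (34.693) occurs at epoch 6, and the highest evaluation accuracy (0.453) is reached at epochs 6 and 9.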