metadata
language: en
license: mit
library_name: pytorch

Knowledge Continuity Regularized Network

Trainer Hyperparameters:

  • lr = 5e-05
  • per_device_batch_size = 8
  • gradient_accumulation_steps = 2
  • weight_decay = 1e-09
  • seed = 42
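
As a rough illustration of how these settings combine, the sketch below collects them in a small config object and derives the effective batch size per optimizer step. The `TrainerConfig` class and the `num_devices` parameter are hypothetical names introduced here; the card does not state how many devices were used, so a single device is assumed by default.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainerConfig:
    """Hypothetical container for the trainer hyperparameters listed above."""
    lr: float = 5e-05
    per_device_batch_size: int = 8
    gradient_accumulation_steps: int = 2
    weight_decay: float = 1e-09
    seed: int = 42

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Examples contributing to one optimizer step:
        # per-device batch * accumulation steps * number of devices.
        # num_devices is an assumption; the card does not report it.
        return (
            self.per_device_batch_size
            * self.gradient_accumulation_steps
            * num_devices
        )


config = TrainerConfig()
print(config.effective_batch_size())
```

On one device this gives 8 × 2 = 16 examples per optimizer step.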

Regularization Hyperparameters:

  • numerical stability denominator constant = 0.001
  • lambda = 0.01
  • alpha = 2.0
  • beta = 2.0
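
The card does not spell out how these values enter the objective, so the following is only a sketch under stated assumptions: that the stability constant is added to a denominator to avoid division by zero, and that lambda weights a penalty added to the task loss. The function names `stabilized_ratio` and `regularized_loss` are hypothetical, and alpha and beta are left out because their role is not defined here.

```python
EPS = 0.001    # numerical stability denominator constant (from the card)
LAMBDA = 0.01  # regularization weight (from the card)


def stabilized_ratio(loss_a: float, loss_b: float,
                     distance: float, eps: float = EPS) -> float:
    # Assumed form: a loss difference divided by a distance, with eps
    # added to the denominator so a zero distance does not divide by zero.
    return abs(loss_a - loss_b) / (distance + eps)


def regularized_loss(task_loss: float, penalty: float,
                     lam: float = LAMBDA) -> float:
    # Assumed form: task loss plus a lambda-weighted penalty term.
    return task_loss + lam * penalty


penalty = stabilized_ratio(2.0, 1.0, distance=0.5)
print(regularized_loss(task_loss=1.0, penalty=penalty))
```

With a constant of 0.001, the ratio is capped at 1000 times the loss difference when the distance is zero, which is the practical effect of such a term.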

Extended Logs:

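| eval_loss | eval_accuracy | epoch |
|-----------|---------------|-------|
| 14.430    | 0.792         | 0.67  |
| 14.131    | 0.792         | 2.0   |
| 13.810    | 0.875         | 2.67  |
| 13.640    | 0.875         | 4.0   |
| 13.667    | 0.875         | 4.67  |
| 13.247    | 0.875         | 6.0   |
| 12.928    | 0.875         | 6.67  |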
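
To make the trend in these logs easier to read, the short sketch below loads the rows above into a list and reports the checkpoint with the lowest eval_loss, the first epoch at which the best eval_accuracy appears, and any epochs where eval_loss rose relative to the previous evaluation. The variable names are illustrative only; the numbers are copied from the logs.

```python
# (eval_loss, eval_accuracy, epoch) rows copied from the logs above.
LOGS = [
    (14.430, 0.792, 0.67),
    (14.131, 0.792, 2.0),
    (13.810, 0.875, 2.67),
    (13.640, 0.875, 4.0),
    (13.667, 0.875, 4.67),
    (13.247, 0.875, 6.0),
    (12.928, 0.875, 6.67),
]

# Row with the lowest evaluation loss.
best_loss_row = min(LOGS, key=lambda row: row[0])

# Highest accuracy and the first epoch at which it was reached.
best_accuracy = max(row[1] for row in LOGS)
first_best_epoch = next(row[2] for row in LOGS if row[1] == best_accuracy)

# Epochs where eval_loss increased compared with the previous evaluation.
loss_increases = [
    curr[2] for prev, curr in zip(LOGS, LOGS[1:]) if curr[0] > prev[0]
]

print(best_loss_row, best_accuracy, first_best_epoch, loss_increases)
```

In these logs, eval_accuracy plateaus at 0.875 from epoch 2.67 onward while eval_loss keeps falling, apart from a small rise at epoch 4.67.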