metadata
tags:
  - generated_from_trainer
model-index:
  - name: calculator_model_test
    results: []

calculator_model_test

This model is a fine-tuned version of an unspecified base model, trained on an unspecified dataset (neither was recorded when this card was generated). It achieves the following results on the evaluation set:

  • Loss: 0.1675

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.001
  • train_batch_size: 512
  • eval_batch_size: 512
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 40
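As a rough illustration of how the values above fit together, the sketch below collects them under the keyword names used by `transformers.TrainingArguments` and reimplements a linear learning-rate schedule in plain Python. Several things here are assumptions rather than facts from this card: a single device (so the batch size of 512 is per device), zero warmup steps, and a total of 240 optimizer steps, which is taken from the last row of the training results. The helper `linear_lr` is a hypothetical name for illustration, not a library function.

```python
# Hyperparameters from the list above, keyed by transformers.TrainingArguments names.
# Assumption: one device, so the reported batch size of 512 is the per-device value.
training_kwargs = {
    "learning_rate": 1e-3,
    "per_device_train_batch_size": 512,
    "per_device_eval_batch_size": 512,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 40,
}


def linear_lr(step, base_lr=1e-3, total_steps=240, warmup_steps=0):
    """Hypothetical helper: linear warmup followed by linear decay to zero.

    Assumptions: warmup_steps=0 (not stated in the card) and
    total_steps=240 (40 epochs x 6 steps, per the results table).
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, the midpoint, and the end of training.
for step in (0, 120, 240):
    print(step, linear_lr(step))
```

Under these assumptions the learning rate starts at 0.001, is halved by step 120 (epoch 20), and reaches zero at step 240. With `transformers` installed, the dictionary could be passed as `TrainingArguments(output_dir="out", **training_kwargs)`, where `"out"` is a placeholder path.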

Training results

| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 2.9654 | 1.0 | 6 | 2.2731 |
| 2.0238 | 2.0 | 12 | 1.7268 |
| 1.5275 | 3.0 | 18 | 1.2838 |
| 1.187 | 4.0 | 24 | 1.0874 |
| 1.0242 | 5.0 | 30 | 0.9517 |
| 0.9122 | 6.0 | 36 | 0.8367 |
| 0.8028 | 7.0 | 42 | 0.8002 |
| 0.762 | 8.0 | 48 | 0.7533 |
| 0.6956 | 9.0 | 54 | 0.6658 |
| 0.6503 | 10.0 | 60 | 0.6102 |
| 0.6071 | 11.0 | 66 | 0.5720 |
| 0.5785 | 12.0 | 72 | 0.5378 |
| 0.5367 | 13.0 | 78 | 0.5026 |
| 0.5118 | 14.0 | 84 | 0.5035 |
| 0.4987 | 15.0 | 90 | 0.4639 |
| 0.4704 | 16.0 | 96 | 0.4410 |
| 0.441 | 17.0 | 102 | 0.4364 |
| 0.4407 | 18.0 | 108 | 0.4112 |
| 0.4173 | 19.0 | 114 | 0.3852 |
| 0.4001 | 20.0 | 120 | 0.3590 |
| 0.3669 | 21.0 | 126 | 0.3375 |
| 0.3402 | 22.0 | 132 | 0.3207 |
| 0.3276 | 23.0 | 138 | 0.3187 |
| 0.3205 | 24.0 | 144 | 0.2961 |
| 0.2985 | 25.0 | 150 | 0.2673 |
| 0.2897 | 26.0 | 156 | 0.2524 |
| 0.2611 | 27.0 | 162 | 0.2408 |
| 0.2597 | 28.0 | 168 | 0.2408 |
| 0.2523 | 29.0 | 174 | 0.2277 |
| 0.2385 | 30.0 | 180 | 0.2111 |
| 0.2269 | 31.0 | 186 | 0.2068 |
| 0.2272 | 32.0 | 192 | 0.2047 |
| 0.2174 | 33.0 | 198 | 0.1927 |
| 0.2097 | 34.0 | 204 | 0.1848 |
| 0.1966 | 35.0 | 210 | 0.1817 |
| 0.1894 | 36.0 | 216 | 0.1768 |
| 0.1939 | 37.0 | 222 | 0.1760 |
| 0.1884 | 38.0 | 228 | 0.1701 |
| 0.1863 | 39.0 | 234 | 0.1683 |
| 0.1824 | 40.0 | 240 | 0.1675 |
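The step counts in these results allow a small back-of-the-envelope check, sketched below. It derives the number of optimizer steps per epoch and, from that, a plausible range for the training set size, which the card does not state. The range holds only under assumptions that are not confirmed here: a single device, no gradient accumulation, and a final partial batch that is kept rather than dropped. The snippet also computes how much the validation loss improved over the last three epochs.

```python
# Values read from the last row of the results and from the hyperparameter list.
final_epoch, final_step = 40, 240
train_batch_size = 512

steps_per_epoch = final_step // final_epoch

# Assumption: one device, no gradient accumulation, last partial batch kept,
# so steps_per_epoch == ceil(num_examples / train_batch_size).
min_examples = (steps_per_epoch - 1) * train_batch_size + 1
max_examples = steps_per_epoch * train_batch_size

# Validation loss at epochs 38, 39, and 40, copied from the results.
late_val_losses = [0.1701, 0.1683, 0.1675]
late_deltas = [
    round(previous - current, 4)
    for previous, current in zip(late_val_losses, late_val_losses[1:])
]

print("steps per epoch:", steps_per_epoch)
print("implied training set size:", min_examples, "to", max_examples)
print("validation loss improvement per epoch (last two):", late_deltas)
```

With six steps per epoch, the training set would hold between 2,561 and 3,072 examples if the assumptions hold. The per-epoch improvement in validation loss has shrunk to roughly 0.001 by the end, which suggests the run was close to a plateau when the learning rate reached zero.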

Framework versions

  • Transformers 4.37.2
  • Pytorch 2.1.0+cu121
  • Datasets 2.17.1
  • Tokenizers 0.15.2
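
To reproduce the run it may help to check a local environment against the versions listed above. The following is a minimal sketch using the standard library's `importlib.metadata`; the distribution names (`transformers`, `torch`, `datasets`, `tokenizers`) are the usual PyPI names and are an assumption about how the packages were installed, and `compare_versions` is a hypothetical helper name.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this card, keyed by their (assumed) PyPI distribution names.
EXPECTED = {
    "transformers": "4.37.2",
    "torch": "2.1.0+cu121",
    "datasets": "2.17.1",
    "tokenizers": "0.15.2",
}


def compare_versions(expected=EXPECTED):
    """Return {package: (wanted, installed, matches)}; installed is None if missing."""
    report = {}
    for package, wanted in expected.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None
        report[package] = (wanted, installed, installed == wanted)
    return report


for package, (wanted, installed, matches) in compare_versions().items():
    status = "ok" if matches else "differs"
    print(f"{package}: wanted {wanted}, found {installed} ({status})")
```

A mismatch is not necessarily a problem, since nearby versions often behave the same, but it is a reasonable first thing to rule out if results differ.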