BartekSadlej
End of training
6b0ae77 verified
metadata
tags:
  - generated_from_trainer
model-index:
  - name: calculator_model_test
    results: []

calculator_model_test

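This model is a fine-tuned version of an unspecified base model; the auto-generated card names neither the base model nor the dataset (the dataset is recorded as None). It achieves the following results on the evaluation set: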

  • Loss: 0.0923

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.001
  • train_batch_size: 512
  • eval_batch_size: 512
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 40
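As a rough illustration of what `lr_scheduler_type: linear` implies for these settings, the sketch below computes the learning rate at a given optimizer step. It rests on assumptions not stated in the card: zero warmup steps (none are listed) and 6 optimizer steps per epoch, a figure read off the training results table (240 steps over 40 epochs). The names `linear_lr`, `STEPS_PER_EPOCH`, and `WARMUP_STEPS` are hypothetical and are not part of the original training code.

```python
# Hyperparameters copied from the list above.
LEARNING_RATE = 1e-3
NUM_EPOCHS = 40

# Assumption: 6 optimizer steps per epoch, inferred from the results table.
STEPS_PER_EPOCH = 6
# Assumption: no warmup, since the card does not list any.
WARMUP_STEPS = 0

TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step: int) -> float:
    """Learning rate at `step` under linear warmup followed by linear decay to zero."""
    if step < WARMUP_STEPS:
        return LEARNING_RATE * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return LEARNING_RATE * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # Print the learning rate at a few epoch boundaries.
    for epoch in (0, 10, 20, 30, 40):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch:2d}  step {step:3d}  lr {linear_lr(step):.6f}")
```

Under these assumptions the rate starts at 0.001, is halved by the end of epoch 20, and reaches zero at step 240. That would be consistent with the shrinking validation-loss improvements in the final epochs, though the card itself does not confirm the schedule details.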

Training results

Training Loss   Epoch   Step   Validation Loss
3.0524          1.0     6      2.3364
2.0601          2.0     12     1.7532
1.5868          3.0     18     1.3439
1.2198          4.0     24     1.0697
1.0324          5.0     30     0.9496
0.8954          6.0     36     0.8209
0.7853          7.0     42     0.7529
0.7118          8.0     48     0.6709
0.6574          9.0     54     0.6452
0.6368          10.0    60     0.5870
0.5746          11.0    66     0.5092
0.5092          12.0    72     0.4923
0.4829          13.0    78     0.4596
0.4542          14.0    84     0.4221
0.4286          15.0    90     0.3805
0.3987          16.0    96     0.3707
0.3704          17.0    102    0.3424
0.3639          18.0    108    0.3204
0.3297          19.0    114    0.2918
0.3085          20.0    120    0.2683
0.2814          21.0    126    0.2524
0.2667          22.0    132    0.2489
0.2593          23.0    138    0.2238
0.2427          24.0    144    0.2168
0.2318          25.0    150    0.2048
0.2260          26.0    156    0.2018
0.2085          27.0    162    0.1860
0.1976          28.0    168    0.1800
0.1928          29.0    174    0.1621
0.1814          30.0    180    0.1551
0.1758          31.0    186    0.1398
0.1598          32.0    192    0.1291
0.1541          33.0    198    0.1206
0.1411          34.0    204    0.1097
0.1371          35.0    210    0.1042
0.1299          36.0    216    0.1015
0.1273          37.0    222    0.0955
0.1214          38.0    228    0.0944
0.1161          39.0    234    0.0935
0.1188          40.0    240    0.0923
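The card gives no dataset details, but the step counts above allow a rough bound on the training set size. The sketch below assumes a single device, no gradient accumulation, and that the last partial batch of each epoch is kept; none of these is confirmed by the card. The helper name `dataset_size_bounds` is hypothetical.

```python
import math

# Values taken from the card: batch size from the hyperparameters,
# steps per epoch from the results table (step 6 at epoch 1.0).
TRAIN_BATCH_SIZE = 512
STEPS_PER_EPOCH = 6


def dataset_size_bounds(batch_size: int, steps_per_epoch: int) -> tuple[int, int]:
    """Smallest and largest N such that ceil(N / batch_size) == steps_per_epoch."""
    smallest = (steps_per_epoch - 1) * batch_size + 1
    largest = steps_per_epoch * batch_size
    return smallest, largest


if __name__ == "__main__":
    low, high = dataset_size_bounds(TRAIN_BATCH_SIZE, STEPS_PER_EPOCH)
    # Sanity check: both ends of the range really give 6 steps per epoch.
    assert math.ceil(low / TRAIN_BATCH_SIZE) == STEPS_PER_EPOCH
    assert math.ceil(high / TRAIN_BATCH_SIZE) == STEPS_PER_EPOCH
    print(f"training set size between {low} and {high} examples")
    # prints "training set size between 2561 and 3072 examples"
```

If training used several devices or gradient accumulation, the effective batch per step would differ and these bounds would not apply.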

Framework versions

  • Transformers 4.38.1
  • Pytorch 2.1.0+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2