laplasian's picture
End of training
9ca2a1a verified
metadata
tags:
  - generated_from_trainer
model-index:
  - name: calculator_model_test
    results: []

calculator_model_test

This model is a fine-tuned version of on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8124

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.001
  • train_batch_size: 512
  • eval_batch_size: 512
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 40

Training results

Training Loss Epoch Step Validation Loss
3.3916 1.0 6 2.7549
2.4048 2.0 12 1.9876
1.8591 3.0 18 1.8948
1.6663 4.0 24 1.6188
1.586 5.0 30 1.5494
1.5053 6.0 36 1.6146
1.5878 7.0 42 1.5655
1.5183 8.0 48 1.5487
1.5695 9.0 54 1.5368
1.5206 10.0 60 1.5189
1.4748 11.0 66 1.5096
1.4616 12.0 72 1.4969
1.4502 13.0 78 1.4454
1.4039 14.0 84 1.4019
1.3864 15.0 90 1.3711
1.3878 16.0 96 1.3687
1.3034 17.0 102 1.2939
1.2768 18.0 108 1.3036
1.2649 19.0 114 1.2028
1.26 20.0 120 1.1679
1.198 21.0 126 1.2472
1.1989 22.0 132 1.2993
1.2132 23.0 138 1.0975
1.1436 24.0 144 1.0720
1.0686 25.0 150 1.1057
1.0627 26.0 156 1.0181
1.0 27.0 162 0.9821
1.0395 28.0 168 0.9878
0.9847 29.0 174 0.9409
0.9655 30.0 180 0.9396
0.9791 31.0 186 0.9019
0.9318 32.0 192 0.8818
0.9103 33.0 198 0.8827
0.9049 34.0 204 0.8853
0.9553 35.0 210 0.8960
0.8924 36.0 216 0.8598
0.9028 37.0 222 0.8312
0.8601 38.0 228 0.8177
0.8744 39.0 234 0.8153
0.8442 40.0 240 0.8124

Framework versions

  • Transformers 4.38.1
  • Pytorch 2.1.0+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2