Edit model card

calculator_model_test

This model is a fine-tuned version of on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4927

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.001
  • train_batch_size: 512
  • eval_batch_size: 512
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 40

Training results

Training Loss Epoch Step Validation Loss
2.6077 1.0 41 2.0310
1.6982 2.0 82 1.5525
1.5473 3.0 123 1.5432
1.5418 4.0 164 1.5405
1.5401 5.0 205 1.5405
1.5387 6.0 246 1.5401
1.5366 7.0 287 1.5359
1.5292 8.0 328 1.5337
1.5289 9.0 369 1.5313
1.5248 10.0 410 1.5324
1.5174 11.0 451 1.5129
1.4809 12.0 492 1.4740
1.4456 13.0 533 1.4299
1.3497 14.0 574 1.2158
1.1149 15.0 615 0.9632
0.8827 16.0 656 0.7687
0.7432 17.0 697 0.6696
0.6651 18.0 738 0.6081
0.614 19.0 779 0.5694
0.5734 20.0 820 0.5423
0.5422 21.0 861 0.5248
0.5221 22.0 902 0.5087
0.5089 23.0 943 0.5026
0.4984 24.0 984 0.4946
0.4857 25.0 1025 0.4880
0.4778 26.0 1066 0.4881
0.4739 27.0 1107 0.4882
0.4698 28.0 1148 0.4853
0.4628 29.0 1189 0.4862
0.4581 30.0 1230 0.4849
0.4537 31.0 1271 0.4839
0.4504 32.0 1312 0.4871
0.4467 33.0 1353 0.4867
0.4438 34.0 1394 0.4886
0.4395 35.0 1435 0.4899
0.4362 36.0 1476 0.4903
0.4338 37.0 1517 0.4910
0.4304 38.0 1558 0.4913
0.4274 39.0 1599 0.4928
0.4252 40.0 1640 0.4927

Framework versions

  • Transformers 4.38.1
  • Pytorch 2.1.0+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2
Downloads last month
5
Safetensors
Model size
7.82M params
Tensor type
F32
·