ludziej's picture
End of training
f35d3a3 verified
|
raw
history blame
2.18 kB
metadata
tags:
  - generated_from_trainer
model-index:
  - name: calculator_model_test
    results: []

calculator_model_test

This model is a fine-tuned version of on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4351

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.001
  • train_batch_size: 512
  • eval_batch_size: 512
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20

Training results

Training Loss Epoch Step Validation Loss
3.0154 1.0 6 2.2475
2.0355 2.0 12 1.7612
1.6034 3.0 18 1.3483
1.2488 4.0 24 1.1102
1.0428 5.0 30 0.9643
0.9306 6.0 36 0.8449
0.8076 7.0 42 0.7526
0.7471 8.0 48 0.6843
0.6988 9.0 54 0.6786
0.6455 10.0 60 0.6155
0.6157 11.0 66 0.5713
0.5822 12.0 72 0.5486
0.5499 13.0 78 0.5163
0.526 14.0 84 0.4938
0.5133 15.0 90 0.4823
0.4917 16.0 96 0.4683
0.4792 17.0 102 0.4520
0.4677 18.0 108 0.4449
0.4616 19.0 114 0.4378
0.4502 20.0 120 0.4351

Framework versions

  • Transformers 4.37.2
  • Pytorch 2.1.0+cu121
  • Datasets 2.17.1
  • Tokenizers 0.15.2