Edit model card

ServiModel

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.9224
  • Accuracy: 0.5437

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 1

Training results

Training Loss Epoch Step Validation Loss Accuracy
0.9203 0.0120 3 1.0271 0.5264
0.7804 0.0239 6 0.9705 0.5287
0.8766 0.0359 9 0.9562 0.5299
0.8149 0.0478 12 0.9631 0.5345
0.8701 0.0598 15 0.9796 0.5379
0.992 0.0717 18 0.9764 0.5276
0.6035 0.0837 21 0.9932 0.5356
0.7398 0.0956 24 1.0167 0.5368
1.1153 0.1076 27 1.0184 0.5322
1.0794 0.1195 30 0.9807 0.5471
0.9069 0.1315 33 0.9571 0.5368
1.0911 0.1434 36 0.9478 0.5356
0.856 0.1554 39 0.9417 0.5276
0.7464 0.1673 42 0.9329 0.5368
1.0068 0.1793 45 0.9318 0.5402
0.8869 0.1912 48 0.9383 0.5402
1.0068 0.2032 51 0.9451 0.5517
0.9831 0.2151 54 0.9459 0.5460
1.0295 0.2271 57 0.9345 0.5356
0.6741 0.2390 60 0.9517 0.5414
0.9761 0.2510 63 0.9543 0.5448
0.7699 0.2629 66 0.9568 0.5368
0.9349 0.2749 69 0.9716 0.5287
0.8818 0.2869 72 0.9599 0.5322
0.857 0.2988 75 0.9496 0.5345
0.8135 0.3108 78 0.9521 0.5310
0.727 0.3227 81 0.9635 0.5402
0.9652 0.3347 84 0.9609 0.5414
0.7455 0.3466 87 0.9702 0.5299
0.7711 0.3586 90 0.9836 0.5230
0.7348 0.3705 93 1.0026 0.5276
0.8274 0.3825 96 1.0305 0.5368
0.9674 0.3944 99 1.0415 0.5276
1.0927 0.4064 102 1.0280 0.5253
1.1576 0.4183 105 0.9987 0.5356
0.8832 0.4303 108 0.9807 0.5322
0.5615 0.4422 111 0.9751 0.5437
1.2028 0.4542 114 0.9600 0.5391
0.8733 0.4661 117 0.9601 0.5448
0.8855 0.4781 120 0.9657 0.5391
1.0181 0.4900 123 0.9632 0.5379
0.8833 0.5020 126 0.9461 0.5379
1.0474 0.5139 129 0.9384 0.5425
1.173 0.5259 132 0.9260 0.5529
1.0996 0.5378 135 0.9186 0.5552
0.6445 0.5498 138 0.9149 0.5586
0.7965 0.5618 141 0.9176 0.5540
1.0051 0.5737 144 0.9213 0.5517
0.8613 0.5857 147 0.9298 0.5506
0.8603 0.5976 150 0.9345 0.5460
0.8247 0.6096 153 0.9306 0.5506
0.8808 0.6215 156 0.9269 0.5506
1.0487 0.6335 159 0.9235 0.5529
0.9654 0.6454 162 0.9236 0.5529
0.9228 0.6574 165 0.9196 0.5598
0.8636 0.6693 168 0.9153 0.5586
0.7859 0.6813 171 0.9174 0.5609
0.7514 0.6932 174 0.9177 0.5598
0.7972 0.7052 177 0.9187 0.5655
0.8669 0.7171 180 0.9277 0.5471
0.8085 0.7291 183 0.9406 0.5437
0.9855 0.7410 186 0.9494 0.5391
0.9821 0.7530 189 0.9533 0.5437
0.8769 0.7649 192 0.9556 0.5448
0.7997 0.7769 195 0.9610 0.5448
0.9333 0.7888 198 0.9580 0.5471
0.9406 0.8008 201 0.9454 0.5437
0.7489 0.8127 204 0.9389 0.5425
0.7847 0.8247 207 0.9348 0.5414
0.7151 0.8367 210 0.9297 0.5425
0.8573 0.8486 213 0.9259 0.5483
0.88 0.8606 216 0.9238 0.5529
0.8127 0.8725 219 0.9221 0.5506
0.7742 0.8845 222 0.9203 0.5506
1.0872 0.8964 225 0.9198 0.5471
0.6082 0.9084 228 0.9196 0.5471
0.6126 0.9203 231 0.9206 0.5471
0.8127 0.9323 234 0.9217 0.5460
0.8886 0.9442 237 0.9221 0.5448
0.9518 0.9562 240 0.9222 0.5448
0.7686 0.9681 243 0.9223 0.5460
0.7965 0.9801 246 0.9224 0.5437
0.7674 0.9920 249 0.9224 0.5437

Framework versions

  • Transformers 4.40.1
  • Pytorch 2.3.0+cu121
  • Datasets 2.19.0
  • Tokenizers 0.19.1
Downloads last month
1
Safetensors
Model size
135M params
Tensor type
F32
·