Edit model card

edyfjm07/distilbert-base-uncased-QA1-finetuned-squad-es

This model is a fine-tuned version of distilbert-base-uncased on an unknown dataset. It achieves the following results on the evaluation set:

  • Train Loss: 0.2131
  • Train End Logits Accuracy: 0.9224
  • Train Start Logits Accuracy: 0.9310
  • Validation Loss: 1.0588
  • Validation End Logits Accuracy: 0.8088
  • Validation Start Logits Accuracy: 0.8150
  • Epoch: 50

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 1e-05, 'decay_steps': 1479, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
  • training_precision: float32

Training results

Train Loss Train End Logits Accuracy Train Start Logits Accuracy Validation Loss Validation End Logits Accuracy Validation Start Logits Accuracy Epoch
5.1787 0.0571 0.0496 4.3181 0.1724 0.1818 0
3.6307 0.25 0.1810 2.8944 0.3793 0.2476 1
2.5094 0.3998 0.3147 2.1436 0.4514 0.3793 2
1.9078 0.4871 0.4397 1.7322 0.5204 0.5705 3
1.5135 0.5593 0.5700 1.4332 0.6050 0.6238 4
1.2802 0.5927 0.6013 1.3274 0.6270 0.6364 5
1.1079 0.6595 0.6455 1.2126 0.6520 0.6865 6
0.9827 0.6843 0.7069 1.1469 0.7116 0.7116 7
0.8810 0.7306 0.7371 1.0859 0.7116 0.7053 8
0.8194 0.7349 0.7446 1.0339 0.7429 0.7492 9
0.7245 0.7403 0.7877 1.0371 0.7304 0.7398 10
0.6827 0.7683 0.7856 1.0185 0.7492 0.7461 11
0.6421 0.7866 0.8071 1.0298 0.7492 0.7555 12
0.5949 0.8006 0.8050 0.9877 0.7586 0.7774 13
0.5471 0.8125 0.8244 0.9933 0.7398 0.7774 14
0.5119 0.8233 0.8362 0.9956 0.7524 0.7837 15
0.4916 0.8330 0.8599 0.9917 0.7398 0.8025 16
0.4521 0.8373 0.8836 0.9698 0.7680 0.7868 17
0.4424 0.8459 0.8696 0.9951 0.7712 0.8025 18
0.3928 0.8599 0.8966 1.0173 0.7618 0.7931 19
0.3874 0.8578 0.8922 1.0307 0.7649 0.7931 20
0.3822 0.8588 0.8901 1.0272 0.7680 0.7900 21
0.3859 0.8524 0.8879 1.0180 0.7555 0.7962 22
0.3672 0.8524 0.8836 1.0040 0.7837 0.7994 23
0.3409 0.8675 0.8825 1.0242 0.7900 0.8088 24
0.3564 0.8610 0.8869 1.0257 0.7900 0.7900 25
0.3324 0.8578 0.9041 1.0227 0.7837 0.8088 26
0.3066 0.8858 0.9159 1.0243 0.7900 0.8025 27
0.3026 0.8804 0.9084 1.0224 0.7774 0.8088 28
0.2896 0.8879 0.9009 1.0324 0.7649 0.8182 29
0.2710 0.8998 0.9106 1.0458 0.7868 0.8088 30
0.2727 0.8933 0.9213 1.0483 0.7806 0.7931 31
0.2728 0.8976 0.9062 1.0459 0.7868 0.8088 32
0.2780 0.8847 0.9073 1.0595 0.7962 0.8056 33
0.2641 0.8955 0.9138 1.0503 0.7868 0.8025 34
0.2611 0.9009 0.9203 1.0458 0.8025 0.7962 35
0.2502 0.9030 0.9203 1.0621 0.8025 0.8025 36
0.2655 0.8804 0.9213 1.0478 0.7994 0.7994 37
0.2434 0.9084 0.9181 1.0491 0.8025 0.7994 38
0.2409 0.9149 0.9224 1.0452 0.8025 0.8088 39
0.2271 0.9181 0.9246 1.0487 0.7962 0.8119 40
0.2288 0.9332 0.9149 1.0579 0.8056 0.8056 41
0.2444 0.9127 0.9127 1.0522 0.8056 0.8119 42
0.2145 0.9235 0.9300 1.0584 0.8025 0.8088 43
0.2264 0.9073 0.9289 1.0520 0.8025 0.8119 44
0.2120 0.9213 0.9429 1.0591 0.8119 0.8088 45
0.2280 0.9127 0.9235 1.0538 0.8088 0.8056 46
0.2166 0.9116 0.9203 1.0554 0.8088 0.8088 47
0.2184 0.9138 0.9397 1.0568 0.8119 0.8088 48
0.2087 0.9106 0.9375 1.0588 0.8088 0.8150 49
0.2131 0.9224 0.9310 1.0588 0.8088 0.8150 50

Framework versions

  • Transformers 4.41.2
  • TensorFlow 2.15.0
  • Datasets 2.20.0
  • Tokenizers 0.19.1
Downloads last month
311
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Finetuned from

Dataset used to train edyfjm07/distilbert-base-uncased-QA1-finetuned-squad-es