EricPeter/distilbert-base-cased-distilled-squad-finetuned-squad

+---
+license: apache-2.0
+base_model: distilbert-base-cased-distilled-squad
+tags:
+- generated_from_trainer
+model-index:
+- name: distilbert-base-cased-distilled-squad-finetuned-squad
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# distilbert-base-cased-distilled-squad-finetuned-squad
+This model is a fine-tuned version of [distilbert-base-cased-distilled-squad](https://huggingface.co/distilbert-base-cased-distilled-squad) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 5.3012
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 30
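As a rough illustration of what `lr_scheduler_type: linear` implies for the learning rate over the run, the sketch below evaluates a linear decay schedule at a few steps. It assumes zero warmup steps (the card lists no warmup setting) and a total of 1980 optimizer steps (the final step in the training results table). `linear_lr` is a hypothetical helper written for this note, not a Transformers function.

```python
BASE_LR = 2e-05      # learning_rate from the card
TOTAL_STEPS = 1980   # final step in the results table (30 epochs x 66 steps)
WARMUP_STEPS = 0     # ASSUMPTION: the card does not list a warmup setting


def linear_lr(step, base_lr=BASE_LR, total_steps=TOTAL_STEPS, warmup_steps=WARMUP_STEPS):
    """Hypothetical helper: linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 66, 990, 1980):
        print(f"step {step:4d}: lr = {linear_lr(step):.3e}")
```

Under these assumptions the rate is halved by the midpoint of training (step 990) and reaches zero at the last step.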
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 66   | 2.4569          |
+| No log        | 2.0   | 132  | 2.3201          |
+| No log        | 3.0   | 198  | 2.3281          |
+| No log        | 4.0   | 264  | 2.3895          |
+| No log        | 5.0   | 330  | 2.7032          |
+| No log        | 6.0   | 396  | 2.9461          |
+| No log        | 7.0   | 462  | 3.2856          |
+| 1.4273        | 8.0   | 528  | 3.4643          |
+| 1.4273        | 9.0   | 594  | 3.7349          |
+| 1.4273        | 10.0  | 660  | 4.1377          |
+| 1.4273        | 11.0  | 726  | 4.2097          |
+| 1.4273        | 12.0  | 792  | 4.3009          |
+| 1.4273        | 13.0  | 858  | 4.4249          |
+| 1.4273        | 14.0  | 924  | 4.5736          |
+| 1.4273        | 15.0  | 990  | 4.7452          |
+| 0.222         | 16.0  | 1056 | 4.8124          |
+| 0.222         | 17.0  | 1122 | 4.8169          |
+| 0.222         | 18.0  | 1188 | 5.1680          |
+| 0.222         | 19.0  | 1254 | 5.0830          |
+| 0.222         | 20.0  | 1320 | 5.1588          |
+| 0.222         | 21.0  | 1386 | 5.1933          |
+| 0.222         | 22.0  | 1452 | 5.2828          |
+| 0.0343        | 23.0  | 1518 | 5.2054          |
+| 0.0343        | 24.0  | 1584 | 5.2622          |
+| 0.0343        | 25.0  | 1650 | 5.2089          |
+| 0.0343        | 26.0  | 1716 | 5.2026          |
+| 0.0343        | 27.0  | 1782 | 5.3382          |
+| 0.0343        | 28.0  | 1848 | 5.2926          |
+| 0.0343        | 29.0  | 1914 | 5.3003          |
+| 0.0343        | 30.0  | 1980 | 5.3012          |
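The table shows validation loss reaching its minimum early and then rising while training loss keeps falling. The sketch below makes that concrete. It copies the validation losses from the table, finds the epoch with the lowest value, and shows where a simple patience-based early-stopping rule would have halted. The rule and `patience=3` are illustrative assumptions. Nothing in the card indicates that early stopping was used in this run.

```python
# Validation loss per epoch, copied from the table above (epochs 1..30).
VAL_LOSS = [
    2.4569, 2.3201, 2.3281, 2.3895, 2.7032, 2.9461, 3.2856, 3.4643, 3.7349, 4.1377,
    4.2097, 4.3009, 4.4249, 4.5736, 4.7452, 4.8124, 4.8169, 5.1680, 5.0830, 5.1588,
    5.1933, 5.2828, 5.2054, 5.2622, 5.2089, 5.2026, 5.3382, 5.2926, 5.3003, 5.3012,
]


def best_epoch(losses):
    """Return (1-based epoch, loss) for the lowest validation loss."""
    idx = min(range(len(losses)), key=losses.__getitem__)
    return idx + 1, losses[idx]


def early_stop_epoch(losses, patience=3):
    """Epoch at which a patience-based rule would stop (hypothetical rule)."""
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(losses, start=1):
        if loss < best:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return len(losses)


if __name__ == "__main__":
    epoch, loss = best_epoch(VAL_LOSS)
    print(f"lowest validation loss {loss} at epoch {epoch}")
    print(f"patience=3 would stop at epoch {early_stop_epoch(VAL_LOSS)}")
```

The loss reported at the top of the card (5.3012) is the epoch 30 value, not the lowest value in the table.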
+### Framework versions
+- Transformers 4.33.0
+- Pytorch 2.0.1+cu118
+- Datasets 2.14.4
+- Tokenizers 0.13.3

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:708e97c655f942b70b9f0353a8aa0d596814da9ca1f5267358b1cda3487e1f13
+oid sha256:d051b864c51851b2cb40dcab4a9bf0ab530aa607cdfb0c986222ccef91f47e0c
 size 260804645
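The `pytorch_model.bin` change above is a Git LFS pointer update: the size is unchanged and only the SHA-256 `oid` differs. A minimal sketch of checking a downloaded file against such a pointer, using only the Python standard library, is shown below. The helper names and the local path in the usage comment are hypothetical.

```python
import hashlib

# New pointer contents, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d051b864c51851b2cb40dcab4a9bf0ab530aa607cdfb0c986222ccef91f47e0c
size 260804645
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file's size and hash both match the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Usage (hypothetical local path):
# pointer = parse_lfs_pointer(NEW_POINTER)
# print(file_matches_pointer("pytorch_model.bin", pointer))
```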