habedi
/

deberta-v3-base-kaggle-mlm

@@ -1,21 +1,21 @@
 ---
 license: mit
-base_model: microsoft/deberta-v3-large
 tags:
 - generated_from_trainer
 model-index:
-- name: deberta-v3-large-kaggle-mlm
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# deberta-v3-large-kaggle-mlm
-This model is a fine-tuned version of [microsoft/deberta-v3-large](https://huggingface.co/microsoft/deberta-v3-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3182
 ## Model description
@@ -47,31 +47,31 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step   | Validation Loss |
 |:-------------:|:-----:|:------:|:---------------:|
-| 3.1114        | 1.0   | 6848   | 2.6616          |
-| 2.2122        | 2.0   | 13696  | 1.9734          |
-| 2.0848        | 3.0   | 20544  | 1.9930          |
-| 1.8056        | 4.0   | 27392  | 1.7167          |
-| 1.7003        | 5.0   | 34240  | 1.8419          |
-| 1.6414        | 6.0   | 41088  | 1.5828          |
-| 1.583         | 7.0   | 47936  | 1.5298          |
-| 1.5245        | 8.0   | 54784  | 1.4964          |
-| 1.491         | 9.0   | 61632  | 1.4671          |
-| 1.4662        | 10.0  | 68480  | 1.4805          |
-| 1.426         | 11.0  | 75328  | 1.4506          |
-| 1.3924        | 12.0  | 82176  | 1.4272          |
-| 1.3797        | 13.0  | 89024  | 1.4092          |
-| 1.3713        | 14.0  | 95872  | 1.3947          |
-| 1.3444        | 15.0  | 102720 | 1.3765          |
-| 1.3414        | 16.0  | 109568 | 1.3636          |
-| 1.3256        | 17.0  | 116416 | 1.3700          |
-| 1.3084        | 18.0  | 123264 | 1.3607          |
-| 1.2925        | 19.0  | 130112 | 1.3428          |
-| 1.2615        | 20.0  | 136960 | 1.3483          |
-| 1.2733        | 21.0  | 143808 | 1.3440          |
-| 1.2809        | 22.0  | 150656 | 1.3314          |
-| 1.2576        | 23.0  | 157504 | 1.3388          |
-| 1.2606        | 24.0  | 164352 | 1.3126          |
-| 1.2608        | 25.0  | 171200 | 1.3211          |
 ### Framework versions

 ---
 license: mit
+base_model: microsoft/deberta-v3-base
 tags:
 - generated_from_trainer
 model-index:
+- name: deberta-v3-base-kaggle-mlm
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# deberta-v3-base-kaggle-mlm
+This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5600
 ## Model description
 | Training Loss | Epoch | Step   | Validation Loss |
 |:-------------:|:-----:|:------:|:---------------:|
+| 3.1466        | 1.0   | 6848   | 2.9173          |
+| 2.6316        | 2.0   | 13696  | 2.4139          |
+| 2.3281        | 3.0   | 20544  | 2.2020          |
+| 2.2122        | 4.0   | 27392  | 2.0776          |
+| 2.0794        | 5.0   | 34240  | 1.9780          |
+| 2.0299        | 6.0   | 41088  | 1.8861          |
+| 1.9629        | 7.0   | 47936  | 1.8213          |
+| 1.9001        | 8.0   | 54784  | 1.7946          |
+| 1.8508        | 9.0   | 61632  | 1.7551          |
+| 1.8157        | 10.0  | 68480  | 1.7485          |
+| 1.7815        | 11.0  | 75328  | 1.7100          |
+| 1.7423        | 12.0  | 82176  | 1.6970          |
+| 1.7318        | 13.0  | 89024  | 1.6813          |
+| 1.7173        | 14.0  | 95872  | 1.6493          |
+| 1.6902        | 15.0  | 102720 | 1.6243          |
+| 1.7002        | 16.0  | 109568 | 1.6313          |
+| 1.6714        | 17.0  | 116416 | 1.6181          |
+| 1.6605        | 18.0  | 123264 | 1.6026          |
+| 1.6331        | 19.0  | 130112 | 1.5825          |
+| 1.6143        | 20.0  | 136960 | 1.5903          |
+| 1.6136        | 21.0  | 143808 | 1.5812          |
+| 1.6151        | 22.0  | 150656 | 1.5708          |
+| 1.6122        | 23.0  | 157504 | 1.5806          |
+| 1.6025        | 24.0  | 164352 | 1.5492          |
+| 1.614         | 25.0  | 171200 | 1.5555          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b17eeca88dc394f53e14543ff317e3f8addc70260b100ce95ae44e47e5afdba3
 size 738231856

 version https://git-lfs.github.com/spec/v1
+oid sha256:1638df123a9d123c59fa8a035fd1ee59d42152de139f3043e40658c8667362ba
 size 738231856