End of training

Browse files

Files changed (4) hide show

README.md +8 -16
model.safetensors +1 -1
runs/May21_11-32-57_d6267c37c073/events.out.tfevents.1716291177.d6267c37c073.34.9 +2 -2
runs/May21_11-32-57_d6267c37c073/events.out.tfevents.1716291653.d6267c37c073.34.10 +3 -0

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ license: apache-2.0
 base_model: distilroberta-base
 tags:
 - generated_from_trainer
 model-index:
 - name: mask-langauge-modeling
   results: []
@@ -13,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 # mask-langauge-modeling
-This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4731
 ## Model description
@@ -34,30 +36,20 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 12
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8107        | 1.0   | 598  | 0.8552          |
-| 0.8572        | 2.0   | 1196 | 0.8273          |
-| 0.8655        | 3.0   | 1794 | 0.7705          |
-| 0.8348        | 4.0   | 2392 | 0.7156          |
-| 0.792         | 5.0   | 2990 | 0.6761          |
-| 0.701         | 6.0   | 3588 | 0.6323          |
-| 0.665         | 7.0   | 4186 | 0.6005          |
-| 0.6257        | 8.0   | 4784 | 0.5602          |
-| 0.5966        | 9.0   | 5382 | 0.5298          |
-| 0.5674        | 10.0  | 5980 | 0.5153          |
-| 0.5102        | 11.0  | 6578 | 0.4995          |
-| 0.4888        | 12.0  | 7176 | 0.4816          |
 ### Framework versions

 base_model: distilroberta-base
 tags:
 - generated_from_trainer
+datasets:
+- eli5_category
 model-index:
 - name: mask-langauge-modeling
   results: []
 # mask-langauge-modeling
+This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.8290
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0003
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.4564        | 1.0   | 721  | 2.8947          |
+| 2.2249        | 2.0   | 1442 | 2.7831          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e0024c16198a614afd482bbfa19a401556a1738b09ad41a5836ee2e97caca689
 size 328693404

 version https://git-lfs.github.com/spec/v1
+oid sha256:aa2a3efe4fc62901231a7e2fe019f0337de372544bdbfe4657746743e22c6b76
 size 328693404

runs/May21_11-32-57_d6267c37c073/events.out.tfevents.1716291177.d6267c37c073.34.9 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c0bb775168de425a47691102095ee30df54563412c026c863e8fa699d1bcb5b5
-size 5351

 version https://git-lfs.github.com/spec/v1
+oid sha256:56a0cdc82ba07c763136a16ac992cde8b5e3b4e58e8f8fc23b68995d4e2459e6
+size 5976

runs/May21_11-32-57_d6267c37c073/events.out.tfevents.1716291653.d6267c37c073.34.10 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:119eec85cb0d3bd8b2b622e02ae4b7fbc12a84b20e5b532a99eeb787890d313a
+size 359