SvetlanaKayfajian/ss-mt5

Files changed (7) hide show

README.md CHANGED Viewed

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2346
-- Rouge1: 49.856
-- Rouge2: 40.9224
-- Rougel: 49.5484
-- Rougelsum: 49.559
-- Gen Len: 15.1417
 ## Model description
@@ -41,22 +41,19 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| 1.6216        | 1.0   | 1600 | 1.3788          | 49.0991 | 39.9787 | 48.8099 | 48.8174   | 15.369  |
-| 1.6391        | 2.0   | 3200 | 1.2671          | 49.5863 | 40.5963 | 49.2822 | 49.3027   | 15.2092 |
-| 1.638         | 3.0   | 4800 | 1.2590          | 49.7134 | 40.7759 | 49.4037 | 49.4228   | 15.2206 |
-| 1.3495        | 4.0   | 6400 | 1.2346          | 49.856  | 40.9224 | 49.5484 | 49.559    | 15.1417 |
 ### Framework versions

 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.3371
+- Rouge1: 1.0079
+- Rouge2: 0.0016
+- Rougel: 1.0081
+- Rougelsum: 1.0084
+- Gen Len: 19.0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.1
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 6.5888        | 1.0   | 1740 | 6.3371          | 1.0079 | 0.0016 | 1.0081 | 1.0084    | 19.0    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1526baef9d5b6dd1cf7980be5bef638177693b7e58da679e34ee011084ce05fb
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ab69dc12b5e720622a8f406884dc3e65cb5f2a098ea39271a2da602647ab95c
 size 1200729512

runs/May14_21-43-31_e451fd3006c6/events.out.tfevents.1715723020.e451fd3006c6.2660.1 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bc44b4ab3d6aaf51f35942088b5d95bb431b14b68c05a337a86a1ffed5dee3e1
-size 164537

 version https://git-lfs.github.com/spec/v1
+oid sha256:08ebb2e549b904014294c6af618a017cf11906742b5220abdb9aad782e141d5b
+size 189013

runs/May14_22-21-30_e451fd3006c6/events.out.tfevents.1715725299.e451fd3006c6.2660.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:70857e34385064fbb6a326ade2ccfc383dfdfad038bbf1b6061f34b3cc24d5d1
+size 372552

runs/May14_22-55-50_e451fd3006c6/events.out.tfevents.1715727362.e451fd3006c6.2660.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:02c695b3e2c3da695a76acb46a893d433d916960f24056ad3617ef85a0c5b694
+size 372522

runs/May14_23-14-28_e451fd3006c6/events.out.tfevents.1715728478.e451fd3006c6.2660.4 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c6c0c6828dc727a93ffd10db2f168e7f71af8fe9cc68751027696f5a3b061262
+size 372550

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cba95c61c704fd7fff325b70584f98f2d911b16def871a42876851742a0f9b9b
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:0ac3b1c875c9284f1ef7d3a9e101ee0e2c0513cf4d276c84499643b3e7a346f0
 size 5176