Shakhovak/flan-t5-base-absa-joint

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1955
+- Loss: 0.2208
 ## Model description
@@ -41,36 +41,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 5
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.0955        | 0.26  | 100  | 0.6148          |
-| 0.6658        | 0.52  | 200  | 0.3579          |
-| 0.4605        | 0.79  | 300  | 0.2902          |
-| 0.4011        | 1.05  | 400  | 0.2537          |
-| 0.3489        | 1.31  | 500  | 0.2344          |
-| 0.3543        | 1.57  | 600  | 0.2339          |
-| 0.3223        | 1.84  | 700  | 0.2214          |
-| 0.2887        | 2.1   | 800  | 0.2132          |
-| 0.2823        | 2.36  | 900  | 0.2036          |
-| 0.2691        | 2.62  | 1000 | 0.2044          |
-| 0.2615        | 2.89  | 1100 | 0.1978          |
-| 0.2429        | 3.15  | 1200 | 0.1966          |
-| 0.2253        | 3.41  | 1300 | 0.1975          |
-| 0.2274        | 3.67  | 1400 | 0.1916          |
-| 0.2354        | 3.94  | 1500 | 0.1925          |
-| 0.2025        | 4.2   | 1600 | 0.1956          |
-| 0.1988        | 4.46  | 1700 | 0.1956          |
-| 0.2216        | 4.72  | 1800 | 0.1962          |
-| 0.2082        | 4.99  | 1900 | 0.1955          |
+| 1.6993        | 0.45  | 200  | 0.3259          |
+| 0.4165        | 0.9   | 400  | 0.2624          |
+| 0.3401        | 1.35  | 600  | 0.2353          |
+| 0.2937        | 1.8   | 800  | 0.2160          |
+| 0.2655        | 2.25  | 1000 | 0.2244          |
+| 0.2427        | 2.7   | 1200 | 0.2208          |
 ### Framework versions
 - Transformers 4.38.2
-- Pytorch 2.0.1+cu118
+- Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2
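
The reported evaluation loss of 0.2208 is the last row of the new training-results table, not its minimum. The sketch below copies the added rows into a list and looks up the row with the lowest validation loss. It also estimates the number of optimizer steps per epoch, assuming the Epoch column is the fraction of passes completed at that step. The helper names `best_checkpoint` and `steps_per_epoch` are hypothetical and not part of the repository.

```python
# Validation log copied from the added rows of the training-results table.
# Each tuple is (training_loss, epoch, step, validation_loss).
NEW_RUN = [
    (1.6993, 0.45, 200, 0.3259),
    (0.4165, 0.90, 400, 0.2624),
    (0.3401, 1.35, 600, 0.2353),
    (0.2937, 1.80, 800, 0.2160),
    (0.2655, 2.25, 1000, 0.2244),
    (0.2427, 2.70, 1200, 0.2208),
]


def best_checkpoint(rows):
    """Return the logged row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row.

    Assumption: the epoch value is the fraction of passes completed at that step.
    """
    _, epoch, step, _ = rows[-1]
    return step / epoch


best = best_checkpoint(NEW_RUN)
final = NEW_RUN[-1]
print(f"best:  step {best[2]}, validation loss {best[3]:.4f}")
print(f"final: step {final[2]}, validation loss {final[3]:.4f}")
print(f"approx. steps per epoch: {steps_per_epoch(NEW_RUN):.0f}")
```

In this log the lowest validation loss (0.2160) is recorded at step 800, while the final logged row at step 1200 gives the 0.2208 shown in the card. Whether an earlier checkpoint would be preferable depends on task metrics that the card does not report.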

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:123e5660ee007ebf3b7f40c4cde40d578ae4d370ed8b523f312419163fac4923
+oid sha256:b019a40f9f32687fd74cba0bf640d3da7fad44cd4750ed03c2389a69a3501c60
 size 990345064
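
The file tracked here is a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 digest of the real file, and its size in bytes. Only the digest changes in this commit. The size stays at 990345064, which is consistent with retrained weights for an unchanged architecture. A minimal sketch of checking a downloaded file against such a pointer follows. The function names and the local path in the usage note are assumptions for illustration, not part of Git LFS or of this repository.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer into a dict with 'version', 'oid', and 'size'."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return {"version": fields["version"], "oid": digest, "size": int(fields["size"])}


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check a local file's size and SHA-256 digest against a parsed pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap size check first, before hashing the whole file
    sha = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["oid"]


# Pointer for the new model.safetensors, copied from this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b019a40f9f32687fd74cba0bf640d3da7fad44cd4750ed03c2389a69a3501c60
size 990345064
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["oid"][:12], pointer["size"])
```

With the weights downloaded to a local path (hypothetical here), `matches_pointer("model.safetensors", pointer)` would return `True` only if both the size and the digest agree.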

tokenizer.json CHANGED

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 88,
+    "max_length": 89,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 88
+      "Fixed": 89
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

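The truncation and padding settings above mean every encoded sequence comes out at exactly 89 token ids: longer inputs lose their tail, and shorter ones are filled on the right. The pure-Python sketch below illustrates that behavior without depending on the `tokenizers` library. It is a simplification: it ignores special tokens such as the end-of-sequence marker, and it assumes a pad id of 0, the usual value for T5-family tokenizers but not shown in this diff. The function name `truncate_and_pad` is hypothetical.

```python
MAX_LENGTH = 89  # "max_length" and "Fixed" value in the new tokenizer.json
PAD_ID = 0       # assumption: T5-family tokenizers use id 0 for the pad token


def truncate_and_pad(token_ids, max_length=MAX_LENGTH, pad_id=PAD_ID):
    """Right-truncate to max_length, then right-pad to exactly max_length.

    Returns (input_ids, attention_mask), both of length max_length.
    """
    kept = list(token_ids[:max_length])      # right truncation drops the tail
    n_pad = max_length - len(kept)
    input_ids = kept + [pad_id] * n_pad      # right padding appends pad ids
    attention_mask = [1] * len(kept) + [0] * n_pad
    return input_ids, attention_mask


short_ids, short_mask = truncate_and_pad([5, 6, 7])
long_ids, long_mask = truncate_and_pad(list(range(1, 101)))
print(len(short_ids), sum(short_mask))  # padded length, real-token count
print(len(long_ids), sum(long_mask))
```

Because the saved configuration pads to a fixed length rather than to the longest sequence in a batch, short inputs carry the full 89-token cost at inference unless padding is overridden when the tokenizer is loaded.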
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ae06d4a295447b79162c0e09b99d069c6c7691c73ba2f1c07b2843d84064acd0
-size 4667
+oid sha256:0367e46bf3e8bea1456b28a3f9b13dcbf6d25654b455fd12a1d630b7db7d5438
+size 5112