Vishnou
/

distilbert_base_SST2

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.8967889908256881
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the sst2 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3718
-- Accuracy: 0.8968
 ## Model description
@@ -58,28 +58,45 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.4401        | 0.06  | 500  | 0.3732          | 0.8417   |
-| 0.3465        | 0.12  | 1000 | 0.3451          | 0.8761   |
-| 0.3408        | 0.18  | 1500 | 0.3465          | 0.8612   |
-| 0.3059        | 0.24  | 2000 | 0.4580          | 0.8589   |
-| 0.2974        | 0.3   | 2500 | 0.3338          | 0.8830   |
-| 0.3023        | 0.36  | 3000 | 0.4487          | 0.8475   |
-| 0.2636        | 0.42  | 3500 | 0.4330          | 0.8888   |
-| 0.2558        | 0.48  | 4000 | 0.4193          | 0.8876   |
-| 0.2466        | 0.53  | 4500 | 0.4343          | 0.8945   |
-| 0.2393        | 0.59  | 5000 | 0.4935          | 0.875    |
-| 0.2279        | 0.65  | 5500 | 0.3818          | 0.8876   |
-| 0.2181        | 0.71  | 6000 | 0.4003          | 0.8968   |
-| 0.2245        | 0.77  | 6500 | 0.3496          | 0.9048   |
-| 0.2279        | 0.83  | 7000 | 0.3759          | 0.9014   |
-| 0.2152        | 0.89  | 7500 | 0.4037          | 0.8899   |
-| 0.2174        | 0.95  | 8000 | 0.3718          | 0.8968   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.8990825688073395
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the sst2 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4181
+- Accuracy: 0.8991
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 2
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.4378        | 0.06  | 500   | 0.3452          | 0.8601   |
+| 0.343         | 0.12  | 1000  | 0.3483          | 0.8578   |
+| 0.3342        | 0.18  | 1500  | 0.3373          | 0.8704   |
+| 0.308         | 0.24  | 2000  | 0.4102          | 0.8819   |
+| 0.2932        | 0.3   | 2500  | 0.3546          | 0.8830   |
+| 0.3116        | 0.36  | 3000  | 0.3609          | 0.8716   |
+| 0.2805        | 0.42  | 3500  | 0.3800          | 0.8945   |
+| 0.2655        | 0.48  | 4000  | 0.4131          | 0.8842   |
+| 0.2504        | 0.53  | 4500  | 0.4299          | 0.8830   |
+| 0.2543        | 0.59  | 5000  | 0.5196          | 0.8727   |
+| 0.2409        | 0.65  | 5500  | 0.4387          | 0.8807   |
+| 0.2414        | 0.71  | 6000  | 0.4121          | 0.8922   |
+| 0.2319        | 0.77  | 6500  | 0.3772          | 0.8830   |
+| 0.247         | 0.83  | 7000  | 0.4179          | 0.8876   |
+| 0.2233        | 0.89  | 7500  | 0.3544          | 0.8945   |
+| 0.2202        | 0.95  | 8000  | 0.4160          | 0.8865   |
+| 0.2242        | 1.01  | 8500  | 0.5125          | 0.8784   |
+| 0.1296        | 1.07  | 9000  | 0.4212          | 0.8842   |
+| 0.1429        | 1.13  | 9500  | 0.4675          | 0.8968   |
+| 0.1466        | 1.19  | 10000 | 0.5034          | 0.8922   |
+| 0.1626        | 1.25  | 10500 | 0.4431          | 0.8945   |
+| 0.1459        | 1.31  | 11000 | 0.5001          | 0.8922   |
+| 0.1489        | 1.37  | 11500 | 0.4739          | 0.8968   |
+| 0.1591        | 1.43  | 12000 | 0.3852          | 0.8945   |
+| 0.1211        | 1.48  | 12500 | 0.4648          | 0.8945   |
+| 0.1275        | 1.54  | 13000 | 0.5281          | 0.8956   |
+| 0.1302        | 1.6   | 13500 | 0.4411          | 0.8933   |
+| 0.1313        | 1.66  | 14000 | 0.4914          | 0.8979   |
+| 0.134         | 1.72  | 14500 | 0.3923          | 0.8979   |
+| 0.1355        | 1.78  | 15000 | 0.4164          | 0.8956   |
+| 0.1263        | 1.84  | 15500 | 0.4293          | 0.8945   |
+| 0.1326        | 1.9   | 16000 | 0.4185          | 0.8933   |
+| 0.1315        | 1.96  | 16500 | 0.4181          | 0.8991   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:71800c81e9ed4dfa2a362921f332c40f36c2f8be9da06d83c8901d6cad86bb27
 size 267832560

 version https://git-lfs.github.com/spec/v1
+oid sha256:8fcc59aace05879dd0138156289730cf5fff405afe418bfa1231991af7db4c37
 size 267832560