phnghiapro
/

distilbert-base-uncased-distilled-clinc

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.9487096774193549
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the clinc_oos dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1837
-- Accuracy: 0.9487
 ## Model description
@@ -53,8 +53,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0004
-- train_batch_size: 1024
-- eval_batch_size: 1024
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -64,16 +64,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 1.9956        | 1.0   | 15   | 1.1757          | 0.6987   |
-| 0.636         | 2.0   | 30   | 0.4093          | 0.9032   |
-| 0.3301        | 3.0   | 45   | 0.2476          | 0.9384   |
-| 0.1833        | 4.0   | 60   | 0.2117          | 0.9465   |
-| 0.1628        | 5.0   | 75   | 0.1992          | 0.9468   |
-| 0.1458        | 6.0   | 90   | 0.1910          | 0.9484   |
-| 0.1402        | 7.0   | 105  | 0.1858          | 0.9481   |
-| 0.1362        | 8.0   | 120  | 0.1850          | 0.9477   |
-| 0.1344        | 9.0   | 135  | 0.1838          | 0.9484   |
-| 0.1336        | 10.0  | 150  | 0.1837          | 0.9487   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.9490322580645161
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the clinc_oos dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1852
+- Accuracy: 0.9490
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0004
+- train_batch_size: 1280
+- eval_batch_size: 1280
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 1.9692        | 1.0   | 12   | 1.3486          | 0.6574   |
+| 1.1867        | 2.0   | 24   | 0.5409          | 0.8884   |
+| 0.5614        | 3.0   | 36   | 0.2845          | 0.9387   |
+| 0.295         | 4.0   | 48   | 0.2234          | 0.9471   |
+| 0.1729        | 5.0   | 60   | 0.2021          | 0.9487   |
+| 0.1574        | 6.0   | 72   | 0.1942          | 0.9513   |
+| 0.1477        | 7.0   | 84   | 0.1895          | 0.9510   |
+| 0.1446        | 8.0   | 96   | 0.1870          | 0.9497   |
+| 0.1405        | 9.0   | 108  | 0.1856          | 0.9494   |
+| 0.1382        | 10.0  | 120  | 0.1852          | 0.9490   |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:86b8472c1199bc82f252ac7dc0bc6171b04e5ebd37940d3889c3bedfa29d5770
 size 268313837

 version https://git-lfs.github.com/spec/v1
+oid sha256:2b9fe72b73c0856132163dc1ba24d4a8890afee476d63a92034868b0fcc0d6c8
 size 268313837

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a29501d006c8c0ec229e0af501a7d43eba09ac9a90bc051a1ad64a63e8595fd
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:401be471130125cbacf03594b906e256d79765a934d4482feadbd02d0c3754e0
 size 4155