judy93536/distilroberta-rbm231k-ep20-op40-phrase5k

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [judy93536/distilroberta-rbm231k-ep20-op40](https://huggingface.co/judy93536/distilroberta-rbm231k-ep20-op40) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1681
-- Accuracy: 0.9560
 ## Model description
@@ -44,26 +44,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.28
-- num_epochs: 13
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 250  | 0.2748          | 0.9119   |
-| 0.2834        | 2.0   | 500  | 0.2418          | 0.9219   |
-| 0.2834        | 3.0   | 750  | 0.2013          | 0.9329   |
-| 0.2102        | 4.0   | 1000 | 0.1839          | 0.9389   |
-| 0.2102        | 5.0   | 1250 | 0.1848          | 0.9419   |
-| 0.1539        | 6.0   | 1500 | 0.1658          | 0.9469   |
-| 0.1539        | 7.0   | 1750 | 0.1685          | 0.9469   |
-| 0.1211        | 8.0   | 2000 | 0.1577          | 0.9550   |
-| 0.1211        | 9.0   | 2250 | 0.1625          | 0.9540   |
-| 0.1122        | 10.0  | 2500 | 0.1694          | 0.9520   |
-| 0.1122        | 11.0  | 2750 | 0.1583          | 0.9570   |
-| 0.1055        | 12.0  | 3000 | 0.1651          | 0.9570   |
-| 0.1055        | 13.0  | 3250 | 0.1681          | 0.9560   |
 ### Framework versions

 This model is a fine-tuned version of [judy93536/distilroberta-rbm231k-ep20-op40](https://huggingface.co/judy93536/distilroberta-rbm231k-ep20-op40) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1735
+- Accuracy: 0.9459
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.28
+- num_epochs: 20
 - mixed_precision_training: Native AMP
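
For reference, a linear schedule with a 0.28 warmup ratio can be sketched as a multiplier on the base learning rate. This is a minimal sketch under stated assumptions, not the training code itself: `TOTAL_STEPS = 5000` is read off the last row of the 20-epoch results table, `lr_multiplier` is a hypothetical helper name, and the warmup length is rounded to the nearest step, which may differ by one step from what the run actually used.

```python
TOTAL_STEPS = 5000      # assumption: final step in the 20-epoch results table
WARMUP_RATIO = 0.28     # lr_scheduler_warmup_ratio from the hyperparameter list

# Assumption: round to the nearest whole step (1400 for these values).
WARMUP_STEPS = round(TOTAL_STEPS * WARMUP_RATIO)


def lr_multiplier(step: int) -> float:
    """Factor applied to the base learning rate at a given optimizer step."""
    if step < WARMUP_STEPS:
        # Linear ramp from 0 up to 1 during warmup.
        return step / max(1, WARMUP_STEPS)
    # Linear decay from 1 down to 0 over the remaining steps.
    return max(0.0, (TOTAL_STEPS - step) / max(1, TOTAL_STEPS - WARMUP_STEPS))


if __name__ == "__main__":
    for step in (0, 700, 1400, 3200, 5000):
        print(step, lr_multiplier(step))
```

Under these assumptions warmup ends near step 1400, which falls between the epoch 5 (step 1250) and epoch 6 (step 1500) rows of the results table.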
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 250  | 1.0641          | 0.6156   |
+| 1.0561        | 2.0   | 500  | 0.9262          | 0.6136   |
+| 1.0561        | 3.0   | 750  | 0.8124          | 0.6136   |
+| 0.8474        | 4.0   | 1000 | 0.7271          | 0.6476   |
+| 0.8474        | 5.0   | 1250 | 0.6283          | 0.7237   |
+| 0.6495        | 6.0   | 1500 | 0.5066          | 0.7758   |
+| 0.6495        | 7.0   | 1750 | 0.3436          | 0.8979   |
+| 0.3657        | 8.0   | 2000 | 0.2598          | 0.9189   |
+| 0.3657        | 9.0   | 2250 | 0.2117          | 0.9299   |
+| 0.2023        | 10.0  | 2500 | 0.2022          | 0.9279   |
+| 0.2023        | 11.0  | 2750 | 0.1787          | 0.9329   |
+| 0.1518        | 12.0  | 3000 | 0.1632          | 0.9429   |
+| 0.1518        | 13.0  | 3250 | 0.1883          | 0.9369   |
+| 0.1324        | 14.0  | 3500 | 0.1699          | 0.9409   |
+| 0.1324        | 15.0  | 3750 | 0.1704          | 0.9439   |
+| 0.1225        | 16.0  | 4000 | 0.1755          | 0.9429   |
+| 0.1225        | 17.0  | 4250 | 0.1743          | 0.9439   |
+| 0.119         | 18.0  | 4500 | 0.1726          | 0.9459   |
+| 0.119         | 19.0  | 4750 | 0.1723          | 0.9459   |
+| 0.1145        | 20.0  | 5000 | 0.1735          | 0.9459   |
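
To see which checkpoint the table above would favor, the rows can be scanned for the lowest validation loss and the highest accuracy. The snippet below is only an illustrative sketch: the values are copied by hand from the 20-epoch table, and `best_by_loss` and `best_by_accuracy` are hypothetical helper names, not part of the model card or any library.

```python
# (epoch, validation_loss, accuracy) copied from the 20-epoch results table.
RESULTS = [
    (1, 1.0641, 0.6156), (2, 0.9262, 0.6136), (3, 0.8124, 0.6136),
    (4, 0.7271, 0.6476), (5, 0.6283, 0.7237), (6, 0.5066, 0.7758),
    (7, 0.3436, 0.8979), (8, 0.2598, 0.9189), (9, 0.2117, 0.9299),
    (10, 0.2022, 0.9279), (11, 0.1787, 0.9329), (12, 0.1632, 0.9429),
    (13, 0.1883, 0.9369), (14, 0.1699, 0.9409), (15, 0.1704, 0.9439),
    (16, 0.1755, 0.9429), (17, 0.1743, 0.9439), (18, 0.1726, 0.9459),
    (19, 0.1723, 0.9459), (20, 0.1735, 0.9459),
]


def best_by_loss(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])


def best_by_accuracy(rows):
    """Return the first row that reaches the highest accuracy."""
    return max(rows, key=lambda row: row[2])


if __name__ == "__main__":
    print("lowest validation loss:", best_by_loss(RESULTS))
    print("highest accuracy:", best_by_accuracy(RESULTS))
```

On these rows the lowest validation loss is at epoch 12 (0.1632), while the reported evaluation metrics (loss 0.1735, accuracy 0.9459) correspond to the final epoch 20 row.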
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:da4677c484c5c801fd26d9ad970a6e2114c00da106f02ffd6a47d275e3135976
 size 328495356

 version https://git-lfs.github.com/spec/v1
+oid sha256:9e4b93fe7e1e17368337f400f133771e7e5f8e65350d8a350ed3ce72cfc91a1b
 size 328495356
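
The `model.safetensors` change above only touches a Git LFS pointer file, which is a few `key value` lines. As a rough sketch of how such a pointer could be read, the snippet below splits it into fields; `parse_lfs_pointer` is a hypothetical helper written for illustration and is not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New-side pointer copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9e4b93fe7e1e17368337f400f133771e7e5f8e65350d8a350ed3ce72cfc91a1b
size 328495356
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(NEW_POINTER))
```

Both pointers report the same size (328495356 bytes) and differ only in the oid, which is what one would expect when the weights are retrained without changing the architecture.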