callmesan
/

indic-sentence-bert-nli-roman-urdu-binary

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [l3cube-pune/indic-sentence-bert-nli](https://huggingface.co/l3cube-pune/indic-sentence-bert-nli) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4274
-- Accuracy: 0.8852
-- Precision: 0.8845
-- Recall: 0.8859
-- F1: 0.8849
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 32
 - eval_batch_size: 128
 - seed: 42
@@ -52,17 +52,22 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.6197        | 0.9912 | 56   | 0.5759          | 0.8414   | 0.8449    | 0.8375 | 0.8393 |
-| 0.4974        | 2.0    | 113  | 0.4997          | 0.8452   | 0.8517    | 0.8501 | 0.8452 |
-| 0.4404        | 2.9912 | 169  | 0.4445          | 0.8714   | 0.8711    | 0.8704 | 0.8707 |
-| 0.4106        | 4.0    | 226  | 0.4246          | 0.8664   | 0.8657    | 0.8660 | 0.8659 |
-| 0.392         | 4.9558 | 280  | 0.4196          | 0.8664   | 0.8664    | 0.8681 | 0.8663 |
 ### Framework versions

 This model is a fine-tuned version of [l3cube-pune/indic-sentence-bert-nli](https://huggingface.co/l3cube-pune/indic-sentence-bert-nli) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2789
+- Accuracy: 0.9061
+- Precision: 0.9058
+- Recall: 0.9055
+- F1: 0.9057
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 32
 - eval_batch_size: 128
 - seed: 42
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.4984        | 0.9912 | 56   | 0.4611          | 0.8452   | 0.8486    | 0.8489 | 0.8452 |
+| 0.3582        | 2.0    | 113  | 0.3373          | 0.8826   | 0.8843    | 0.8802 | 0.8816 |
+| 0.2724        | 2.9912 | 169  | 0.2869          | 0.8901   | 0.8894    | 0.8901 | 0.8897 |
+| 0.2093        | 4.0    | 226  | 0.2754          | 0.8926   | 0.8922    | 0.8920 | 0.8921 |
+| 0.1622        | 4.9912 | 282  | 0.2980          | 0.8989   | 0.9016    | 0.8961 | 0.8978 |
+| 0.1235        | 6.0    | 339  | 0.3167          | 0.8889   | 0.8883    | 0.8884 | 0.8884 |
+| 0.1125        | 6.9912 | 395  | 0.3369          | 0.8939   | 0.8973    | 0.8907 | 0.8926 |
+| 0.0811        | 8.0    | 452  | 0.3535          | 0.8914   | 0.8906    | 0.8918 | 0.8911 |
+| 0.0797        | 8.9912 | 508  | 0.3833          | 0.8914   | 0.8919    | 0.8898 | 0.8906 |
+| 0.0585        | 9.9115 | 560  | 0.3809          | 0.8926   | 0.8924    | 0.8918 | 0.8920 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:433c7fdf1624154d668fcb0a01bc4e1de524e6ce39dfde04fd4d9617a920c2a5
 size 950254592

 version https://git-lfs.github.com/spec/v1
+oid sha256:6c1dc06b386f44450b52f392ae0a3e2f7aaa20a20f88bc65788aa2c8966a0c15
 size 950254592