atasoglu/turkish-base-bert-uncased-mean-nli-stsb-tr

@@ -21,6 +21,12 @@ This model was adapted from [ytu-ce-cosmos/turkish-base-bert-uncased](https://hu
 - [nli_tr](https://huggingface.co/datasets/nli_tr)
 - [emrecan/stsb-mt-turkish](https://huggingface.co/datasets/emrecan/stsb-mt-turkish)
 ## Usage (Sentence-Transformers)
 Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
@@ -85,10 +91,10 @@ print(sentence_embeddings)
 Achieved results on the [STS-b](https://huggingface.co/datasets/emrecan/stsb-mt-turkish) test split are given below:
 ```txt
-Cosine-Similarity :       Pearson: 0.7408 Spearman: 0.7274
-Manhattan-Distance:       Pearson: 0.7113 Spearman: 0.7146
-Euclidean-Distance:       Pearson: 0.7106 Spearman: 0.7140
-Dot-Product-Similarity:   Pearson: 0.7144 Spearman: 0.7025
 ```
@@ -97,9 +103,9 @@ The model was trained with the parameters:
 **DataLoader**:
-`torch.utils.data.dataloader.DataLoader` of length 180 with parameters:
 ```
-{'batch_size': 32, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
 ```
 **Loss**:
@@ -109,8 +115,8 @@ The model was trained with the parameters:
 Parameters of the fit()-Method:
 ```
 {
-    "epochs": 5,
-    "evaluation_steps": 90,
     "evaluator": "sentence_transformers.evaluation.EmbeddingSimilarityEvaluator.EmbeddingSimilarityEvaluator",
     "max_grad_norm": 1,
     "optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
@@ -119,7 +125,7 @@ Parameters of the fit()-Method:
     },
     "scheduler": "WarmupLinear",
     "steps_per_epoch": null,
-    "warmup_steps": 72,
     "weight_decay": 0.01
 }
 ```

 - [nli_tr](https://huggingface.co/datasets/nli_tr)
 - [emrecan/stsb-mt-turkish](https://huggingface.co/datasets/emrecan/stsb-mt-turkish)
+:warning: **All texts were manually lowercased,** [as stated](https://huggingface.co/ytu-ce-cosmos/turkish-base-bert-uncased#%E2%9A%A0-uncased-use-requires-manual-lowercase-conversion) by the model's authors:
+ ```python
+text.replace("I", "ı").lower()
+```
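The one-liner above can be wrapped in a small helper and applied before encoding. This is a minimal sketch assuming plain Python strings; the name `turkish_lower` is hypothetical and is not part of the model or of sentence-transformers. The explicit replacement matters because Python's `str.lower()` maps `I` to `i`, whereas Turkish lowercases `I` to the dotless `ı`.

```python
def turkish_lower(text: str) -> str:
    """Lowercase text with the rule quoted from the base model's authors."""
    # Replace the capital "I" with dotless "ı" first;
    # str.lower() on its own would turn it into a dotted "i".
    return text.replace("I", "ı").lower()


# Hypothetical example inputs, lowercased before they are passed to the model.
sentences = ["IŞIK", "Irmak"]
lowered = [turkish_lower(s) for s in sentences]
print(lowered)
```

Note that `str.lower()` converts the dotted capital `İ` (U+0130) into `i` followed by a combining dot (U+0307). The quoted rule does not address that case, so this sketch leaves it unchanged.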
 ## Usage (Sentence-Transformers)
 Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
 Achieved results on the [STS-b](https://huggingface.co/datasets/emrecan/stsb-mt-turkish) test split are given below:
 ```txt
+Cosine-Similarity :	    Pearson: 0.8401	Spearman: 0.8410
+Manhattan-Distance:	    Pearson: 0.8256	Spearman: 0.8261
+Euclidean-Distance:	    Pearson: 0.8261	Spearman: 0.8268
+Dot-Product-Similarity:	Pearson: 0.7823	Spearman: 0.7723
 ```
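The scores above are correlations between similarity scores computed from sentence embeddings and the gold STS-b labels. The following is a minimal pure-Python sketch of how cosine similarity, Pearson and Spearman fit together. The helper functions and the toy vectors are hypothetical, and this is not the evaluator that produced the reported numbers (the card lists `EmbeddingSimilarityEvaluator` for that).

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of std devs."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Toy embedding pairs and gold scores, made up for illustration only.
embedding_pairs = [
    ([1.0, 0.0], [1.0, 0.1]),
    ([1.0, 0.0], [0.5, 0.5]),
    ([1.0, 0.0], [0.0, 1.0]),
]
gold_scores = [4.8, 2.5, 0.2]

predicted = [cosine_similarity(u, v) for u, v in embedding_pairs]
print(f"Pearson: {pearson(predicted, gold_scores):.4f}")
print(f"Spearman: {spearman(predicted, gold_scores):.4f}")
```

Spearman only looks at the ordering of the scores, which is why it can differ from Pearson on the same predictions.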
 **DataLoader**:
+`torch.utils.data.dataloader.DataLoader` of length 90 with parameters:
 ```
+{'batch_size': 64, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
 ```
 **Loss**:
 Parameters of the fit()-Method:
 ```
 {
+    "epochs": 4,
+    "evaluation_steps": 9,
     "evaluator": "sentence_transformers.evaluation.EmbeddingSimilarityEvaluator.EmbeddingSimilarityEvaluator",
     "max_grad_norm": 1,
     "optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
     },
     "scheduler": "WarmupLinear",
     "steps_per_epoch": null,
+    "warmup_steps": 36,
     "weight_decay": 0.01
 }
 ```
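The updated values are consistent with one another: a DataLoader of length 90 run for 4 epochs gives 360 training steps, and 36 warmup steps is 10% of that total. A small sketch of the arithmetic follows. The 10% warmup ratio is an assumption inferred from these numbers, not something the card states.

```python
import math

batches_per_epoch = 90  # DataLoader length reported in the card
epochs = 4              # "epochs" value reported in the card
warmup_percent = 10     # assumption: inferred from the numbers, not stated

# Total number of optimizer steps over the whole run.
total_steps = batches_per_epoch * epochs

# Warmup covers the first warmup_percent of those steps.
warmup_steps = math.ceil(total_steps * warmup_percent / 100)

print(total_steps, warmup_steps)
```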

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "/content/drive/MyDrive/models/e4_b32_turkish_base_bert_uncased-mean-nli-stsb/",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "output/ytu_ce_cosmos-turkish_base_bert_uncased-b64-e4-nli/",
   "architectures": [
     "BertModel"
   ],

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a20bab3e0a305feb348e8cfb630e17e1d694ba9ccff89f80286e2f2cd4e087cf
 size 442541034

 version https://git-lfs.github.com/spec/v1
+oid sha256:6d14328e5385c0743dccf01fc30292d193e29d5a81d2c1460e386d33db762ee5
 size 442541034