Yousefmd
/

arabert-sentiment-classification

@@ -1,9 +1,8 @@
 ---
-base_model: aubmindlab/bert-large-arabertv02-twitter
 tags:
 - generated_from_trainer
-metrics:
-- accuracy
 model-index:
 - name: arabert-sentiment-classification
   results: []
@@ -14,11 +13,16 @@ should probably proofread and complete it, then remove this comment. -->
 # arabert-sentiment-classification
-This model is a fine-tuned version of [aubmindlab/bert-large-arabertv02-twitter](https://huggingface.co/aubmindlab/bert-large-arabertv02-twitter) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5604
-- Macro F1: 0.6567
-- Accuracy: 0.7945
 ## Model description
@@ -38,24 +42,15 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 25
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Macro F1 | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| No log        | 1.0   | 249  | 0.5455          | 0.6303   | 0.7892   |
-| No log        | 2.0   | 498  | 0.5514          | 0.6437   | 0.7922   |
-| 0.5616        | 3.0   | 747  | 0.5604          | 0.6567   | 0.7945   |
 ### Framework versions
 - Transformers 4.34.0

 ---
+license: mit
+base_model: xlm-roberta-large
 tags:
 - generated_from_trainer
 model-index:
 - name: arabert-sentiment-classification
   results: []
 # arabert-sentiment-classification
+This model is a fine-tuned version of [xlm-roberta-large](https://huggingface.co/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 0.6336
+- eval_macro_f1: 0.6099
+- eval_accuracy: 0.7641
+- eval_runtime: 92.4531
+- eval_samples_per_second: 43.049
+- eval_steps_per_second: 2.693
+- epoch: 2.0
+- step: 995
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 25
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 ### Framework versions
 - Transformers 4.34.0

config.json CHANGED Viewed

@@ -11,18 +11,18 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3"
   },
   "initializer_range": 0.02,
   "intermediate_size": 4096,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_2": 2,
-    "LABEL_3": 3
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
+    "0": "Positive",
+    "1": "Negative",
+    "2": "Neutral",
+    "3": "Mixed"
   },
   "initializer_range": 0.02,
   "intermediate_size": 4096,
   "label2id": {
+    "Mixed": 3,
+    "Negative": 1,
+    "Neutral": 2,
+    "Positive": 0
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a9aa52be83a88d1fd313350b18323ed105a9c38e72cff8db13d0d65124632447
 size 2239713713

 version https://git-lfs.github.com/spec/v1
+oid sha256:55e641ef208c797251140016230a92889e1abc07a0c0d4bae12996490fec5b4d
 size 2239713713