SaiedAlshahrani
/

arwiki_20230101_roberta_mlm_bots

@@ -29,7 +29,6 @@ It achieves the following results on the evaluation set:
 - Pseudo-Perplexity: 23.70
 ## Model description
 We trained this Arabic Wikipedia Masked Language Model (arRoBERTa<sub>BASE</sub>) to evaluate its performance using the Fill-Mask evaluation task and the Masked Arab States Dataset ([MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)) dataset and measure the *impact* of **template-based translation** on the Egyptian Arabic Wikipedia edition.
@@ -52,17 +51,14 @@ For more details about the experiment, please **read** and **cite** our paper:
 }
 ```
 ## Intended uses & limitations
 We do **not** recommend using this model because it was trained *only* on the Arabic Wikipedia articles, <u>unless</u> you fine-tune the model on a large, organic, and representative Arabic dataset.
 ## Training and evaluation data
 We have trained this model on the Arabic Wikipedia articles ([SaiedAlshahrani/Arabic_Wikipedia_20230101_bots](https://huggingface.co/datasets/SaiedAlshahrani/Arabic_Wikipedia_20230101_bots)) without using any validation or evaluation data (only training data) due to a lack of computational power.
 ## Training procedure
 We have trained this model using the Paperspace GPU-Cloud service. We used a machine with 8 CPUs, 45GB RAM, and A6000 GPU with 48GB RAM.
@@ -78,7 +74,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Epoch | Step  | Training Loss |
@@ -93,15 +88,12 @@ The following hyperparameters were used during training:
 |:--------------:|:------------------------:|:----------------------:|:-------------------------:|:----------:|:--------:|
 | 17048.756800   | 248.355000               | 0.970000               | 140390797515571200.000000 | 3.639375   | 5.000000 |
 ### Evaluation results
 This arRoBERTa<sub>BASE</sub> model has been evaluated on the Masked Arab States Dataset ([SaiedAlshahrani/MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)).
 | K=10 | K=50  | K=100 |
 |:----:|:-----:|:----:|
 | 43.12%| 45% | 50.62% |
 ### Framework versions
 - Datasets 2.9.0

 - Pseudo-Perplexity: 23.70
 ## Model description
 We trained this Arabic Wikipedia Masked Language Model (arRoBERTa<sub>BASE</sub>) to evaluate its performance using the Fill-Mask evaluation task and the Masked Arab States Dataset ([MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)) dataset and measure the *impact* of **template-based translation** on the Egyptian Arabic Wikipedia edition.
 }
 ```
 ## Intended uses & limitations
 We do **not** recommend using this model because it was trained *only* on the Arabic Wikipedia articles, <u>unless</u> you fine-tune the model on a large, organic, and representative Arabic dataset.
 ## Training and evaluation data
 We have trained this model on the Arabic Wikipedia articles ([SaiedAlshahrani/Arabic_Wikipedia_20230101_bots](https://huggingface.co/datasets/SaiedAlshahrani/Arabic_Wikipedia_20230101_bots)) without using any validation or evaluation data (only training data) due to a lack of computational power.
 ## Training procedure
 We have trained this model using the Paperspace GPU-Cloud service. We used a machine with 8 CPUs, 45GB RAM, and A6000 GPU with 48GB RAM.
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Epoch | Step  | Training Loss |
 |:--------------:|:------------------------:|:----------------------:|:-------------------------:|:----------:|:--------:|
 | 17048.756800   | 248.355000               | 0.970000               | 140390797515571200.000000 | 3.639375   | 5.000000 |
 ### Evaluation results
 This arRoBERTa<sub>BASE</sub> model has been evaluated on the Masked Arab States Dataset ([SaiedAlshahrani/MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)).
 | K=10 | K=50  | K=100 |
 |:----:|:-----:|:----:|
 | 43.12%| 45% | 50.62% |
 ### Framework versions
 - Datasets 2.9.0