SaiedAlshahrani committed · Commit 32f8988 · Parent(s): 995dd4e

Update README.md

README.md CHANGED
@@ -29,7 +29,6 @@ It achieves the following results on the evaluation set:

- Pseudo-Perplexity: 115.80
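For context, pseudo-perplexity for a masked language model is conventionally computed (following Salazar et al., 2020) by masking each token in turn, scoring it with the model, and exponentiating the negative mean pseudo-log-likelihood over the evaluation text; assuming that convention is what is reported here:

$$\mathrm{PPPL}(W) = \exp\!\left(-\frac{1}{|W|}\sum_{t=1}^{|W|}\log P_{\mathrm{MLM}}\!\left(w_t \mid W_{\setminus t}\right)\right)$$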
## Model description

We trained this Egyptian Arabic Wikipedia Masked Language Model (arzRoBERTa<sub>BASE</sub>) to evaluate its performance on the Fill-Mask evaluation task with the Masked Arab States Dataset ([MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)) and to measure the *impact* of **template-based translation** on the Egyptian Arabic Wikipedia edition.
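As a quick, hedged illustration of the Fill-Mask setup (not code from the paper), the model can be queried with the Transformers fill-mask pipeline; the model ID below is a placeholder rather than this repository's confirmed name, and the example sentence is ours:

```python
# Minimal fill-mask sketch; "SaiedAlshahrani/arzRoBERTa" is a placeholder model ID,
# not necessarily this repository's exact name.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="SaiedAlshahrani/arzRoBERTa")

# Ask the model to fill the masked token ("The capital of Egypt is <mask>.").
predictions = fill_mask("عاصمة مصر هي <mask>.")
for p in predictions:
    print(p["token_str"], round(p["score"], 4))
```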
@@ -52,22 +51,18 @@ For more details about the experiment, please **read** and **cite** our paper:

## Intended uses & limitations

We do **not** recommend using this model because it was trained *only* on Egyptian Arabic Wikipedia articles, which are known for template-based translation from English that produces limited, shallow, and unrepresentative articles, <u>unless</u> you fine-tune the model on a large, organic, and representative Egyptian dataset.
## Training and evaluation data

We trained this model on the Egyptian Arabic Wikipedia articles ([SaiedAlshahrani/Egyptian_Arabic_Wikipedia_20230101](https://huggingface.co/datasets/SaiedAlshahrani/Egyptian_Arabic_Wikipedia_20230101)) without using any validation or evaluation data (only training data) due to a lack of computational power.
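For reference, the training corpus linked above can be loaded with the `datasets` library; a minimal sketch that only inspects the dataset, since its split and column names are not described in this card:

```python
# Load the Egyptian Arabic Wikipedia dump used for training and inspect its structure.
from datasets import load_dataset

wiki = load_dataset("SaiedAlshahrani/Egyptian_Arabic_Wikipedia_20230101")
print(wiki)  # shows the available splits, columns, and row counts
```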
## Training procedure

We trained this model on the Paperspace GPU cloud service, using a machine with 8 CPUs, 45 GB of RAM, and an A6000 GPU with 48 GB of GPU memory.
### Training hyperparameters

The following hyperparameters were used during training:
@@ -79,7 +74,6 @@ The following hyperparameters were used during training:
- lr_scheduler_type: linear
- num_epochs: 5
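As a rough sketch of how the listed hyperparameters map onto `transformers.TrainingArguments` (several hyperparameters are elided from this diff, and every value below that is not listed above is a placeholder):

```python
# Hedged sketch only: lr_scheduler_type and num_train_epochs come from this card;
# everything else is a placeholder, not the setting actually used.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="arzRoBERTa-base",  # placeholder output directory
    lr_scheduler_type="linear",    # from this card
    num_train_epochs=5,            # from this card
)
```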
### Training results

| Epoch | Step | Training Loss |
@@ -94,14 +88,12 @@ The following hyperparameters were used during training:
|:--------------:|:------------------------:|:----------------------:|:-------------------------:|:----------:|:--------:|
| 14677.117400 | 248.119000 | 0.970000 | 120746231839334400.000000 | 0.908513 | 5.000000 |
### Evaluation results

This arzRoBERTa<sub>BASE</sub> model has been evaluated on the Masked Arab States Dataset ([SaiedAlshahrani/MASD](https://huggingface.co/datasets/SaiedAlshahrani/MASD)).

| K=10  | K=50   | K=100 |
|:-----:|:------:|:-----:|
| 8.12% | 25.62% | 35%   |
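As an illustration of how a K=10/50/100 score like the above could be obtained (an assumption about the protocol, not a description of the paper's exact code; the model ID and MASD prompt handling are placeholders):

```python
# Hedged sketch of a top-K fill-mask check: count a prediction as correct if the
# gold token appears among the model's top-K candidates for the masked slot.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="SaiedAlshahrani/arzRoBERTa")  # placeholder model ID

def hit_at_k(masked_sentence: str, gold_token: str, k: int) -> bool:
    predictions = fill_mask(masked_sentence, top_k=k)
    return gold_token in [p["token_str"].strip() for p in predictions]
```

Averaging such checks over the MASD prompts for K=10, 50, and 100 would give percentages of the kind shown above, assuming that is the intended metric.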
### Framework versions

- Datasets 2.9.0