giotvr
/

xlm_roberta_base_assin_fine_tuned

@@ -95,12 +95,16 @@ $10k$ sentence pairs equally distributed between *ptbr*  and *ptpt* subsets.
 ### Fine-Tuning Procedure
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-The fine-tuning procedure can be summarized in three major subsequent tasks:
-i
-#### Preprocessing [optional]
-[More Information Needed]
 #### Training Hyperparameters

 ### Fine-Tuning Procedure
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+The model fine-tuning procedure can be summarized in three major subsequent tasks:
+    <ol type="i">
+        <li>**[Data Processing](#data-processing):**</li> [ASSIN](https://huggingface.co/datasets/assin)'s *validation* and *train* splits were loaded from the **Hugging Face Hub** and processed afterwards;
+        <li>**Hyperparameter Tuning:**</li> [XLM-RoBERTa-base](https://huggingface.co/xlm-roberta-base)'s hyperparameters were chosen with the help of the [Weights & Biases] API to track the results and upload the fine-tuned models;
+        <li>**Final Model Loading and Testing:**</li>
+        using the *cross-tests* approach described in the [this section](#evaluation), the models' performance were measured using different datasets and metrics.
+    </ol>
+#### Data Processing [optional]
+##### Class Label Column Renaming
+The **Hugging Face**'s ```transformers``` module's ```DataCollator``` used by its ```Trainer``` requires that the ```class label``` column of the collated dataset to be called ```label```.  [ASSIN](https://huggingface.co/datasets/assin)'s class label column for each hypothesis/premise pair is called ```entailment_judgement```. Therefore, as the first step of the data preprocessing pipeline the column  ```entailment_judgement``` was renamed to ```label``` so that the **Hugging Face**'s ```transformers``` module's ```Trainer``` could be used.
 #### Training Hyperparameters