morit committed on
Commit
1616ad0
1 Parent(s): e10db45

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -49,7 +49,7 @@ classifier(sequence_to_classify, candidate_labels, hypothesis_template=hypothesi
 
 
 ## Training
-This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in spanish which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
+This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in spanish which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every epoch to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
 
 ![Training Charts from wandb](screen_wandb.png)
 - learning rate: 2e-5
@@ -62,6 +62,6 @@ using a GPU (NVIDIA GeForce RTX 3090) resulting in a training time of 1h 47 mins
 
 The best performing model was evaluatated on the XNLI test set to get a comparable result
 ```
-predict_accuracy = 79.2 %
+predict_accuracy = 79.20 %
 ```
 