morit commited on
Commit
f0dc4d2
1 Parent(s): bc54453

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -48,7 +48,7 @@ classifier(sequence_to_classify, candidate_labels, hypothesis_template=hypothesi
48
  ```
49
 
50
  ## Training
51
- This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in english which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
52
 
53
  ![Training Charts from wandb](screen_wandb.png)
54
  - learning rate: 2e-5
 
48
  ```
49
 
50
  ## Training
51
+ This model was pre-trained on a set of 100 languages and follwed further training on 198M multilingual tweets as described in the original [paper](https://arxiv.org/abs/2104.12250). Further it was trained on the training set of XNLI dataset in english which is a machine translated version of the MNLI dataset. It was trained on 5 epochs of the XNLI train set and evaluated on the XNLI eval dataset at the end of every epoch to find the best performing model. The model which had the highest accuracy on the eval set was chosen at the end.
52
 
53
  ![Training Charts from wandb](screen_wandb.png)
54
  - learning rate: 2e-5