dqnguyen commited on
Commit
dc5f5fd
1 Parent(s): c4b2482

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -67,7 +67,8 @@ Before applying BPE to the pre-training corpus of English Tweets, we tokenized t
67
 
68
  For `vinai/bertweet-large`, given the raw input Tweets, to obtain the same pre-processing output, users could employ our [TweetNormalizer](https://github.com/VinAIResearch/BERTweet/blob/master/TweetNormalizer.py) module.
69
 
70
- - Installation: `pip3 install nltk emoji`
 
71
 
72
  ```python
73
  import torch
 
67
 
68
  For `vinai/bertweet-large`, given the raw input Tweets, to obtain the same pre-processing output, users could employ our [TweetNormalizer](https://github.com/VinAIResearch/BERTweet/blob/master/TweetNormalizer.py) module.
69
 
70
+ - Installation: `pip3 install nltk emoji==0.6.0`
71
+ - The `emoji` version must be either 0.5.4 or 0.6.0. Newer `emoji` versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing our pre-training Tweet corpus.
72
 
73
  ```python
74
  import torch