patrickvonplaten commited on
Commit
ad2f1b5
1 Parent(s): 88464d8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -11,6 +11,8 @@ license: cc-by-nc-4.0
11
 
12
  [Facebook's Wav2Vec2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/) large model pretrained on the 100k unlabeled subset of [VoxPopuli corpus](https://arxiv.org/abs/2101.00390).
13
 
 
 
14
  **Paper**: *[VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
15
  Learning, Semi-Supervised Learning and Interpretation](https://arxiv.org/abs/2101.00390)*
16
 
11
 
12
  [Facebook's Wav2Vec2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/) large model pretrained on the 100k unlabeled subset of [VoxPopuli corpus](https://arxiv.org/abs/2101.00390).
13
 
14
+ **Note**: This model does not have a tokenizer as it was pretrained on audio alone. In order to use this model **speech recognition**, a tokenizer should be created and the model should be fine-tuned on labeled text data. Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more in-detail explanation of how to fine-tune the model.
15
+
16
  **Paper**: *[VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
17
  Learning, Semi-Supervised Learning and Interpretation](https://arxiv.org/abs/2101.00390)*
18