patrickvonplaten
commited on
Commit
•
8cecc54
1
Parent(s):
5860005
Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,9 @@ tags:
|
|
10 |
|
11 |
[Microsoft's UniSpeech](https://www.microsoft.com/en-us/research/publication/unispeech-unified-speech-representation-learning-with-labeled-and-unlabeled-data/)
|
12 |
|
13 |
-
The large model pretrained on 16kHz sampled speech audio with utterance and speaker contrastive loss. When using the model, make sure that your speech input is also sampled at 16kHz.
|
|
|
|
|
14 |
|
15 |
The model was pre-trained on:
|
16 |
|
|
|
10 |
|
11 |
[Microsoft's UniSpeech](https://www.microsoft.com/en-us/research/publication/unispeech-unified-speech-representation-learning-with-labeled-and-unlabeled-data/)
|
12 |
|
13 |
+
The large model pretrained on 16kHz sampled speech audio with utterance and speaker contrastive loss. When using the model, make sure that your speech input is also sampled at 16kHz.
|
14 |
+
|
15 |
+
**Note**: This model does not have a tokenizer as it was pretrained on audio alone. In order to use this model **speech recognition**, a tokenizer should be created and the model should be fine-tuned on labeled text data. Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more in-detail explanation of how to fine-tune the model.
|
16 |
|
17 |
The model was pre-trained on:
|
18 |
|