Update README.md
Browse files
README.md
CHANGED
@@ -42,7 +42,6 @@ Github's repo:
|
|
42 |
|
43 |
This is a speech generation network, aimed at maximizing the expressiveness and Controllability of the generated speech. at its core it uses [StyleTTS 2](https://github.com/yl4579/StyleTTS2)'s architecture with the following changes:
|
44 |
|
45 |
-
- an entirely new data pre-processing pipeline
|
46 |
- Incorporating mLSTM Layers instead of regular PyTorch LSTM layers, and increasing the capacity of the text and prosody encoder by using a higher number of parameters
|
47 |
- Retrained PL-Bert, Pitch Extractor, Text Aligner from scratch
|
48 |
- Whisper's Encoder instead of WavLM for the SLM
|
|
|
42 |
|
43 |
This is a speech generation network, aimed at maximizing the expressiveness and Controllability of the generated speech. at its core it uses [StyleTTS 2](https://github.com/yl4579/StyleTTS2)'s architecture with the following changes:
|
44 |
|
|
|
45 |
- Incorporating mLSTM Layers instead of regular PyTorch LSTM layers, and increasing the capacity of the text and prosody encoder by using a higher number of parameters
|
46 |
- Retrained PL-Bert, Pitch Extractor, Text Aligner from scratch
|
47 |
- Whisper's Encoder instead of WavLM for the SLM
|