Update README.md
Browse files
README.md
CHANGED
@@ -47,7 +47,7 @@ This is a speech generation network, aimed at maximizing the expressiveness and
|
|
47 |
- Retrained PL-Bert, Pitch Extractor, Text Aligner from scratch
|
48 |
- Whisper's Encoder instead of WavLM for the SLM
|
49 |
- 48khz Config
|
50 |
-
- improved Performance on non-verbal sounds and cues. such as sigh, pauses, etc. and also very slightly on laughter
|
51 |
- a new way of sampling the Style Vectors.
|
52 |
- Promptable Speech Synthesizing.
|
53 |
- a Smart Phonemization algorithm that can handle Romaji inputs or a mixture of Japanese and Romaji.
|
|
|
47 |
- Retrained PL-Bert, Pitch Extractor, Text Aligner from scratch
|
48 |
- Whisper's Encoder instead of WavLM for the SLM
|
49 |
- 48khz Config
|
50 |
+
- improved Performance on non-verbal sounds and cues. such as sigh, pauses, etc. and also very slightly on laughter (depends on the speaker)
|
51 |
- a new way of sampling the Style Vectors.
|
52 |
- Promptable Speech Synthesizing.
|
53 |
- a Smart Phonemization algorithm that can handle Romaji inputs or a mixture of Japanese and Romaji.
|