hecko commited on
Commit
36d74ed
1 Parent(s): b5bb29c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -2
README.md CHANGED
@@ -4,6 +4,9 @@ language:
4
  - nb
5
  ---
6
 
7
- Warmstarted from Chills model, then trained for 25 (de facto 50) epochs. Batch size 16, learning rate (√2)e-3 for the first 15(?) epochs and (5√2)e-4 for the last 10.
8
 
9
- Dataset: [NST Norwegian Speech Synthesis](https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-15/) (CC0), shuffled together with a copy that has had all audio files under 6(?) seconds merged recursively.
 
 
 
 
4
  - nb
5
  ---
6
 
7
+ Warmstarted from the "Chills" single-speaker male model (not available on HF as of right now), then trained for 25 (de facto 50) epochs. Batch size 16, learning rate (√2)e-3 for the first 15(?) epochs and (5√2)e-4 for the next 10.
8
 
9
+ Dataset: [NST Norwegian Speech Synthesis](https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-15/) (CC0), augmented like so:
10
+ 1. Make a copy of the dataset.
11
+ 2. Join the two shortest clips of the copy with 100ms of silence between them, then replace them with the joined version. Repeat until the shortest clip is at least 6 seconds long.
12
+ 3. Shuffle the original together with the copy.