File size: 873 Bytes
8acd3cd
 
 
 
 
 
 
14050d4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

## Pre-trained TDNN models for the yesno dataset with icefall.

Refer to <https://github.com/k2-fsa/icefall/tree/master/egs/yesno/ASR>
for more information about this pre-trained model.

You can find usage instructions there.


## Sound files for testing the pre-trained model

The folder `test_waves` contains test sound files. They
are downloaded from <https://www.openslr.org/1/>.

There are 60 files in the dataset, 30 are used for training.
The remaining 30 files, contained in `test_waves` are kept for testing.

The code for splitting the dataset can be found at
<https://github.com/lhotse-speech/lhotse/blob/master/lhotse/recipes/yesno.py#L138>

```python
wave_files = list(corpus_dir.glob("*.wav"))
assert len(wave_files) == 60

wave_files.sort()
train_set = wave_files[::2]
test_set = wave_files[1::2]

assert len(train_set) == 30
assert len(test_set) == 30
```