cemsubakan commited on
Commit
2f922e7
2 Parent(s): 6bed68b 182c591

Merge branch 'main' of https://huggingface.co/speechbrain/sepformer-wsj02mix into main

Browse files
Files changed (1) hide show
  1. README.md +77 -0
README.md ADDED
@@ -0,0 +1,77 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: "en"
3
+ thumbnail:
4
+ tags:
5
+ - Source Separation
6
+ - Speech Separation
7
+ - Audio Source Separation
8
+ - WSJ02Mix
9
+ - SepFormer
10
+ - Transformer
11
+ license: "apache-2.0"
12
+ datasets:
13
+ - WSJ0-2Mix
14
+ metrics:
15
+ - SI-SNRi
16
+ - SDRi
17
+
18
+ ---
19
+
20
+ # SepFormer trained on WSJ0-2Mix
21
+
22
+ This repository provides all the necessary tools to perform audio source separation with a [SepFormer](https://arxiv.org/abs/2010.13154v2)
23
+ model, implemented with SpeechBrain, and pretrained on WSJ0-2Mix dataset. For a better experience we encourage you to learn more about
24
+ [SpeechBrain](https://speechbrain.github.io). The given model performance is 22.4 dB on the test set of WSJ0-2Mix dataset.
25
+
26
+ | Release | Test-Set SI-SNRi | Test-Set SDRi |
27
+ |:-------------:|:--------------:|:--------------:|
28
+ | 09-03-21 | 22.4dB | 22.6dB |
29
+
30
+
31
+ ## Install SpeechBrain
32
+
33
+ First of all, please install SpeechBrain with the following command:
34
+
35
+ ```
36
+ pip install speechbrain
37
+ ```
38
+
39
+ Please notice that we encourage you to read our tutorials and learn more about
40
+ [SpeechBrain](https://speechbrain.github.io).
41
+
42
+ ### Perform source separation on your own audio file
43
+ ```python
44
+ from speechbrain.pretrained import SepformerSeparation as separator
45
+ import torchaudio
46
+
47
+ model = separator.from_hparams(source="speechbrain/sepformer-wsj02mix", savedir='sepformer-wsj02mix')
48
+
49
+ est_sources = model.separate_file('your_speechbrain_path/samples/audio_samples/test_mixture.wav')
50
+
51
+ torchaudio.save("source1hat.wav", est_sources[:, :, 0].detach().cpu(), 8000)
52
+ torchaudio.save("source2hat.wav", est_sources[:, :, 1].detach().cpu(), 8000)
53
+
54
+ ```
55
+
56
+ #### Referencing SpeechBrain
57
+
58
+ ```
59
+ @misc{SB2021,
60
+ author = {Ravanelli, Mirco and Parcollet, Titouan and Rouhe, Aku and Plantinga, Peter and Rastorgueva, Elena and Lugosch, Loren and Dawalatabad, Nauman and Ju-Chieh, Chou and Heba, Abdel and Grondin, Francois and Aris, William and Liao, Chien-Feng and Cornell, Samuele and Yeh, Sung-Lin and Na, Hwidong and Gao, Yan and Fu, Szu-Wei and Subakan, Cem and De Mori, Renato and Bengio, Yoshua },
61
+ title = {SpeechBrain},
62
+ year = {2021},
63
+ publisher = {GitHub},
64
+ journal = {GitHub repository},
65
+ howpublished = {\url{https://github.com/speechbrain/speechbrain}},
66
+ }
67
+ ```
68
+
69
+ #### Referencing SepFormer
70
+ ```
71
+ @inproceedings{subakan2021attention,
72
+ title={Attention is All You Need in Speech Separation},
73
+ author={Cem Subakan and Mirco Ravanelli and Samuele Cornell and Mirko Bronzi and Jianyuan Zhong},
74
+ year={2021},
75
+ booktitle={ICASSP 2021}
76
+ }
77
+ ```