Artem Gorlanov committed on
Commit
d7280b3
1 Parent(s): 54d944a
Files changed (1)
  1. README.md +4 -6
README.md CHANGED
@@ -37,9 +37,7 @@ docker run -it -p 7860:7860 --platform=linux/amd64 \
  | cpu 2v CPU huggingece | 453.8 s/it |
  | gpu tesla v100 | 8.23 s/it |
  
-
-
-
+ # Approaches
  I know a lot of methods for this task:
  + separation: using separation models(need longtime train and finetune)
  + diarization
@@ -52,10 +50,10 @@ I know a lot of methods for this task:
  For this task i used speaker_embedding+Clustering unknowing numbers of speakers
  
  
- How i can improve (i have experience in it):
- + preprocessing
+ # How i can improve:
+ + Fix preprocessing
  + estimate SNR(signal noise rate) and if input clean dont use denoising
- + train:
+ + Add train:
  + custom speaker recognition model
  + custom overlap speech detector
  + custom speech separation model:
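
Note on the "estimate SNR ... and if input clean dont use denoising" item in the diff above: the README does not say how the SNR would be estimated, so the following is only a minimal sketch of one possible gate, not the author's method. It assumes a mono signal given as a list of floats, treats the quietest frames as noise and the loudest frames as speech plus noise, and uses hypothetical names (`estimate_snr_db`, `needs_denoising`) and an arbitrary 20 dB threshold that would need tuning on real recordings.

```python
import math


def frame_energies(samples, frame_len=400):
    """Mean squared amplitude of each non-overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies


def estimate_snr_db(samples, frame_len=400, noise_fraction=0.1):
    """Rough SNR estimate in dB (an assumption, not a calibrated measure).

    The quietest `noise_fraction` of frames is taken as the noise floor,
    the loudest `noise_fraction` as speech plus noise.
    """
    energies = sorted(frame_energies(samples, frame_len))
    if not energies:
        raise ValueError("signal is shorter than one frame")
    k = max(1, int(len(energies) * noise_fraction))
    noise = sum(energies[:k]) / k
    loud = sum(energies[-k:]) / k
    eps = 1e-12  # avoids log of zero for digitally silent input
    signal = max(loud - noise, eps)
    return 10.0 * math.log10(signal / max(noise, eps))


def needs_denoising(samples, threshold_db=20.0):
    """True when the estimated SNR is below the (hypothetical) threshold."""
    return estimate_snr_db(samples) < threshold_db
```

With a gate like this, the denoising step would run only when `needs_denoising(samples)` returns `True`, so clean input skips it. The estimate depends on the recording containing some pauses; for audio with no quiet frames it will understate the SNR.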