NekoMikoReimu committed on
Commit be9bc8b · verified · 1 Parent(s): afc4029

Update README.md

  1. README.md +13 -3
README.md CHANGED
@@ -1,3 +1,13 @@
- ---
- license: cc-by-nc-4.0
- ---
+ ---
+ license: cc-by-nc-4.0
+ language:
+ - ja
+ - en
+ ---
+
+ A diarization pipeline for Whisper large-v2 that uses a custom-tuned segmentation model and custom audio filtering (low-pass filter, equalizer, etc.) for improved performance.
+ Accepts a video file or an mp3/wav audio file.
+
+ Performance is considerably better than default JP Whisper for most tasks involving Japanese content, with the exception of singing/karaoke.
+
+ Requires ffmpeg, pyannote, and facebookresearch's demucs model. Torch is also strongly encouraged.
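
The README mentions ffmpeg-based filtering (a low-pass filter, among others) and accepting either video or audio input, but does not show the commands involved. As a rough illustration only, the sketch below builds an ffmpeg invocation that extracts mono audio from a video or audio file and applies a low-pass filter. The function name `build_ffmpeg_command`, the 8000 Hz cutoff, and the 16 kHz sample rate are assumptions made for this example; they are not taken from the repository and may differ from what the pipeline actually does.

```python
from pathlib import Path


def build_ffmpeg_command(src, dst, lowpass_hz=8000, sample_rate=16000):
    """Return an ffmpeg argv list that extracts mono audio and low-pass filters it.

    The cutoff and sample rate defaults are illustrative assumptions,
    not values documented in the README.
    """
    return [
        "ffmpeg",
        "-y",                              # overwrite the output file if it exists
        "-i", str(src),                    # input: a video or an mp3/wav file
        "-vn",                             # drop any video stream
        "-ac", "1",                        # downmix to mono
        "-ar", str(sample_rate),           # resample the output audio
        "-af", f"lowpass=f={lowpass_hz}",  # ffmpeg's built-in low-pass audio filter
        str(dst),
    ]


cmd = build_ffmpeg_command(Path("input.mp4"), Path("audio.wav"))
print(" ".join(cmd))
# To actually run it (requires ffmpeg on PATH):
# import subprocess; subprocess.run(cmd, check=True)
```

Building the argument list separately from running it keeps the command easy to inspect and test without ffmpeg installed. Further filters (for example an equalizer stage) could be chained in the `-af` string with commas, though which filters and settings the pipeline really uses is not stated in the README.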