Tanel committed on
Commit
4131987
1 Parent(s): e62a1b6

First commit

  1. README.md +37 -0
README.md CHANGED
@@ -11,3 +11,40 @@ widget:
  https://huggingface.co/TalTechNLP/icefall_pruned_transducer_stateless7_streaming_et/resolve/main/test_wav.wav
  license: cc-by-sa-4.0
  ---
+
+ # Icefall streaming ASR model for Estonian
+
+ This is a streaming end-to-end transducer model for Estonian, trained using [Icefall](https://github.com/k2-fsa/icefall).
+
+ It is trained on around 800 h of manually transcribed speech from various domains and on
+ about 2500 h of automatically transcribed speech from Estonian TV (mainly news and talk shows).
+
+ ## Serving
+
+ To use it on a server for browser-based ASR:
+
+ * Install [Sherpa](https://github.com/k2-fsa/sherpa)
+ * Clone this model locally:
+
+
+     git lfs install
+     git clone https://huggingface.co/TalTechNLP/icefall_pruned_transducer_stateless7_streaming_et
+
+ * Set the `SHERPA_ROOT_DIR` shell variable to the Sherpa root directory (the command below serves the web pages from `${SHERPA_ROOT_DIR}/sherpa/bin/web`)
+
+ * Start serving on port 6006:
+
+
+     sherpa-online-websocket-server --use-gpu=false --decode-chunk-size=32 \
+     --encoder-model=icefall_pruned_transducer_stateless7_streaming_et/exp/1d/encoder_jit_trace.pt \
+     --decoder-model=icefall_pruned_transducer_stateless7_streaming_et/exp/1d/decoder_jit_trace.pt \
+     --joiner-model=icefall_pruned_transducer_stateless7_streaming_et/exp/1d/joiner_jit_trace.pt \
+     --tokens=icefall_pruned_transducer_stateless7_streaming_et/data/lang_bpe_1000/tokens.txt \
+     --doc-root=${SHERPA_ROOT_DIR}/sherpa/bin/web --decoding-method=modified_beam_search
+
+ * Open http://localhost:6006 in a browser (this also works through an SSH tunnel) and go to the "Streaming-Record" tab
+
+ * Click "Connect", then the "Streaming-Record" button, and start talking
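
The SSH tunnel mentioned in the steps above could be set up roughly as follows. This is a minimal sketch, not part of the original instructions: `REMOTE_USER` and `REMOTE_HOST` are hypothetical placeholders for your own login and server, and the port is the 6006 used above. The snippet only builds and prints the `ssh` command so that it can be reviewed before it is run on the local machine.

```shell
# Hypothetical values: replace with your own SSH login and server host name.
REMOTE_USER="user"
REMOTE_HOST="asr-server.example.com"
PORT=6006  # the port the server is started on in the steps above

# -N: do not start a remote shell, only forward ports.
# -L: forward local PORT to localhost:PORT as seen from the server.
TUNNEL_CMD="ssh -N -L ${PORT}:localhost:${PORT} ${REMOTE_USER}@${REMOTE_HOST}"

# Print the command instead of running it, so it can be checked first.
echo "${TUNNEL_CMD}"
```

While the printed command is running on the local machine, http://localhost:6006 in the local browser should reach the server started on the remote host.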
+
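
Before starting the server, it may help to confirm that the clone actually contains the files the serving command points at. The sketch below is an assumption-laden helper, not something shipped with the model: the function name `check_model_files` and the `MODEL_DIR` variable are made up for illustration, while the relative paths are copied from the serving command above.

```shell
# check_model_files MODEL_DIR
# Returns 0 if every file referenced by the serving command exists under
# MODEL_DIR, and 1 otherwise. Missing paths are reported on stderr.
check_model_files() {
  model_dir="$1"
  missing=0
  for f in \
    exp/1d/encoder_jit_trace.pt \
    exp/1d/decoder_jit_trace.pt \
    exp/1d/joiner_jit_trace.pt \
    data/lang_bpe_1000/tokens.txt
  do
    if [ ! -f "${model_dir}/${f}" ]; then
      echo "missing: ${model_dir}/${f}" >&2
      missing=1
    fi
  done
  return "${missing}"
}

# Example: check the clone in the current directory (override with MODEL_DIR).
if check_model_files "${MODEL_DIR:-icefall_pruned_transducer_stateless7_streaming_et}"; then
  echo "model files OK"
else
  echo "some model files are missing; check the git clone step"
fi
```

The check only tests that the files exist; it does not verify that Git LFS downloaded the real model weights rather than pointer files.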