okuchaiev commited on
Commit
26d25ec
1 Parent(s): 5f5918e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -11
README.md CHANGED
@@ -16,14 +16,27 @@ pip install nemo_toolkit['all']
16
 
17
  The model is available for use in the NeMo toolkit [3], and can be used as a pre-trained checkpoint for inference or for fine-tuning on another dataset.
18
 
19
- ### Automatically load the model from NGC
20
 
21
  ```python
22
  import nemo.collections.asr as nemo_asr
23
- asr_model = nemo_asr.models.EncDecCTCModelBPE.from_pretrained(model_name="stt_en_conformer_ctc_large")
 
 
 
 
 
 
 
 
 
 
 
 
 
24
  ```
25
 
26
- ### Transcribing text with this model
27
 
28
  ```shell
29
  python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \
@@ -105,11 +118,4 @@ Since this model was trained on publically available speech datasets, the perfor
105
 
106
  [2] [Google Sentencepiece Tokenizer](https://github.com/google/sentencepiece)
107
 
108
- [3] [NVIDIA NeMo Toolkit](https://github.com/NVIDIA/NeMo)
109
-
110
-
111
- ## Licence
112
-
113
- License to use this model is covered by the NGC [TERMS OF USE](https://ngc.nvidia.com/legal/terms) unless another License/Terms Of Use/EULA is clearly specified. By downloading the public and release version of the model, you accept the terms and conditions of the NGC [TERMS OF USE](https://ngc.nvidia.com/legal/terms).
114
-
115
-
 
16
 
17
  The model is available for use in the NeMo toolkit [3], and can be used as a pre-trained checkpoint for inference or for fine-tuning on another dataset.
18
 
19
+ ### Automatically instantiate the model
20
 
21
  ```python
22
  import nemo.collections.asr as nemo_asr
23
+ from huggingface_hub import hf_hub_download
24
+
25
+ path = hf_hub_download(repo_id="nvidia/stt_en_conformer_ctc_large",filename="stt_en_conformer_large.nemo")
26
+ asr_model = nemo_asr.models.EncDecCTCModelBPE.restore_from(path)
27
+ ```
28
+
29
+ ### Transcribing using Python
30
+ First, let's get a sample
31
+ ```
32
+ wget https://dldata-public.s3.us-east-2.amazonaws.com/2086-149220-0033.wav
33
+ ```
34
+ Then simply do:
35
+ ```
36
+ asr_model.transcribe(['2086-149220-0033.wav'])
37
  ```
38
 
39
+ ### Transcribing many audio files
40
 
41
  ```shell
42
  python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \
 
118
 
119
  [2] [Google Sentencepiece Tokenizer](https://github.com/google/sentencepiece)
120
 
121
+ [3] [NVIDIA NeMo Toolkit](https://github.com/NVIDIA/NeMo)