dpykhtar commited on
Commit
9be9066
1 Parent(s): e602fb6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -0
README.md CHANGED
@@ -45,6 +45,29 @@ To train, fine-tune or play with the model you will need to install [NVIDIA NeMo
45
  pip install nemo_toolkit['all']
46
  ```
47
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
  ### Input
49
 
50
  This model accepts 16000 kHz Mono-channel Audio (wav files) as input.
 
45
  pip install nemo_toolkit['all']
46
  ```
47
 
48
+ ### Automatically instantiate the model
49
+
50
+ ```python
51
+ import nemo.collections.asr as nemo_asr
52
+ asr_model = nemo_asr.models.EncDecCTCModel.from_pretrained("nvidia/stt_zh_citrinet_1024_gamma_0_25")
53
+ ```
54
+
55
+ ### Transcribing using Python
56
+ First, let's get a sample of spoken Mandarin Chinese.
57
+
58
+ Then simply do:
59
+ ```
60
+ asr_model.transcribe(['<Path of audio file(s)>'])
61
+ ```
62
+
63
+ ### Transcribing many audio files
64
+
65
+ ```shell
66
+ python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
67
+ pretrained_name="nvidia/stt_zh_citrinet_1024_gamma_0_25"
68
+ audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
69
+ ```
70
+
71
  ### Input
72
 
73
  This model accepts 16000 kHz Mono-channel Audio (wav files) as input.