projecte-aina
/

stt-ca-citrinet-512

Automatic Speech Recognition

Model card Files Files and versions Community

angel-poc commited on Dec 9, 2022

Commit

4f3aa11

•

1 Parent(s): 02c5709

Update README.md

Files changed (1) hide show

README.md +34 -0

README.md CHANGED Viewed

@@ -42,6 +42,40 @@ This model was fine-tuned from a pre-trained Spanish [stt-es-citrinet-512](https
 You can use this model for Automatic Speech Recognition (ASR) in catalan.
 ## Additional information

 You can use this model for Automatic Speech Recognition (ASR) in catalan.
+## How to use
+### Usage
+Requiered libraries:
+```bash
+pip install nemo_toolkit['all']
+```
+Clone the repository to download the model:
+```bash
+git clone https://huggingface.co/projecte-aina/stt-ca-citrinet-512
+```
+Given that `NEMO_PATH` is the path that points to the downloaded stt-ca-citrinet-512.nemo file, to do inference over a set of `.wav` files you should:
+```python
+# Load the model
+model = nemo_asr.models.EncDecCTCModel.restore_from(NEMO_PATH)
+# Create a list pointing to the audio files
+paths2audio_files = ["audio_1.wav", ..., "audio_n.wav"]
+# Fix the batch size to whatever number suits your purpose
+batch_size = 8
+transcriptions = model.transcribe(paths2audio_files=paths2audio_files,
+                                 batch_size=2)
+# Visualize the transcriptions
+print(transcriptions)
+```
 ## Additional information