esc-bench
/

conformer-rnnt-voxpopuli

Model card Files Files and versions Community

sanchit-gandhi HF staff commited on Oct 24, 2022

Commit

b1131a9

•

1 Parent(s): d450087

Correct scripts

Files changed (2) hide show

README.md +5 -4
run_voxpopuli.sh +1 -1

README.md CHANGED Viewed

@@ -2,17 +2,18 @@
 language:
 - en
 tags:
-- esc
 datasets:
-- voxpopuli
 ---
-To reproduce this run, execute:
 ```python
 #!/usr/bin/env bash
 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
         --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
         --model_name_or_path="stt_en_conformer_transducer_xlarge" \
-        --dataset_name="esc-benchmark/esc-datasets" \
         --tokenizer_path="tokenizer" \
         --vocab_size="1024" \
         --max_steps="100000" \

 language:
 - en
 tags:
+- esb
 datasets:
+- esb/datasets
+- facebook/voxpopuli
 ---
+To reproduce this run, first install NVIDIA NeMo according to the [official instructions](https://github.com/NVIDIA/NeMo#installation), then execute:
 ```python
 #!/usr/bin/env bash
 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
         --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
         --model_name_or_path="stt_en_conformer_transducer_xlarge" \
+        --dataset_name="esb/datasets" \
         --tokenizer_path="tokenizer" \
         --vocab_size="1024" \
         --max_steps="100000" \

run_voxpopuli.sh CHANGED Viewed

@@ -2,7 +2,7 @@
 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
         --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
         --model_name_or_path="stt_en_conformer_transducer_xlarge" \
-        --dataset_name="esc-benchmark/esc-datasets" \
         --tokenizer_path="tokenizer" \
         --vocab_size="1024" \
         --max_steps="100000" \

 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
         --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
         --model_name_or_path="stt_en_conformer_transducer_xlarge" \
+        --dataset_name="esb/datasets" \
         --tokenizer_path="tokenizer" \
         --vocab_size="1024" \
         --max_steps="100000" \