---
language:
- it
library_name: nemo
datasets:
- mozilla-foundation/common_voice_12_0
tags:
- automatic-speech-recognition
model-index:
- name: stt_it_citrinet_512_gamma_0_25
  results:
  - task:
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: Mozilla Common Voice 12.0
      type: mozilla-foundation/common_voice_12_0
      config: clean
      split: test
      args:
        language: it
    metrics:
    - name: Test WER
      type: wer
      value: 9.232
license: bsd-3-clause
---

# NVIDIA Streaming Citrinet 512 (it-IT)

| [![Model architecture](https://img.shields.io/badge/Model_Arch-Citrinet--CTC-lightgrey#model-badge)](#model-architecture) | [![Model size](https://img.shields.io/badge/Params-36M-lightgrey#model-badge)](#model-architecture) | [![Language](https://img.shields.io/badge/Language-it--IT-lightgrey#model-badge)](#datasets) |

## Attribution

The initial checkpoint was [stt_en_citrinet_512_gamma_0_25](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_en_citrinet_512_gamma_0_25) by [NVIDIA](https://github.com/NVIDIA), licensed under [CC-BY-4.0](https://creativecommons.org/licenses/by/4.0/).