san2003m
/

whisper-small-atc

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

san2003m commited on Sep 30

Commit

18611ef

•

1 Parent(s): 7a2e03d

Update README.md

Files changed (1) hide show

README.md +44 -0

README.md CHANGED Viewed

@@ -77,3 +77,47 @@ The following hyperparameters were used during training:
 - Pytorch 2.2.2
 - Datasets 2.18.0
 - Tokenizers 0.15.2

 - Pytorch 2.2.2
 - Datasets 2.18.0
 - Tokenizers 0.15.2
+### Additional Information
+## Licensing Information
+The licensing status of the dataset hinges on the legal status of the UWB-ATCC corpus creators.
+They used Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) licensing.
+## Citation Information
+Contributors who prepared, processed, normalized and uploaded the dataset in HuggingFace:
+@article{zuluaga2022how,
+    title={How Does Pre-trained Wav2Vec2. 0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications},
+    author={Zuluaga-Gomez, Juan and Prasad, Amrutha and Nigmatulina, Iuliia and Sarfjoo, Saeed and others},
+    journal={IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar},
+    year={2022}
+  }
+@article{zuluaga2022bertraffic,
+  title={BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications},
+  author={Zuluaga-Gomez, Juan and Sarfjoo, Seyyed Saeed and Prasad, Amrutha and others},
+  journal={IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar},
+  year={2022}
+  }
+@article{zuluaga2022atco2,
+  title={ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications},
+  author={Zuluaga-Gomez, Juan and Vesel{\`y}, Karel and Sz{\"o}ke, Igor and Motlicek, Petr and others},
+  journal={arXiv preprint arXiv:2211.04054},
+  year={2022}
+}
+Authors of the dataset:
+@article{vsmidl2019air,
+  title={Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development},
+  author={{\v{S}}m{\'\i}dl, Lubo{\v{s}} and {\v{S}}vec, Jan and Tihelka, Daniel and Matou{\v{s}}ek, Jind{\v{r}}ich and Romportl, Jan and Ircing, Pavel},
+  journal={Language Resources and Evaluation},
+  volume={53},
+  number={3},
+  pages={449--464},
+  year={2019},
+  publisher={Springer}
+}