Papers
arxiv:2406.17363
Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation
Published on Jun 25
Authors:
Abstract
This paper describes our system submission to the International Conference on Spoken Language Translation (IWSLT 2024) for Irish-to-English speech translation. We built end-to-end systems based on Whisper, and employed a number of data augmentation techniques, such as speech back-translation and noise augmentation. We investigate the effect of using synthetic audio data and discuss several methods for enriching signal diversity.
Models citing this paper 0
No model linking this paper
Cite arxiv.org/abs/2406.17363 in a model README.md to link it from this page.
Datasets citing this paper 4
Spaces citing this paper 0
No Space linking this paper
Cite arxiv.org/abs/2406.17363 in a Space README.md to link it from this page.
Collections including this paper 0
No Collection including this paper
Add this paper to a
collection
to link it from this page.