Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks
Paper
•
2210.14712
•
Published
Translation quality estimation, machine translation, multilingual speech technology, speech recognition, speech synthesis, language identification, local language dialogue systems, multimodal language models