File size: 2,907 Bytes
1928598
 
87f167e
1928598
 
 
 
 
 
 
 
 
87f167e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
title: Speech Resource Finder
emoji: 🧭
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
short_description: 'Discover ASR and TTS support and resources for any language '
---

# Speech Resource Finder

## Description

Almost 4 billion people speak languages with little or no speech technology support. This tool makes visible which languages have resources available and which communities are being left behind in the speech AI revolution.

Built by CLEAR Global to support language inclusion and help close the digital language divide.

## Data Sources

### Commercial Speech Services

Commercial service support is automatically pulled from the language support page of each service provider.

- **Azure Speech Services** - [Speech-to-Text](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=stt) | [Text-to-Speech](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=tts)
- **Google Cloud Speech** - [Speech-to-Text](https://cloud.google.com/speech-to-text/docs/speech-to-text-supported-languages) | [Text-to-Speech](https://cloud.google.com/text-to-speech/docs/voices)
- **AWS** - [Transcribe](https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html) | [Polly](https://docs.aws.amazon.com/polly/latest/dg/supported-languages.html)
- **ElevenLabs** - [Multilingual v2](https://elevenlabs.io/docs/models#multilingual-v2) | [Turbo v3](https://elevenlabs.io/docs/models#eleven-v3-alpha)

### Open Source Resources
- **HuggingFace Models** - Pre-trained speech models sorted by downloads
  - [ASR Models](https://huggingface.co/models?pipeline_tag=automatic-speech-recognition)
  - [TTS Models](https://huggingface.co/models?pipeline_tag=text-to-speech)
- **HuggingFace Datasets** - Speech corpora for training and evaluation
  - [ASR Datasets](https://huggingface.co/datasets?task_categories=task_categories:automatic-speech-recognition)
  - [TTS Datasets](https://huggingface.co/datasets?task_categories=task_categories:text-to-speech)

## How to Use
1. Select a language from the dropdown (type to search by name or ISO code)
2. Toggle model deduplication if desired (enabled by default)
3. Review results: commercial availability, models, and datasets
4. Click model/dataset names to open on HuggingFace

## Disclaimer

- Currently lists only 487 languages and is taken from this [Github repository](https://github.com/datasets/language-codes).
- Data fetched in real-time and can change.
- This is not an exhaustive list. There are other commercial voice technology providers and dataset/model resources that this app doesn't cover.
- Deduplication discards models with same name uploaded by others and keeps the most downloaded version in the list.

## Feedback

We would love to hear your feedback and suggestions. Please write us at tech@clearglobal.org.