A newer version of the Gradio SDK is available:
6.0.2
metadata
title: Speech Resource Finder
emoji: 🧭
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
short_description: 'Discover ASR and TTS support and resources for any language '
Speech Resource Finder
Description
Almost 4 billion people speak languages with little or no speech technology support. This tool makes visible which languages have resources available and which communities are being left behind in the speech AI revolution.
Built by CLEAR Global to support language inclusion and help close the digital language divide.
Data Sources
Commercial Speech Services
Commercial service support is automatically pulled from the language support page of each service provider.
- Azure Speech Services - Speech-to-Text | Text-to-Speech
- Google Cloud Speech - Speech-to-Text | Text-to-Speech
- AWS - Transcribe | Polly
- ElevenLabs - Multilingual v2 | Turbo v3
Open Source Resources
- HuggingFace Models - Pre-trained speech models sorted by downloads
- HuggingFace Datasets - Speech corpora for training and evaluation
How to Use
- Select a language from the dropdown (type to search by name or ISO code)
- Toggle model deduplication if desired (enabled by default)
- Review results: commercial availability, models, and datasets
- Click model/dataset names to open on HuggingFace
Disclaimer
- Currently lists only 487 languages and is taken from this Github repository.
- Data fetched in real-time and can change.
- This is not an exhaustive list. There are other commercial voice technology providers and dataset/model resources that this app doesn't cover.
- Deduplication discards models with same name uploaded by others and keeps the most downloaded version in the list.
Feedback
We would love to hear your feedback and suggestions. Please write us at tech@clearglobal.org.