metadata

title: Accent Classifier + Transcriber
emoji: 🎙️
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 4.20.0
app_file: app.py
pinned: false

Accent Classifier + Speech Transcriber

This Gradio app allows you to:

How to Use

Option 1: Upload an audio file

Option 2: Upload a video file

Option 3: Paste a direct .mp4 video URL

Loom, YouTube, Dropbox, or other webpage links (they don't serve real video files)
Download the video manually and upload it if needed

Transcription:

Accent Classification:

Handled automatically in Hugging Face Spaces. For local testing:

pip install gradio transformers torch moviepy requests safetensors soundfile scipy

You must also install ffmpeg:

Audio is extracted (if input is a video)
Audio is converted to .wav and resampled to 16kHz
Speech is transcribed using Whisper
Accent is classified using a Wav2Vec2 model
Output includes:
- Top accent prediction
- Confidence score
- Top-5 accent list
- Full transcription