A newer version of the Gradio SDK is available:
5.44.1
metadata
title: Whisper Speech Transcription
emoji: 🎙️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.41.1
app_file: app.py
pinned: false
license: mit
short_description: use finetuned s2t model
Whisper Speech Transcription
AI-powered speech-to-text with timestamps using fine-tuned Whisper model.
Features
- Upload audio files (up to 3 minutes)
- Record voice directly
- Get timestamped transcriptions
- Download JSON and SRT formats
- Optimized for English speech
Usage
- Choose upload or record option
- Process your audio (max 3 minutes)
- View transcription with timestamps
- Download results in multiple formats
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference