metadata

title: Whisper Speech Transcription
emoji: 🎙️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.41.1
app_file: app.py
pinned: false
license: mit
short_description: Speech-to-text with a fine-tuned Whisper model

Whisper Speech Transcription

AI-powered speech-to-text with timestamps, using a fine-tuned Whisper model.

Features

Upload audio files (up to 3 minutes)
Record voice directly
Get timestamped transcriptions
Download results as JSON or SRT
Optimized for English speech
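
To illustrate how timestamped output maps onto the two download formats, here is a minimal sketch. It assumes each segment is a dict with `start` and `end` times in seconds and a `text` field; that shape, and the helper names `format_srt_time` and `segments_to_srt`, are hypothetical and not taken from this Space's `app.py`.

```python
import json


def format_srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_time(seg["start"])
        end = format_srt_time(seg["end"])
        cues.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(cues)


# Hypothetical example data; the real app's segment format may differ.
segments = [
    {"start": 0.0, "end": 2.5, "text": "Hello and welcome."},
    {"start": 2.5, "end": 5.04, "text": "This is a test."},
]

srt_text = segments_to_srt(segments)         # contents of an .srt file
json_text = json.dumps(segments, indent=2)   # contents of a .json file
```

The same segment list feeds both formats, so the JSON and SRT downloads stay consistent with each other.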

Usage

Choose whether to upload a file or record audio
Process your audio (max 3 minutes)
View transcription with timestamps
Download the results as JSON or SRT
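
The 3-minute limit in the steps above could be enforced with a check along these lines. This is a sketch under assumptions: it supposes the audio arrives as a sample rate plus a sample count (Gradio's `Audio` component with `type="numpy"` provides a `(sample_rate, samples)` tuple), and the names `MAX_SECONDS`, `audio_duration_seconds`, and `is_within_limit` are hypothetical rather than taken from this Space.

```python
MAX_SECONDS = 180  # the 3-minute limit stated in the steps above


def audio_duration_seconds(sample_rate: int, num_samples: int) -> float:
    """Return the clip length in seconds from its sample rate and sample count."""
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    return num_samples / sample_rate


def is_within_limit(sample_rate: int, num_samples: int,
                    max_seconds: float = MAX_SECONDS) -> bool:
    """Return True if the clip is no longer than max_seconds."""
    return audio_duration_seconds(sample_rate, num_samples) <= max_seconds
```

For example, a 16 kHz recording with 32,000 samples lasts 2 seconds and passes the check, while any clip with more than 16,000 × 180 samples at that rate would be rejected before transcription.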

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference