File size: 1,314 Bytes
8597163 105a56b 8597163 6d88267 e392177 6d88267 cf75eeb 6d88267 cf75eeb 6d88267 105a56b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 |
---
title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false
---
# MyIVR: Audio Transcription and Summarization
This Space provides a simple web-based interface to upload audio files, transcribe them into text, and generate a summarized version of the transcription.
## How It Works
1. Upload an audio file in formats like `.wav` or `.mp3`.
2. The app uses:
- [Whisper](https://github.com/openai/whisper) for audio transcription.
- [Hugging Face Transformers](https://huggingface.co/docs/transformers/index) for text summarization.
3. Get the transcription and summary displayed on the interface.
## Features
- **Audio Transcription**: Convert speech to text.
- **Text Summarization**: Generate concise summaries of the transcription.
## Models Used
- **Whisper** (OpenAI): Handles the audio-to-text conversion.
- **T5-Small** (Hugging Face): Summarizes the transcription.
## Requirements
Dependencies are listed in the `requirements.txt` file. The app uses:
- `gradio`
- `openai-whisper`
- `transformers`
- `torch`
## Usage
Visit the deployed app at the following link:
[MyIVR on Hugging Face Spaces](https://huggingface.co/spaces/JabriA/MyIVR)
Upload an audio file, and the app will process it to provide a transcription and summary. |