---
title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false
---

# MyIVR: Audio Transcription and Summarization

This Space provides a web interface for uploading an audio file, transcribing it to text, and generating a summary of the transcription.

## How It Works
1. Upload an audio file, for example a `.wav` or `.mp3`.
2. The app uses:
   - [Whisper](https://github.com/openai/whisper) for audio transcription.
   - [Hugging Face Transformers](https://huggingface.co/docs/transformers/index) for text summarization.
3. The transcription and summary are displayed in the interface.
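
The steps above can be sketched roughly as follows. This is a minimal illustration, not the Space's actual `app.py`: the function names (`load_transcriber`, `load_summarizer`, `process_audio`, `build_interface`), the Whisper model size `"base"`, and the summary length limits are all assumptions, and it presumes a `transformers` version that still offers the `"summarization"` pipeline task. Library imports are deferred into the loader functions so the core logic can be exercised without downloading any models.

```python
from typing import Callable, Tuple


def load_transcriber(model_name: str = "base") -> Callable[[str], str]:
    """Return a function mapping an audio file path to its transcript.

    The model size "base" is an assumption; the Space may use another.
    """
    import whisper  # installed by the openai-whisper package

    model = whisper.load_model(model_name)
    return lambda audio_path: model.transcribe(audio_path)["text"].strip()


def load_summarizer(model_name: str = "t5-small") -> Callable[[str], str]:
    """Return a function mapping text to a summary (length limits are assumed)."""
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_name)

    def summarize(text: str) -> str:
        result = summarizer(
            text, max_length=150, min_length=30, do_sample=False, truncation=True
        )
        return result[0]["summary_text"]

    return summarize


def process_audio(
    audio_path: str,
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> Tuple[str, str]:
    """Transcribe the audio, then summarize the transcript."""
    transcript = transcribe(audio_path)
    # Skip summarization when nothing was transcribed (e.g. silent audio).
    summary = summarize(transcript) if transcript else ""
    return transcript, summary


def build_interface(transcribe: Callable[[str], str], summarize: Callable[[str], str]):
    """Wrap the pipeline in a Gradio interface with one input and two outputs."""
    import gradio as gr

    return gr.Interface(
        fn=lambda path: process_audio(path, transcribe, summarize),
        inputs=gr.Audio(type="filepath"),
        outputs=[gr.Textbox(label="Transcription"), gr.Textbox(label="Summary")],
    )
```

Under these assumptions, an entry point could call `build_interface(load_transcriber(), load_summarizer()).launch()`. Passing the two callables in, rather than loading models inside `process_audio`, keeps model loading to a single startup step.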

## Features
- **Audio Transcription**: Convert speech to text.
- **Text Summarization**: Generate concise summaries of the transcription.

## Models Used
- **Whisper** (OpenAI): Handles the audio-to-text conversion.
- **T5-Small** (Google, loaded via Hugging Face Transformers): Summarizes the transcription.
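
T5-Small has a default input limit of 512 tokens, so a long transcription may be truncated before it is summarized. One possible workaround, shown here only as a sketch, is to split the transcript into word-based chunks and summarize each one. The helpers `chunk_words` and `summarize_long` and the 300-word chunk size are hypothetical and are not taken from the Space's code; word counts only approximate token counts.

```python
from typing import Callable, List


def chunk_words(text: str, max_words: int = 300) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


def summarize_long(
    text: str,
    summarize: Callable[[str], str],
    max_words: int = 300,
) -> str:
    """Summarize each chunk separately and join the partial summaries."""
    return " ".join(summarize(chunk) for chunk in chunk_words(text, max_words))
```

Here `summarize` stands for any function that maps text to a summary, such as a wrapper around a T5-Small summarization pipeline.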

## Requirements
Dependencies are listed in the `requirements.txt` file. The app uses:
- `gradio`
- `openai-whisper`
- `transformers`
- `torch`
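
When running the app outside of Spaces, a quick check like the one below can confirm that these packages are importable. It is a sketch only: the helper name `missing_packages` is hypothetical. Note that the `openai-whisper` package is imported under the module name `whisper`.

```python
from importlib.util import find_spec
from typing import Dict, List

# Maps each pip package name to the module name it is imported as.
REQUIRED: Dict[str, str] = {
    "gradio": "gradio",
    "openai-whisper": "whisper",
    "transformers": "transformers",
    "torch": "torch",
}


def missing_packages(required: Dict[str, str] = REQUIRED) -> List[str]:
    """Return the pip names of packages whose modules cannot be found."""
    return [pkg for pkg, module in required.items() if find_spec(module) is None]
```

Calling `missing_packages()` returns an empty list when everything is installed; any names it returns can be installed with `pip install -r requirements.txt`.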

## Usage
Visit the deployed app at the following link:  
[MyIVR on Hugging Face Spaces](https://huggingface.co/spaces/JabriA/MyIVR)

Upload an audio file, and the app will process it to provide a transcription and summary.