Spaces:

ajnx014
/

wave-app

Sleeping

App Files Files Community

wave-app / README.md

ajnx014

Update README.md

7e35a51 verified 10 months ago

preview code

raw

history blame contribute delete

2.83 kB

A newer version of the Gradio SDK is available: 6.0.2

Upgrade

metadata

title: Wave App
emoji: 💻
colorFrom: pink
colorTo: indigo
sdk: gradio
sdk_version: 5.17.1
app_file: app.py
pinned: false
license: mit
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/63bc1e06b8c61b8aa4963cf2/cPoSjWQgu66JrLyQ8HVM3.png
short_description: Built using Gradio, Librosa, and Resemblyzer. This applicati

🎙️ Wave: Voice Recognition with Similarity Testing

Welcome to the Wave: A Voice Recognition application with Similarity Testing project, built using Gradio, Librosa, and Resemblyzer. This application compares uploaded voice samples against reference embeddings to determine similarity, making it ideal for voice authentication and verification tasks.

🚀 Key Features

Real-time Voice Verification: Instantly compares a test voice against reference samples.
Multi-File Training: Upload up to 50 audio samples for robust training.
Similarity Scoring: Generates a similarity score, with results interpreted as a match or mismatch.
User-Friendly Interface: Powered by Gradio, ensuring a seamless and interactive experience.

🛠️ Technology Stack

Framework: Gradio
Audio Processing: Librosa
Voice Embeddings: Resemblyzer
Numerical Computations: NumPy
Audio File Handling: SoundFile

📁 File Structure

voice-recognition-app
│
├── app.ipynb             # Main application notebook
├── requirements.txt      # Required packages
└── README.md             # Project documentation

💾 Usage

Train the Model: Upload up to 50 .wav files as reference samples.
Test a Voice: Upload a single .wav file and receive a similarity score.
Interpret Results: Scores above 0.80 indicate a close match.

📦 Dependencies

gradio
librosa
resemblyzer
numpy
soundfile

🧠 How It Works

Audio Loading: Files are loaded and resampled to 16 kHz.
Voice Embeddings: The Resemblyzer extracts embeddings that represent vocal characteristics.
Similarity Calculation: The dot product of normalized embeddings produces the similarity score.

🌐 Access Live Demo

🔗 wave-app

📝 License

This project is licensed under the MIT License.

🤝 Contributing

Feel free to contribute! Fork the repository, create a new branch, and submit a pull request.

📧 Contact

For inquiries or support, please reach out to arjunjagdale14@gmail.com.

Author: Arjun Jagdale
GitHub: ArjunJagdale
Project: Voice Recognition with Similarity Testing