Spaces:

ajnx014
/

wave-app

Sleeping

App Files Files Community

ajnx014 commited on Feb 23

Commit

3daa38c

verified ·

1 Parent(s): 5608b3d

Update README.md

Browse files

Files changed (1) hide show

README.md +85 -1

README.md CHANGED Viewed

@@ -8,4 +8,88 @@ sdk_version: 5.17.1
 app_file: app.py
 pinned: false
 license: mit
----

 app_file: app.py
 pinned: false
 license: mit
+---
+# 🎙️ Voice Recognition with Similarity Testing
+Welcome to the **Voice Recognition with Similarity Testing** project, built using **Gradio**, **Librosa**, and **Resemblyzer**. This application compares uploaded voice samples against reference embeddings to determine similarity, making it ideal for voice authentication and verification tasks.
+---
+## 🚀 **Key Features**
+- **Real-time Voice Verification:** Instantly compares a test voice against reference samples.
+- **Multi-File Training:** Upload up to **50** audio samples for robust training.
+- **Similarity Scoring:** Generates a similarity score, with results interpreted as a match or mismatch.
+- **User-Friendly Interface:** Powered by **Gradio**, ensuring a seamless and interactive experience.
+---
+## 🛠️ **Technology Stack**
+- **Framework:** Gradio
+- **Audio Processing:** Librosa
+- **Voice Embeddings:** Resemblyzer
+- **Numerical Computations:** NumPy
+- **Audio File Handling:** SoundFile
+---
+## 📁 **File Structure**
+```plaintext
+voice-recognition-app
+│
+├── app.ipynb             # Main application notebook
+├── requirements.txt      # Required packages
+└── README.md             # Project documentation
+```
+---
+## 💾 **Usage**
+1. **Train the Model:** Upload up to **50** `.wav` files as reference samples.
+2. **Test a Voice:** Upload a single `.wav` file and receive a similarity score.
+3. **Interpret Results:** Scores above **0.80** indicate a close match.
+---
+## 📦 **Dependencies**
+```plaintext
+gradio
+librosa
+resemblyzer
+numpy
+soundfile
+```
+---
+## 🧠 **How It Works**
+1. **Audio Loading:** Files are loaded and resampled to **16 kHz**.
+2. **Voice Embeddings:** The **Resemblyzer** extracts embeddings that represent vocal characteristics.
+3. **Similarity Calculation:** The dot product of normalized embeddings produces the similarity score.
+---
+## 🌐 **Access Live Demo**
+🔗 [wave-app](https://huggingface.co/spaces/ajnx014/voice-recognition-app)
+---
+## 📝 **License**
+This project is licensed under the **MIT License**.
+---
+## 🤝 **Contributing**
+Feel free to contribute! Fork the repository, create a new branch, and submit a pull request.
+---
+## 📧 **Contact**
+For inquiries or support, please reach out to **[arjunjagdale14@gmail.com](mailto:arjunjagdale14@gmail.com)**.
+---
+> **Author:** Arjun Jagdale
+> **GitHub:** [ArjunJagdale](https://github.com/ArjunJagdale)
+> **Project:** Voice Recognition with Similarity Testing