Muhammed Essam

Switch to Whisper base

e459b45 20 days ago

title: Voice Assistant - Multi-language Division Matching & Contact Search
emoji: 🎙️
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 4.0.0
app_file: app.py
pinned: false
license: mit

🎙️ Voice Assistant Demo

A multi-language voice assistant that helps users find divisions and contacts within an organization using natural-language queries, either typed or spoken.

🌟 Features

🗣️ Multi-language Voice Input

99+ languages supported through Whisper, with the spoken language detected automatically
Automatic speech-to-text using OpenAI Whisper
Arabic-to-English translation for seamless processing
Works with various audio formats
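
As a rough illustration of the speech-to-text step, the sketch below uses the Hugging Face `transformers` speech recognition pipeline with `openai/whisper-base`. The helper names `build_generate_kwargs` and `transcribe` are hypothetical, and whether the app calls Whisper this way is an assumption. Whisper's `translate` task outputs English text regardless of the spoken language, which would cover the Arabic-to-English case.

```python
from typing import Optional


def build_generate_kwargs(translate_to_english: bool,
                          language: Optional[str] = None) -> dict:
    """Build Whisper generation options (hypothetical helper).

    task="translate" asks Whisper for English output whatever the spoken
    language is; task="transcribe" keeps the spoken language.
    """
    kwargs = {"task": "translate" if translate_to_english else "transcribe"}
    if language is not None:
        # Optional hint; when omitted, Whisper detects the language itself.
        kwargs["language"] = language
    return kwargs


def transcribe(audio_path: str, translate_to_english: bool = True) -> str:
    """Transcribe an audio file with openai/whisper-base (assumed setup)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model="openai/whisper-base")
    result = asr(audio_path,
                 generate_kwargs=build_generate_kwargs(translate_to_english))
    return result["text"].strip()
```

In a real app the pipeline would be created once at startup rather than on every call, since loading the model dominates the cost.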

🎯 Smart Division Matching

Semantic search using sentence embeddings
Confidence-based routing with intelligent thresholds
Department-level expansion (searches all divisions in a department)
Fast matching (~50ms) using all-MiniLM-L6-v2
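
The matching idea above can be sketched as cosine similarity between a query embedding and one embedding per division, followed by a threshold check. This is a minimal sketch, not the app's code: the function names, the `accept` and `reject` thresholds, and the three decision labels are all assumptions chosen for illustration.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def embed(texts: List[str]) -> List[List[float]]:
    """Embed texts with all-MiniLM-L6-v2 (loaded lazily; assumed setup)."""
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
    return model.encode(texts).tolist()


def route(query_vec: Sequence[float],
          division_vecs: Dict[str, Sequence[float]],
          accept: float = 0.6,   # hypothetical threshold
          reject: float = 0.3    # hypothetical threshold
          ) -> Tuple[str, str, float]:
    """Return (decision, best_division, score) for a query embedding."""
    best_name, best_score = max(
        ((name, cosine(query_vec, vec)) for name, vec in division_vecs.items()),
        key=lambda pair: pair[1],
    )
    if best_score >= accept:
        decision = "match"        # confident enough to route directly
    elif best_score >= reject:
        decision = "ask_user"     # plausible, but worth confirming
    else:
        decision = "no_match"
    return decision, best_name, best_score
```

Division embeddings only need to be computed once and can be cached, so each query costs a single `embed` call plus a handful of dot products.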

👤 Name Extraction

Extracts person names from queries using GLiNER
Supports English and Arabic names
Zero-shot NER for robust extraction
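
For readers unfamiliar with GLiNER, here is a minimal sketch of zero-shot person-name extraction with `urchade/gliner_small-v2.1`. The helper names and the 0.5 score cutoff are assumptions; the entity dictionaries follow the shape returned by GLiNER's `predict_entities` (`text`, `label`, `score`).

```python
from typing import Dict, List


def person_names(entities: List[Dict], min_score: float = 0.5) -> List[str]:
    """Keep unique person names, in order, from GLiNER-style entity dicts."""
    seen = set()
    names = []
    for ent in entities:
        if ent.get("label") != "person" or ent.get("score", 0.0) < min_score:
            continue
        name = ent["text"].strip()
        if name.casefold() not in seen:
            seen.add(name.casefold())
            names.append(name)
    return names


def extract_names(query: str) -> List[str]:
    """Run GLiNER on a query and return person names (assumed setup)."""
    # Imported lazily so person_names() works without gliner installed.
    from gliner import GLiNER

    model = GLiNER.from_pretrained("urchade/gliner_small-v2.1")
    entities = model.predict_entities(query, ["person"], threshold=0.5)
    return person_names(entities)
```

Because the label list is passed at inference time, no fine-tuning is needed to ask for `person` entities, which is what "zero-shot" refers to here.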

📞 Contact Search

500+ contacts across 23 departments and 67 divisions
Intelligent matching combining name and division
Confidence scoring with match reasoning
Fuzzy name matching for typos and variations
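
To show how fuzzy name matching can tolerate typos and spelling variations, the sketch below scores names with the standard library's `difflib.SequenceMatcher`. Using `difflib` is an assumption (the app's actual matcher is not documented here), and the function names are hypothetical; the 0.75 threshold mirrors the 75% figure listed under Match Reasons.

```python
from difflib import SequenceMatcher
from typing import List, Tuple


def name_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1] between two names."""
    return SequenceMatcher(None, a.casefold().strip(), b.casefold().strip()).ratio()


def fuzzy_find(query_name: str, contacts: List[str],
               threshold: float = 0.75) -> List[Tuple[str, float]]:
    """Return (contact, score) pairs at or above the threshold, best first."""
    scored = [(contact, name_similarity(query_name, contact)) for contact in contacts]
    hits = [(contact, score) for contact, score in scored if score >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

For example, `fuzzy_find("Rashid", ["Ahmed Al-Malek", "Dima", "Rashed"])` keeps only `"Rashed"`, since the other names fall well below the threshold.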

🚀 How to Use

Division Matching (Text)

Find the right division for your query:

"I need help from IT Security"
"Find someone in Finance"
"Connect me to Human Resources"

Division Matching (Voice)

Speak your query in any supported language; it is transcribed and processed automatically.

Contact Search (Text)

Search for specific people or teams:

"Find Dima in Information Technology"
"Ahmed Al-Malek"
"I need to talk to someone in Legal"

Contact Search (Voice)

Speak your contact search query in any supported language.

📊 Example Queries

Department-Level Queries

These queries search across ALL divisions in a department:

✅ "Find someone in Information Technology" → Searches 8 IT divisions
✅ "I need help from Finance" → Searches all Finance divisions
✅ "Connect me to Human Resources" → Searches all HR divisions
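
Department-level expansion can be pictured as a lookup from a department name to its divisions. The sketch below is illustrative only: the `DEPARTMENTS` mapping is a tiny hypothetical sample (the Finance division names are placeholders, and grouping the two IT divisions under Information Technology is an assumption), and `expand_target` is not a function from the app.

```python
from typing import Dict, List

# Hypothetical sample; the real directory has 23 departments and 67 divisions.
DEPARTMENTS: Dict[str, List[str]] = {
    "Information Technology": [
        "Applications Development & Integrations",
        "IT Security Implementation & Operations",
    ],
    "Finance": ["Finance Division A", "Finance Division B"],  # placeholder names
}


def expand_target(target: str) -> List[str]:
    """Return the divisions to search for a matched department or division."""
    if target in DEPARTMENTS:
        # Department-level query: search every division in the department.
        return list(DEPARTMENTS[target])
    for divisions in DEPARTMENTS.values():
        if target in divisions:
            # Division-level query: search only that division.
            return [target]
    return []
```

With this sample, `expand_target("Information Technology")` returns both IT divisions, while `expand_target("IT Security Implementation & Operations")` returns just that one.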

Division-Level Queries

These match specific divisions:

✅ "Find Ahmed in App Dev" → Applications Development & Integrations
✅ "I need help from IT Security" → IT Security Implementation & Operations
✅ "Connect me to Legal" → Legal divisions

Name-Only Queries

✅ "Find Dima" → Searches all contacts named Dima
✅ "Ahmed Al-Malek" → Exact name match
✅ "I need to talk to Rashed" → Fuzzy name matching

Combined Queries (Name + Department/Division)

When a query names both a person and a department or division, division accuracy takes priority:

✅ "Find Dima in Information Technology" → Perfect match (confidence: 1.00)
✅ "Find Ahmed in App Dev" → Shows App Dev team members

🔧 Technical Details

Models Used

Embeddings: sentence-transformers/all-MiniLM-L6-v2 - Fast, lightweight semantic search
Name Extraction: urchade/gliner_small-v2.1 - Zero-shot NER for person names
Speech-to-Text: openai/whisper-base - Small enough to run on CPU while keeping good accuracy

Confidence Scoring

Score	Meaning	Example
1.00	Perfect match (name + division)	Dima in IT
0.95	Exact name match	Ahmed Al-Malek
0.66	Strong division match	People in requested division
0.59	Good division match	Close division match
< 0.30	Low confidence	Wrong division penalty

Match Reasons

name_and_division_match - Both name AND division match ✅
division_match - Division/department matches (no name match)
exact_name_match - Exact name match (100%)
fuzzy_name_match - Partial name match (similarity of 75% or higher)
name_match_wrong_division - Name matches but WRONG division ⚠️
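
The five reasons above can be read as a small decision procedure. The sketch below is one possible encoding, assuming a name similarity in the range 0 to 1 with 1.0 meaning exact and 0.75 as the fuzzy cutoff; the function name and its parameters are hypothetical.

```python
from typing import Optional


def match_reason(name_score: Optional[float],
                 division_requested: bool,
                 division_matches: bool) -> Optional[str]:
    """Map match signals to one of the documented reason strings.

    name_score is None when the query contains no name.
    """
    name_hit = name_score is not None and name_score >= 0.75
    if name_hit and division_requested:
        if division_matches:
            return "name_and_division_match"
        return "name_match_wrong_division"
    if name_hit:
        return "exact_name_match" if name_score >= 1.0 else "fuzzy_name_match"
    if division_requested and division_matches:
        return "division_match"
    return None  # nothing matched well enough to report
```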

📦 Database Stats

500 contacts across the organization
23 departments (Information Technology, Finance, HR, etc.)
67 divisions (specific teams and units)
Multi-language support (English + Arabic names)

🌍 Supported Languages

The voice assistant supports 99+ languages including:

English
Arabic (العربية)
Spanish, French, German, Italian
Chinese (中文), Japanese (日本語), Korean (한국어)
Hindi, Urdu, Bengali
And many more...

Language is automatically detected - just speak naturally!

⚡ Performance

Division Matching: ~50ms per query
Name Extraction: ~100-200ms per query
Voice Processing: ~1-3 seconds (depends on audio length)
Contact Search: ~100-300ms per query
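
Latency figures like these depend on hardware, so it can help to measure them locally. The helper below is a generic timing sketch using only the standard library; `median_latency_ms` is a hypothetical name, and the callable you pass in stands for whichever step you want to measure.

```python
import time
from statistics import median
from typing import Callable


def median_latency_ms(fn: Callable[[], object],
                      runs: int = 20, warmup: int = 2) -> float:
    """Median wall-clock time of fn() in milliseconds over several runs."""
    for _ in range(warmup):
        fn()  # warm caches and lazy model loading before timing
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return median(samples)
```

The median is used instead of the mean so that a single slow run, for example the first call after a model load, does not skew the result.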

🛠️ Built With

Gradio - Interactive web interface
FastAPI - Backend API (original implementation)
Sentence Transformers - Semantic search
OpenAI Whisper - Speech recognition
GLiNER - Named Entity Recognition
PyTorch - Deep learning framework

📝 License

MIT License

🙏 Acknowledgments

OpenAI for Whisper
Hugging Face for model hosting
urchade for GLiNER
Sentence Transformers team

Version: 4.0.0
Status: ✅ Production Ready
Demo Type: Interactive Gradio Demo