Spaces:

DheivaCodes
/

Multilingual-translator

Sleeping

App Files Files Community

DheivaCodes commited on Jul 20

Commit

4d9c678

verified ·

1 Parent(s): 84eaec5

Update README.md

Browse files

Files changed (1) hide show

README.md +89 -10

README.md CHANGED Viewed

@@ -1,13 +1,92 @@
 ---
-title: Multilingual Translator
-emoji: ⚡
-colorFrom: blue
-colorTo: pink
-sdk: gradio
-sdk_version: 5.38.0
-app_file: app.py
-pinned: false
-short_description: Multilingual Translator with Semantic Search and BLEU Evalua
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# 🌍 Multilingual Translator + Semantic Search (Enhanced)
+This project is a smart multilingual translator web app that offers:
+- ✅ **Automatic language detection**
+- 🌐 **High-quality translation** between Indian and foreign languages
+- 🧠 **Semantic search** to find similar Sanskrit-based concepts
+- 📊 **Optional BLEU score evaluation** (with human reference)
+- 📄 **Downloadable report** summarizing the output
+- 🚫 **Input length handling** to avoid translation errors
+> Developed using Hugging Face Transformers, Sentence Transformers, FAISS, and Gradio — and deployable to Hugging Face Spaces.
+---
+## ⚠️ Input Limit Notice
+Please enter **up to 3 lines** or **2000 characters** maximum.
+- If input is too long, the app will show an error and skip translation.
 ---
+## 🚀 Live Demo
+🔗 [Click here to try the app on Hugging Face Spaces](https://huggingface.co/spaces/jeevitha-app/Multilingual-translator)
 ---
+## 🔧 Features
+| Feature | Description |
+|--------|-------------|
+| **Language Detection** | Auto-identifies input language using `xlm-roberta-base-language-detection` |
+| **Translation** | Uses Facebook’s `NLLB-200-distilled-600M` model |
+| **Semantic Search** | Finds similar Sanskrit concepts using Sentence Transformers + FAISS |
+| **BLEU Score** | Optional evaluation metric (if human reference is provided) |
+| **Semantic Plot** | Horizontal bar chart for top 3 semantic similarity scores |
+| **Download Report** | Creates a `.txt` file (includes all outputs + BLEU score) |
+| **Error Handling** | Graceful messages for empty or long input |
+---
+## 🌐 Supported Languages
+| Code       | Language  |
+|------------|-----------|
+| eng_Latn   | English   |
+| hin_Deva   | Hindi     |
+| tam_Taml   | Tamil     |
+| tel_Telu   | Telugu    |
+| san_Deva   | Sanskrit  |
+| fra_Latn   | French    |
+| spa_Latn   | Spanish   |
+| deu_Latn   | German    |
+| jpn_Jpan   | Japanese  |
+| zho_Hans   | Chinese   |
+| arb_Arab   | Arabic    |
+---
+## 📄 Downloadable Report
+The app generates a `.txt` file containing:
+- Detected source language
+- Translated output
+- Semantic matches (with similarity scores)
+- BLEU score (if a human reference translation is given)
+---
+## 🚧 Future Enhancements
+- 🎙️ Speech-to-text input support
+- 🔊 Text-to-speech audio output
+- 📸 OCR: Translate text from uploaded images
+- 🆕 Add more Indian languages and transliteration features
+---
+## 👩‍💻 Author
+**Jeevitha Meenakshisundaram**
+M.Sc. Data Science, SASTRA University
+---
+## 📜 License
+This project is licensed under the **MIT License**.