JULIA / README.md
HarshanaLF's picture
update
9665df1
---
title: JULIA
emoji: 🔥
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.28.3
app_file: app.py
pinned: false
short_description: Voice Chat with JULIA
---
# JULIA⚡
A personal assistant inspired by Tony Stark's JARVIS, built using Gradio, edge_tts, Hugging Face Inference API, and streaming STT (Speech-to-Text) with Nemo.
## Features
- Voice and text input support
- Text-to-Speech response
- Multiple model support via Hugging Face Inference API
- Friendly and concise responses from a virtual assistant named Julia
## How It Works
- **Speech-to-Text (STT):** Uses the `streaming_stt_nemo` library to transcribe audio inputs.
- **Text Generation:** Uses models from Hugging Face Inference API to generate responses.
- **Text-to-Speech (TTS):** Uses `edge_tts` to convert the generated response into audio.
## Interface
The Gradio interface includes:
- A dropdown to select the model.
- An audio input for voice commands.
- A text input for typed commands.
- A send button to submit the input.
- A text output to display the assistant's response.
- An audio output to play the assistant's response.
## License
This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.
## Acknowledgments
Special thanks to the developers of Gradio, Hugging Face, edge_tts, and Nemo for their amazing libraries.
Sure! Here's a detailed description of each model available in the dropdown for your virtual assistant project:
### Model Descriptions
#### 1. **Mixtral 8x7B**
- **Description:** Mixtral 8x7B is a state-of-the-art large language model developed by Mistralai. It's designed to handle a variety of natural language understanding and generation tasks with high accuracy and coherence.
- **Strengths:**
- Excels in general knowledge and conversational tasks.
- Capable of producing detailed and contextually relevant responses.
- **Use Cases:** Suitable for detailed Q&A, storytelling, and providing comprehensive explanations.
#### 2. **Llama 3 8B**
- **Description:** Llama 3 8B is the latest iteration in the Llama series developed by Meta. It focuses on generating human-like text based on the provided prompts.
- **Strengths:**
- Highly optimized for generating coherent and context-aware text.
- Efficient in understanding and maintaining conversation flow.
- **Use Cases:** Ideal for chatbots, creative writing, and interactive dialogues.
#### 3. **Mistral 7B v0.3**
- **Description:** Mistral 7B v0.3 is a powerful language model developed by Mistralai, designed to perform well in both understanding and generating text across various domains.
- **Strengths:**
- High performance in both technical and casual conversational contexts.
- Robust in handling diverse topics and maintaining context over longer interactions.
- **Use Cases:** Best suited for customer support, technical assistance, and in-depth discussions.
#### 4. **Phi 3 mini**
- **Description:** Phi 3 mini, developed by Microsoft, is a compact yet efficient language model optimized for fast and responsive text generation.
- **Strengths:**
- Lightweight and quick to respond.
- Maintains a good balance between performance and computational efficiency.
- **Use Cases:** Perfect for real-time applications, quick Q&A, and use cases where response time is critical.
### Available Models
This project supports multiple language models to cater to different needs. Here are the details of each model:
1. **Mixtral 8x7B**
- **Description:** A state-of-the-art language model by Mistralai, designed for a wide range of natural language tasks.
- **Strengths:** Excels in general knowledge, conversational tasks, and detailed responses.
- **Use Cases:** Q&A, storytelling, detailed explanations.
2. **Llama 3 8B**
- **Description:** The latest in the Llama series by Meta, focusing on generating human-like text.
- **Strengths:** Highly coherent text generation, maintains conversation flow.
- **Use Cases:** Chatbots, creative writing, interactive dialogues.
3. **Mistral 7B v0.3**
- **Description:** A powerful model by Mistralai, capable of understanding and generating text across various domains.
- **Strengths:** High performance in technical and casual contexts, robust over longer interactions.
- **Use Cases:** Customer support, technical assistance, in-depth discussions.
4. **Phi 3 mini**
- **Description:** A compact model by Microsoft, optimized for fast and responsive text generation.
- **Strengths:** Lightweight, quick response times, efficient.
- **Use Cases:** Real-time applications, quick Q&A, scenarios requiring fast responses.
You can select any of these models based on your specific needs from the dropdown menu in the interface.