Spaces:
Sleeping
A newer version of the Gradio SDK is available:
5.18.0
title: JULIA
emoji: 🔥
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.28.3
app_file: app.py
pinned: false
short_description: Voice Chat with JULIA
JULIA⚡
A personal assistant inspired by Tony Stark's JARVIS, built using Gradio, edge_tts, Hugging Face Inference API, and streaming STT (Speech-to-Text) with Nemo.
Features
- Voice and text input support
- Text-to-Speech response
- Multiple model support via Hugging Face Inference API
- Friendly and concise responses from a virtual assistant named Julia
How It Works
- Speech-to-Text (STT): Uses the
streaming_stt_nemo
library to transcribe audio inputs. - Text Generation: Uses models from Hugging Face Inference API to generate responses.
- Text-to-Speech (TTS): Uses
edge_tts
to convert the generated response into audio.
Interface
The Gradio interface includes:
- A dropdown to select the model.
- An audio input for voice commands.
- A text input for typed commands.
- A send button to submit the input.
- A text output to display the assistant's response.
- An audio output to play the assistant's response.
License
This project is licensed under the MIT License. See the LICENSE file for details.
Acknowledgments
Special thanks to the developers of Gradio, Hugging Face, edge_tts, and Nemo for their amazing libraries.
Sure! Here's a detailed description of each model available in the dropdown for your virtual assistant project:
Model Descriptions
1. Mixtral 8x7B
- Description: Mixtral 8x7B is a state-of-the-art large language model developed by Mistralai. It's designed to handle a variety of natural language understanding and generation tasks with high accuracy and coherence.
- Strengths:
- Excels in general knowledge and conversational tasks.
- Capable of producing detailed and contextually relevant responses.
- Use Cases: Suitable for detailed Q&A, storytelling, and providing comprehensive explanations.
2. Llama 3 8B
- Description: Llama 3 8B is the latest iteration in the Llama series developed by Meta. It focuses on generating human-like text based on the provided prompts.
- Strengths:
- Highly optimized for generating coherent and context-aware text.
- Efficient in understanding and maintaining conversation flow.
- Use Cases: Ideal for chatbots, creative writing, and interactive dialogues.
3. Mistral 7B v0.3
- Description: Mistral 7B v0.3 is a powerful language model developed by Mistralai, designed to perform well in both understanding and generating text across various domains.
- Strengths:
- High performance in both technical and casual conversational contexts.
- Robust in handling diverse topics and maintaining context over longer interactions.
- Use Cases: Best suited for customer support, technical assistance, and in-depth discussions.
4. Phi 3 mini
- Description: Phi 3 mini, developed by Microsoft, is a compact yet efficient language model optimized for fast and responsive text generation.
- Strengths:
- Lightweight and quick to respond.
- Maintains a good balance between performance and computational efficiency.
- Use Cases: Perfect for real-time applications, quick Q&A, and use cases where response time is critical.
Available Models
This project supports multiple language models to cater to different needs. Here are the details of each model:
Mixtral 8x7B
- Description: A state-of-the-art language model by Mistralai, designed for a wide range of natural language tasks.
- Strengths: Excels in general knowledge, conversational tasks, and detailed responses.
- Use Cases: Q&A, storytelling, detailed explanations.
Llama 3 8B
- Description: The latest in the Llama series by Meta, focusing on generating human-like text.
- Strengths: Highly coherent text generation, maintains conversation flow.
- Use Cases: Chatbots, creative writing, interactive dialogues.
Mistral 7B v0.3
- Description: A powerful model by Mistralai, capable of understanding and generating text across various domains.
- Strengths: High performance in technical and casual contexts, robust over longer interactions.
- Use Cases: Customer support, technical assistance, in-depth discussions.
Phi 3 mini
- Description: A compact model by Microsoft, optimized for fast and responsive text generation.
- Strengths: Lightweight, quick response times, efficient.
- Use Cases: Real-time applications, quick Q&A, scenarios requiring fast responses.
You can select any of these models based on your specific needs from the dropdown menu in the interface.