JULIA

Sleeping

App Files Files Community

JULIA / README.md

HarshanaLF

update

9665df1 7 months ago

preview code

raw

history blame contribute delete

4.75 kB

	---
	title: JULIA
	emoji: 🔥
	colorFrom: blue
	colorTo: green
	sdk: gradio
	sdk_version: 4.28.3
	app_file: app.py
	pinned: false
	short_description: Voice Chat with JULIA
	---

	# JULIA⚡

	A personal assistant inspired by Tony Stark's JARVIS, built using Gradio, edge_tts, Hugging Face Inference API, and streaming STT (Speech-to-Text) with Nemo.

	## Features

	- Voice and text input support
	- Text-to-Speech response
	- Multiple model support via Hugging Face Inference API
	- Friendly and concise responses from a virtual assistant named Julia

	## How It Works

	- Speech-to-Text (STT): Uses the `streaming_stt_nemo` library to transcribe audio inputs.
	- Text Generation: Uses models from Hugging Face Inference API to generate responses.
	- Text-to-Speech (TTS): Uses `edge_tts` to convert the generated response into audio.

	## Interface

	The Gradio interface includes:

	- A dropdown to select the model.
	- An audio input for voice commands.
	- A text input for typed commands.
	- A send button to submit the input.
	- A text output to display the assistant's response.
	- An audio output to play the assistant's response.

	## License

	This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.

	## Acknowledgments

	Special thanks to the developers of Gradio, Hugging Face, edge_tts, and Nemo for their amazing libraries.

	Sure! Here's a detailed description of each model available in the dropdown for your virtual assistant project:

	### Model Descriptions

	#### 1. Mixtral 8x7B

	- Description: Mixtral 8x7B is a state-of-the-art large language model developed by Mistralai. It's designed to handle a variety of natural language understanding and generation tasks with high accuracy and coherence.
	- Strengths:
	- Excels in general knowledge and conversational tasks.
	- Capable of producing detailed and contextually relevant responses.
	- Use Cases: Suitable for detailed Q&A, storytelling, and providing comprehensive explanations.

	#### 2. Llama 3 8B

	- Description: Llama 3 8B is the latest iteration in the Llama series developed by Meta. It focuses on generating human-like text based on the provided prompts.
	- Strengths:
	- Highly optimized for generating coherent and context-aware text.
	- Efficient in understanding and maintaining conversation flow.
	- Use Cases: Ideal for chatbots, creative writing, and interactive dialogues.

	#### 3. Mistral 7B v0.3

	- Description: Mistral 7B v0.3 is a powerful language model developed by Mistralai, designed to perform well in both understanding and generating text across various domains.
	- Strengths:
	- High performance in both technical and casual conversational contexts.
	- Robust in handling diverse topics and maintaining context over longer interactions.
	- Use Cases: Best suited for customer support, technical assistance, and in-depth discussions.

	#### 4. Phi 3 mini

	- Description: Phi 3 mini, developed by Microsoft, is a compact yet efficient language model optimized for fast and responsive text generation.
	- Strengths:
	- Lightweight and quick to respond.
	- Maintains a good balance between performance and computational efficiency.
	- Use Cases: Perfect for real-time applications, quick Q&A, and use cases where response time is critical.

	### Available Models

	This project supports multiple language models to cater to different needs. Here are the details of each model:

	1. Mixtral 8x7B

	- Description: A state-of-the-art language model by Mistralai, designed for a wide range of natural language tasks.
	- Strengths: Excels in general knowledge, conversational tasks, and detailed responses.
	- Use Cases: Q&A, storytelling, detailed explanations.

	2. Llama 3 8B

	- Description: The latest in the Llama series by Meta, focusing on generating human-like text.
	- Strengths: Highly coherent text generation, maintains conversation flow.
	- Use Cases: Chatbots, creative writing, interactive dialogues.

	3. Mistral 7B v0.3

	- Description: A powerful model by Mistralai, capable of understanding and generating text across various domains.
	- Strengths: High performance in technical and casual contexts, robust over longer interactions.
	- Use Cases: Customer support, technical assistance, in-depth discussions.

	4. Phi 3 mini
	- Description: A compact model by Microsoft, optimized for fast and responsive text generation.
	- Strengths: Lightweight, quick response times, efficient.
	- Use Cases: Real-time applications, quick Q&A, scenarios requiring fast responses.

	You can select any of these models based on your specific needs from the dropdown menu in the interface.