Post
612
π£ New Project Alert: Phi 3.5 Multimodal AI Demo π
Excited to share my latest project that combines the power of Phi 3.5 text and vision models with text-to-speech capabilities!
π Key Features:
1οΈβ£ Phi 3.5 Text Model for dynamic conversations
2οΈβ£ Phi 3.5 Vision Model for advanced image analysis
3οΈβ£ Text-to-Speech integration for an audio dimension
π οΈ Tech Stack:
Transformers
Gradio
PyTorch
Flash Attention 2
Parler TTS
This project demonstrates the potential of integrating multiple AI models to create a more comprehensive and interactive user experience. It's a step towards more natural and versatile AI assistants.
π Check out the demo and let me know your thoughts! How would you extend this project?
π Demo Link: sagar007/Multimodal_App
#MultimodalAI #PhiModel #MachineLearning #AIDemo
Excited to share my latest project that combines the power of Phi 3.5 text and vision models with text-to-speech capabilities!
π Key Features:
1οΈβ£ Phi 3.5 Text Model for dynamic conversations
2οΈβ£ Phi 3.5 Vision Model for advanced image analysis
3οΈβ£ Text-to-Speech integration for an audio dimension
π οΈ Tech Stack:
Transformers
Gradio
PyTorch
Flash Attention 2
Parler TTS
This project demonstrates the potential of integrating multiple AI models to create a more comprehensive and interactive user experience. It's a step towards more natural and versatile AI assistants.
π Check out the demo and let me know your thoughts! How would you extend this project?
π Demo Link: sagar007/Multimodal_App
#MultimodalAI #PhiModel #MachineLearning #AIDemo