jaiminjariwala's picture
Update README.md
d5017ec verified
|
raw
history blame
1.64 kB
metadata
title: Multimodal Content Generation
emoji: 📉
colorFrom: indigo
colorTo: green
sdk: streamlit
sdk_version: 1.32.0
app_file: app.py
pinned: false
license: apache-2.0

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

A Multimodal Content Generation have following capabilities:

1. A Conversational chatbot as same as ChatGPT v3.5 + Image Summarization Capabilities through GOOGLE GEMINI VISION PRO API.

https://github.com/jaiminjariwala/Multimodal-Content-Generation-using-LLMs/assets/157014747/e4cd27c9-d0ed-42e9-94fc-bc0458eb8437

Screenshot 2024-03-07 at 5 00 49 PM

2. Text to Image (using Stability Ai (Stable Diffusion)) through REPLICATE API.

Screenshot 2024-03-07 at 10 58 41 AM

Setup steps:

  1. Create virtual environment

    python -m venv <name of virtual environment>
    
  2. Activate it

    source <name of virtual environment>/bin/activate
    
  3. Now install required libraries from requirements.txt file using...

    pip install -r requirements.txt
    
  4. Create .env file and add your API TOKEN

    GOOGLE_API_KEY="Enter Your GOOGLE API TOKEN"
    
    REPLICATE_API_KEY="ENTER YOUR REPLICATE API TOKEN "
    
  5. To run app

    streamlit run <name-of-app>.py