VASUGI committed
Commit fdc3d49
1 Parent(s): d1aba68

Upload 20 files

.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ images/2048x1536-harley-davidson-riders-8k_1536316579.jpg filter=lfs diff=lfs merge=lfs -text
+ images/IMG_2696.JPG filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,12 +1,94 @@
- ---
- title: Image Text Storyteller
- emoji: 🏢
- colorFrom: green
- colorTo: yellow
- sdk: streamlit
- sdk_version: 1.28.2
- app_file: app.py
- pinned: false
- ---
-
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+
+
+
+ https://github.com/arjunprakash027/Story_Teller/assets/72484657/1c756dd8-9889-420d-a30e-13dbe137a2f0
+
+
+ # Story Teller
+
+ Story Teller is a Streamlit application that generates a story based on an input image. It utilizes the Hugging Face Transformers library and the Salesforce BLIP Image Captioning model.
+
+ ## Table of Contents
+
+ - [Installation](#installation)
+ - [Usage](#usage)
+ - [API Token](#api-token)
+ - [Models](#models)
+ - [Running the Application](#running-the-application)
+ - [Contributing](#contributing)
+
+ ## Installation
+
+ To install the necessary dependencies, run the following command:
+
+ ```shell
+ pip install -r requirements.txt
+ ```
+
+ Make sure you have the required dependencies specified in the `requirements.txt` file.
+
+ ## Usage
+
+ To use the application, follow the steps below:
+
+ 1. Run the Streamlit application by executing the following command:
+
+ ```shell
+ streamlit run app.py
+ ```
+
+ 2. Access the application through the provided URL in the console.
+
+ 3. The application interface will appear with the title "Story Teller" and an instruction to "Upload an image and get a story".
+
+ 4. Click on the "Upload your file here..." button to select an image file (supported formats: PNG, JPEG, JPG).
+
+ 5. Once the image is uploaded, it will be displayed on the page.
+
+ 6. The application will process the uploaded image using the Salesforce BLIP Image Captioning model and generate a textual description of the image.
+
+ 7. The generated text will then be passed to the Hugging Face API to generate a story based on the text.
+
+ 8. The application will display the generated story on the page.
+
+ 9. If any errors occur during the process, an error message will be shown on the page, and you can try again.
+
+ ## API Token
+
+ The application requires an API token from Hugging Face to access the story generation model. To obtain an API token, follow these steps:
+
+ 1. Sign up or log in to your Hugging Face account at [https://huggingface.co/](https://huggingface.co/).
+
+ 2. Once logged in, go to your account settings and navigate to the "API token" section.
+
+ 3. Generate a new API token, copy it, and replace the `"your api key"` placeholder in the `Models` class of `text_model.py` with your actual API token (or keep the token in a `.env` file, as sketched below).
+
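+ A minimal sketch of the `.env` approach, assuming a variable named `HF_API_TOKEN` (the variable name is just an example; `text_model.py` already calls `load_dotenv()`):
+
+ ```python
+ # .env file in the project root (do not commit it):
+ #   HF_API_TOKEN=hf_xxxxxxxxxxxxxxxx
+ import os
+ from dotenv import find_dotenv, load_dotenv
+
+ load_dotenv(find_dotenv())  # read variables from a .env file, if one exists
+ api_token = os.getenv("HF_API_TOKEN", "your api key")  # fall back to the placeholder
+ ```
+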
+ ## Models
+
+ The `Models` class in `text_model.py` encapsulates the functionality of the application. It contains the following methods (a usage sketch follows the list):
+
+ - `__init__()`:
+   - Initializes the class and sets the API token and model ID.
+
+ - `img2text(url)`:
+   - Takes an image path or URL as input and uses the Salesforce BLIP Image Captioning model to convert the image into text. It returns the generated text.
+
+ - `story(payload)`:
+   - Takes a payload as input, which contains the generated text, and sends a request to the Hugging Face API to generate a story based on the text. It returns the generated story.
+
+ - `chain(payload, num=0)`:
+   - This method acts as a recursive function that generates a chain of stories. It takes a payload as input, which initially contains the generated text. It recursively calls the `story()` method and updates the payload until the desired number of stories (50 in this case) is generated. The progress bar is also updated accordingly.
+
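+ A minimal sketch of driving these methods directly (the image path is only an example; `chain()` also updates a Streamlit progress bar, so expect Streamlit warnings when this is run outside `streamlit run`):
+
+ ```python
+ from text_model import Models
+
+ model = Models()
+ caption = model.img2text("images/example.jpg")  # caption a local image with BLIP
+ story = model.chain(caption)                    # feed the caption through 50 chained story() calls
+ print(story)
+ ```
+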
+ ## Running the Application
+
+ If you are curious and just want to try the backend models directly, execute the following command:
+
+ ```shell
+ python text_model.py
+ ```
+
+ Make sure you have the required dependencies installed, as mentioned in the installation section.
+
+ ## Contributing
+
+ Contributions to the Story Teller application are welcome! If you find any issues or have suggestions for improvements, please feel free to open an issue or submit a pull request.
__pycache__/text_model.cpython-311.pyc ADDED
Binary file (3.46 kB).
 
app.py ADDED
@@ -0,0 +1,19 @@
+ import streamlit as st
+ from text_model import Models
+ import os
+
+ st.title("Story Teller")
+ model = Models()
+ st.write("Upload an image and get a story")
+ uploaded_file = st.file_uploader("Upload your file here...", type=['png', 'jpeg', 'jpg'])
+
+ try:
+     if uploaded_file is not None:
+         # Show the uploaded image and save a copy under images/ so the
+         # captioning pipeline can read it from disk.
+         st.image(uploaded_file)
+         with open(os.path.join("images", uploaded_file.name), "wb") as f:
+             f.write(uploaded_file.getbuffer())
+         # Caption the image with BLIP, then chain the caption through the
+         # story model and display the result.
+         response = model.chain(model.img2text(f'images/{uploaded_file.name}'))
+         st.write(response)
+ except Exception as e:
+     print(e)
+     st.write("An error occurred! Please try again.")
image.jpg ADDED
images/1353.jpg ADDED
images/1683386165361.jpeg ADDED
images/2048x1536-harley-davidson-riders-8k_1536316579.jpg ADDED

Git LFS Details

  • SHA256: d1089e3ecb1f489190fac65d65aa192d4e30fbe52dc2a387e427584eb022f734
  • Pointer size: 132 Bytes
  • Size of remote file: 1.14 MB
images/2048x1536-spiderman-miles-lost-in-space-4k_1553071367.jpg ADDED
images/4f503844d9a44b0350c25eeefae028d3.jpg ADDED
images/80g58cvhb7x31.jpg ADDED
images/IMG_2696.JPG ADDED

Git LFS Details

  • SHA256: 2965f3059d3afcf086c7b21c9de98e7b81aa75d887bc43af2a5905048ee40e30
  • Pointer size: 132 Bytes
  • Size of remote file: 6.04 MB
images/OIP.jpg ADDED
images/Screenshot 2022-09-29 234504.png ADDED
images/WhatsApp Image 2023-05-05 at 15.10.45.jpg ADDED
images/WhatsApp Image 2023-05-07 at 10.46.57.jpg ADDED
images/photo_2023-06-14_13-00-23.jpg ADDED
images/photo_2023-06-14_13-00-31.jpg ADDED
images/togather.jpg ADDED
text_model.py ADDED
@@ -0,0 +1,51 @@
+ from dotenv import find_dotenv, load_dotenv
+ from transformers import pipeline
+ import requests
+ import json
+ import logging
+ import streamlit as st
+
+ load_dotenv(find_dotenv())  # load environment variables from a .env file, if present
+ log = logging.getLogger("text_model.py")
+ logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(message)s', datefmt='%d-%b-%y %H:%M:%S')
+
+ class Models:
+     def __init__(self):
+         # Replace the placeholder with your Hugging Face API token (see the README's "API Token" section).
+         self.api_token = "your api key"
+         self.model_id = "openai-gpt"
+
+     def img2text(self, url):
+         # Caption the image at the given path or URL with the BLIP image-captioning model.
+         image_to_text = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")
+         text = image_to_text(url)
+         log.info("image to text model running")
+         return text[0]['generated_text']
+
+     def story(self, payload):
+         # Send the text payload to the Hugging Face Inference API and return the generated story.
+         payload = json.dumps(payload)
+         headers = {"Authorization": f"Bearer {self.api_token}"}
+         url = f"https://api-inference.huggingface.co/models/{self.model_id}"
+         response = requests.post(url, headers=headers, data=payload)
+         log.info("story model running")
+         return response.json()[0]['generated_text']
+
+     # Streamlit progress bar shared by chain(); created once when the class is defined.
+     status_bar = st.progress(0, text="progress will be shown here")
+
+     def chain(self, payload, num=0):
+         # Recursively feed the generated text back into story() 50 times,
+         # updating the progress bar on each step.
+         if num == 50:
+             self.status_bar.progress(100, text="story generated!!")
+             return payload
+
+         response = self.story(payload)
+         percent = int(((num + 1) / 50) * 100)
+         self.status_bar.progress(percent, text="generating story for you!!")
+         return self.chain(response, num + 1)
+
+
+ if __name__ == "__main__":
+     model = Models()
+     response = model.chain(model.img2text("image.jpg"))
+     print(response)
wp5.jpg ADDED