Ashish08 commited on
Commit
48b5537
β€’
1 Parent(s): d9dcbb9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,13 +1,44 @@
1
  ---
2
  title: Describe And Generate Game
3
- emoji: 🌍
4
  colorFrom: blue
5
  colorTo: green
6
  sdk: gradio
7
- sdk_version: 4.39.0
8
  app_file: app.py
9
  pinned: false
10
  license: afl-3.0
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: Describe And Generate Game
3
+ emoji: πŸŽ¨πŸ–ΌοΈβž‘οΈπŸ“βž‘οΈπŸ–ŒοΈβœ¨
4
  colorFrom: blue
5
  colorTo: green
6
  sdk: gradio
7
+ sdk_version: 4.41.0
8
  app_file: app.py
9
  pinned: false
10
  license: afl-3.0
11
  ---
12
 
13
+ # Describe-and-Generate Game πŸ–οΈ
14
+
15
+ This application is an interactive "Describe-and-Generate" game that leverages advanced AI models to generate creative images based on the captions derived from an initial uploaded image.
16
+ ## How It Works
17
+
18
+ 1. **Image Captioning**: The app uses the `Salesforce/BLIP-IMAGE-CAPTIONING-BASE` model to generate a descriptive caption from the uploaded image.
19
+ 2. **Image Generation**: The app then feeds this caption into the `runwayml/stable-diffusion-v1-5` model, which generates a new image based on the caption.
20
+
21
+ ## Features
22
+
23
+ - **Upload an Image**: Start by uploading an image of your choice.
24
+ - **Caption Generation**: The app will automatically generate a caption describing the uploaded image.
25
+ - **Image Generation**: Based on the generated caption, the app will create a new, AI-generated image.
26
+ - **Interactive Interface**: Built using Gradio, the interface is simple and user-friendly, allowing you to experiment with different images and see the AI's creative output.
27
+
28
+ ## Models Used
29
+
30
+ - **Captioning Model**: [SALESFORCE/BLIP-IMAGE-CAPTIONING-BASE](https://huggingface.co/Salesforce/blip-image-captioning-base)
31
+ - **Image Generation Model**: [RUNWAYML/STABLE-DIFFUSION-V1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5)
32
+
33
+ ## Software Packages
34
+
35
+ - **Gradio**: Used to create the interactive web interface.
36
+ - **Transformers**: Used for the image captioning model.
37
+ - **Diffusers**: Used for the image generation model.
38
+ - **Spaces**: Utilized for GPU acceleration during model execution.
39
+
40
+ ## How to Use
41
+
42
+ 1. **Upload an Image**: Use the "Your first image" button to upload any image.
43
+ 2. **Generate**: Click the "Caption and generate" button.
44
+ 3. **View Results**: The app will display the generated caption and the new image created based on that caption.