sohanAI's picture
Upload README.md
dd74e9e verified

A newer version of the Gradio SDK is available: 5.33.1

Upgrade
metadata
title: DF-GAN Bird Image Generator
emoji: 🐦
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 3.50.0
app_file: app.py
pinned: false
license: cc-by-nc-sa-4.0

DF-GAN Bird Image Generator

This application uses the DF-GAN (Deep Fusion GAN) model to generate bird images based on text descriptions. Just enter a description of a bird, and the model will generate a realistic image that matches your description.

About the Model

This application uses the pre-trained bird model from the DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis paper (CVPR 2022). DF-GAN is a text-to-image synthesis model that can generate high-quality images from textual descriptions.

How to Use

  1. Enter a description of a bird in the text box (e.g., "a yellow bird with a black head")
  2. Choose how many images you want to generate (1-4)
  3. Optionally, set a random seed for reproducible results
  4. Click "Generate Image" button
  5. View the generated bird images that match your description

Examples

Try these example descriptions:

  • "this bird has an orange bill, a white belly and white eyebrows"
  • "a small bird with a red head, breast, and belly and black wings"
  • "this bird is yellow with black and has a long, pointy beak"
  • "this bird is white in color, and has a orange beak"

Implementation Details

This application uses the following components:

  • DF-GAN architecture for text-to-image synthesis
  • DAMSM text encoder for embedding text descriptions
  • Gradio for the web interface

Credits

This implementation is based on the official DF-GAN repository: tobran/DF-GAN