---
pipeline_tag: text-to-image
widget:
  - text: >-
      high quality, portrait photo of 30 y.o european man, wearing black shirt,
      serious face, cinematic shot, dramatic lighting 
    output:
      url: 1man.png
  - text: 'A tiny robot taking a break under a tree in the garden '
    output:
      url: robot.png
  - text: mystery
    output:
      url: mystery.png
  - text: a cat wearing sunglasses in the summer
    output:
      url: cat.png
  - text: 'robot holding a sign that says ’a storm is coming’ '
    output:
      url: storm.png
  - text: the vibrance of the human soul
    output:
      url: soul.png
  - text: Lady of War, chique dark clothes, vinyl, imposing pose, anime style, 90s
    output:
      url: anime.png
license: cc-by-nc-nd-4.0
---

# Constructive Deconstruction: Domain-Agnostic Debiasing of Diffusion Models

## Introduction

Constructive Deconstruction is a novel approach to debiasing diffusion models used in generative tasks such as image synthesis. By removing biases inherited from the training data, the method improves the quality and fidelity of generated images across domains. The technique proceeds in three stages: overtraining the model to a controlled noisy state, applying nightshading, and using bucketing techniques to realign the model's internal representations.

## Methodology

### Overtraining to a Controlled Noisy State

We deliberately overtrain the model until it fails in a predictable way, producing a controlled noisy state. This state makes the biases inherited from the training data easier to identify and address.
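
As an illustration only (this card does not publish the authors' training code), the sketch below shows the shape of such a run: training continues well past convergence, with validation loss logged each epoch so the onset of the predictable-failure regime can be located. `model`, `train_loader`, and `val_loader` are hypothetical placeholders for a noise-prediction network and its data.

```python
# Hypothetical sketch: keep training past convergence and record validation
# loss so the onset of the controlled noisy state can be located.
# `model(noisy_latents, timesteps)` is a placeholder noise-prediction network.
import torch
import torch.nn.functional as F

def overtrain(model, train_loader, val_loader, epochs=200, lr=1e-4, device="cuda"):
    opt = torch.optim.AdamW(model.parameters(), lr=lr)
    val_history = []
    for _ in range(epochs):  # deliberately no early stopping
        model.train()
        for noisy, target_noise, t in train_loader:
            loss = F.mse_loss(model(noisy.to(device), t.to(device)),
                              target_noise.to(device))
            opt.zero_grad()
            loss.backward()
            opt.step()
        model.eval()
        with torch.no_grad():
            val = sum(F.mse_loss(model(x.to(device), t.to(device)), n.to(device)).item()
                      for x, n, t in val_loader) / len(val_loader)
        val_history.append(val)  # a sustained rise marks the overtrained regime
    return val_history
```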

### Nightshading

Nightshading, an adversarial data-poisoning technique, is repurposed here to induce a controlled failure, making the model easier to retrain. Carefully selected poisoned data points are injected into the training set to stress the model and cause predictable failures.
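
To make the idea concrete, here is a hypothetical sketch (not the authors' pipeline) of injecting a controlled fraction of poisoned samples, modeled here as image/caption mismatches; the 10% ratio and the `(image, caption)` tuple format are illustrative assumptions.

```python
# Hypothetical sketch: inject a fixed fraction of poisoned samples, here
# caption/image mismatches, into the training set in a controlled way.
# `dataset` is a list of (image, caption) pairs; the ratio is illustrative.
import random

def inject_poison(dataset, poison_ratio=0.10, seed=0):
    rng = random.Random(seed)
    poisoned = list(dataset)
    n = int(len(poisoned) * poison_ratio)
    victims = rng.sample(range(len(poisoned)), n)
    captions = [poisoned[i][1] for i in victims]
    rng.shuffle(captions)  # reassign captions among the victims -> mismatched pairs
    for i, cap in zip(victims, captions):
        poisoned[i] = (poisoned[i][0], cap)
    return poisoned
```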

### Bucketing

Using interpolation techniques such as slerp (spherical linear interpolation) and bislerp (bilinear spherical interpolation), we merge the induced noise back into the model. This step highlights the model's learned knowledge while suppressing biases.
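
As a sketch of what such an interpolation-based merge can look like (an assumed implementation, not the authors' released code), the snippet below slerps corresponding weight tensors of two checkpoints; `merge_state_dicts` and the 0.5 blend factor are illustrative.

```python
# Assumed implementation: slerp between two checkpoints' weight tensors,
# falling back to plain lerp when the tensors are nearly parallel.
import torch

def slerp(a: torch.Tensor, b: torch.Tensor, t: float, eps: float = 1e-8) -> torch.Tensor:
    a_flat, b_flat = a.flatten().float(), b.flatten().float()
    a_n = a_flat / (a_flat.norm() + eps)
    b_n = b_flat / (b_flat.norm() + eps)
    dot = torch.clamp(torch.dot(a_n, b_n), -1.0, 1.0)
    omega = torch.acos(dot)
    if omega.abs() < 1e-4:  # nearly parallel: lerp is numerically stable
        return (1 - t) * a + t * b
    so = torch.sin(omega)
    out = (torch.sin((1 - t) * omega) / so) * a_flat + (torch.sin(t * omega) / so) * b_flat
    return out.reshape(a.shape).to(a.dtype)

def merge_state_dicts(sd_a, sd_b, t=0.5):
    # interpolate every matching weight tensor between the two checkpoints
    return {k: slerp(sd_a[k], sd_b[k], t) for k in sd_a}
```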

### Retraining and Fine-Tuning

The noisy state is then retrained on a large, diverse dataset to produce a new base model, "Mobius." Initial artifacts such as grainy details and inconsistent colors are resolved during fine-tuning, yielding high-quality, unbiased outputs.

## Results and Highlights

### Increased Diversity of Outputs

Training the model on high-quality data naturally increases the diversity of the generated outputs without intentionally loosening associations, improving generalization and variety in the generated images.

### Empirical Validation

Extensive experiments and fine-tuning demonstrate the effectiveness of our method, resulting in high-quality, unbiased outputs across various styles and domains.

## Example Outputs

The gallery images in this card were generated with the following prompts:

- movie scene screencap, cinematic footage. thanos smelling a little yellow rose. extreme wide angle,
- god
- A tiny robot taking a break under a tree in the garden
- mystery
- a cat wearing sunglasses in the summer
- robot holding a sign that says ’a storm is coming’
- The Exegenesis of the soul, captured within a boundless well of starlight, pulsating and vibrating wisps, chiaroscuro, humming transformer
- anime boy, protagonist, best quality
- natural photography of a man, glasses, cinematic,
- if I could turn back time
- ("Mobius" text logo) powerful aura, swirling power, cinematic
- the backrooms

Here are some examples of images generated by the Mobius model using different prompts:

- High Quality Portrait
- Tiny Robot
- Mystery
- Cat Wearing Sunglasses
- Robot Holding a Sign
- The Vibrance of the Human Soul
- Lady of War

## Usage and Recommendations

- CLIP setting: -3 (i.e., stop at the third-from-last CLIP text-encoder layer, following the ComfyUI convention)

This model supports and encourages experimentation with various tags, offering users the freedom to explore their creative visions in depth.
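
For reference, here is a minimal usage sketch with the diffusers library. It is not an official snippet: the Hub repo id `Corcelio/mobius`, the assumption that the model is SDXL-based, and the mapping of ComfyUI's CLIP setting -3 to `clip_skip=2` in diffusers (-1 being the default final layer) are all assumptions.

```python
# Illustrative sketch, not an official snippet. Assumptions: the model is
# SDXL-based, lives on the Hub as "Corcelio/mobius", and ComfyUI's CLIP
# setting of -3 maps to clip_skip=2 in diffusers (-1 being the default).
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "Corcelio/mobius",  # assumed repo id; adjust as needed
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    prompt="a cat wearing sunglasses in the summer",
    clip_skip=2,  # ≈ CLIP setting -3 (ComfyUI convention)
    num_inference_steps=30,
    guidance_scale=7.0,
).images[0]
image.save("cat.png")
```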

## License

This model is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.