metadata

title: Chimera
emoji: 🐅🦅🐍
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 4.31.5
app_file: app.py
pinned: false
license: llama3

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Welcome To Chimera!

Here are some basic things you should know about it.

Chimera can be used as a full AI image creation station. You can use its models to create custom images from prompts, and Chimera Text will turn a plain, everyday description of what you want to create into a suitable prompt.

A simple process: tell Chimera Text what you want to create (the setting, the character features, the background, the feel of the image, and any real-life references) --> GPT automatically reformats your description into a more suitable prompt for the Stable Diffusion model --> copy the prompt it gave you and paste it into Chimera Image Generation --> the image is generated.
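As a rough illustration of the first step, the sketch below collects the elements listed above into one plain-language request that could be typed into Chimera Text. The `ImageIdea` class, its field names, and `to_request` are hypothetical helpers invented for this example; they are not part of Chimera's code.

```python
from dataclasses import dataclass


@dataclass
class ImageIdea:
    """Hypothetical container for what Chimera Text should be told."""
    subject: str              # what you want to create
    setting: str
    character_features: str
    background: str
    feel: str
    references: str = ""      # optional real-life references

    def to_request(self) -> str:
        """Join the non-empty fields into one plain-language request."""
        parts = [
            f"I want to create {self.subject}.",
            f"Setting: {self.setting}.",
            f"Character features: {self.character_features}.",
            f"Background: {self.background}.",
            f"Feel: {self.feel}.",
        ]
        if self.references:
            parts.append(f"References: {self.references}.")
        return " ".join(parts)


idea = ImageIdea(
    subject="a portrait of a desert ranger",
    setting="a canyon at dusk",
    character_features="weathered face, grey braid, leather coat",
    background="red rock walls and a distant storm",
    feel="quiet and cinematic",
)
request = idea.to_request()
print(request)
```

The printed text is only the human-side description; the reformatting into a diffusion prompt is still done by Chimera Text.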

The beauty of this process is that you can take the prompt GPT gave you and gradually change the image's features based on what you saw in the previous images. Simply ask GPT to alter the details and the setting while keeping the same character look or style that the diffusion model produced from the prompt. Slowly but surely you arrive at a fully custom image of what you pictured in your head, with the help of both AIs.
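One way to picture this iteration loop is to treat the prompt as fixed parts (character, style) plus variable parts (setting, details). The sketch below is only an illustration of that idea; `PromptParts` and its fields are hypothetical names, and real prompts from Chimera Text will not necessarily split this cleanly.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class PromptParts:
    """Hypothetical split of a prompt into fixed and variable parts."""
    character: str   # kept identical across iterations
    style: str       # kept identical across iterations
    setting: str     # changed between iterations
    details: str     # changed between iterations

    def render(self) -> str:
        """Join the parts into a single comma-separated prompt."""
        return ", ".join([self.character, self.style, self.setting, self.details])


first = PromptParts(
    character="young woman with silver hair and green eyes",
    style="digital painting, soft lighting",
    setting="rainy city street at night",
    details="holding a red umbrella",
)

# Next iteration: only the setting and details change;
# the character and style strings are carried over untouched.
second = replace(first, setting="sunlit rooftop garden", details="reading a book")

print(first.render())
print(second.render())
```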

Here are some key notes for understanding the parameters of the Stable Diffusion model:

Inference Steps: The number of denoising steps the model runs to generate the image. It starts from a very noisy image and, guided by the prompt, removes some of the noise at each step, so more steps give a more gradual, refined denoising. More steps mean slower generation but usually higher-quality images; fewer steps mean faster generation but often lower-quality images. I'd recommend leaving the steps at 40.
High Noise Fraction: The amount of noise added to the image during the initial stages of the diffusion process. A higher value forces the model to rely more on the patterns associated with the prompt than on the image itself, because so much of the image is noise during diffusion. A lower value gives the model more versatility and creativity with the image, but it captures fewer features of the prompt, so the results are often less aligned with it, possibly because the image converges faster.
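To make the Inference Steps trade-off more concrete, here is a small sketch of how a step count can translate into a denoising schedule. It assumes 1000 training timesteps and evenly spaced steps, which is a common scheduler setup but not necessarily the one Chimera uses; `timestep_schedule` is a hypothetical helper written for this example.

```python
def timestep_schedule(num_inference_steps: int, num_train_timesteps: int = 1000) -> list[int]:
    """Return evenly spaced timesteps from the noisiest (T - 1) down to 0.

    Assumption: linear spacing over 1000 training timesteps.
    """
    if num_inference_steps < 2:
        raise ValueError("this sketch needs at least 2 steps")
    last = num_train_timesteps - 1
    return [
        round(last * i / (num_inference_steps - 1))
        for i in range(num_inference_steps - 1, -1, -1)
    ]


for steps in (10, 40):
    schedule = timestep_schedule(steps)
    # Average distance between consecutive timesteps in the schedule.
    average_jump = (schedule[0] - schedule[-1]) / (steps - 1)
    print(f"{steps} steps: starts at {schedule[:3]}, average jump {average_jump:.1f}")
```

Under these assumptions both schedules run from the noisiest timestep to a clean image; the longer one simply takes smaller jumps, which is why it is slower but tends to look more refined.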

My rule of thumb is to leave the parameters at their default values unless you see that the model needs more creativity for that specific image. When more creativity is needed, I set the High Noise Fraction between 6 and 7, but to be honest you can play around with it, because the results are often random. The lowest I would go is 4, and only if the model really needs creativity. I personally never set it higher than 8.0, because I feel the model then loses some creativity and tries so hard to reproduce the prompt exactly that it actually deviates from what the prompt was saying.
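The rule of thumb can be written down as a tiny helper. This is only a restatement of the numbers above, not code from Chimera: the function names and the creativity labels are hypothetical, and the slider's default is passed in because it is not stated here.

```python
CREATIVE_RANGE = (6.0, 7.0)   # "between 6 and 7" when more creativity is needed
LOWEST = 4.0                  # lowest value the author would use
HIGHEST = 8.0                 # the author never goes above this


def pick_high_noise_fraction(slider_default: float, creativity: str = "normal") -> float:
    """Map a creativity need to a High Noise Fraction setting (hypothetical helper)."""
    if creativity == "normal":
        return slider_default          # leave the parameter at its default
    if creativity == "more":
        low, high = CREATIVE_RANGE
        return (low + high) / 2        # midpoint of the 6-7 range
    if creativity == "most":
        return LOWEST                  # only if the model really needs creativity
    raise ValueError(f"unknown creativity level: {creativity!r}")


def is_within_author_range(value: float) -> bool:
    """Check a manual setting against the author's personal 4-8 limits."""
    return LOWEST <= value <= HIGHEST


# 7.5 is a placeholder; the slider's real default is not given in this README.
for level in ("normal", "more", "most"):
    print(level, pick_high_noise_fraction(7.5, level))
```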

That is all! If you have any recommendations or questions, or want to contribute to this project, you're more than welcome to do so. We can chat in the discussion panel! 😀