flux-schnell-cfg / README.md
miike-ai's picture
Update README.md
e507c73 verified
|
raw
history blame
3.23 kB
---
base_model:
- black-forest-labs/FLUX.1-schnell
tags:
- flux
- diffusers
pipeline_tag: text-to-image
---
# Skittles v2
## Model Summary
**Skittles v2** is a cutting-edge text-to-image generation model, created by merging components from the **FLUX.1 Schnell** architecture. By combining the precision of **FLUX.1 Schnell** with advanced tweaks, **Skittles v2** is designed to offer high-quality image outputs.
- **Type**: Text-to-Image Generation
- **Architecture**: Merged FLUX.1 Schnell with CFG capabilities
- **Output Quality**: Seems to be on par with **FLUX.1 Dev**
- **Performance**: Optimized for both image fidelity (speed is degraded (I'm looking into it))
---
## Features
- **CFG Integration**: Skittles v2 unlocks CFG (Classifier-Free Guidance) capabilities, offering fine-grained control over image generation.
- **High Fidelity**: Produces ultra-realistic and detailed images.
- **Customizable Output**: Supports a wide range of prompts, styles, and configurations.
---
## Model Details
- **Base Model**: FLUX.1 Schnell
- **Merge Approach**: The model was combined using a custom merging strategy, blending FLUX.1 Schnell’s architecture with optimized CFG decoding.
- **Training Paradigm**: Not retrained, but restructured for improved inference performance.
- **Output Size**: Supports resolutions up to 1024x1024 pixels.
---
## Intended Use
### Applications
- Generating ultra-realistic images for creative projects
- Creating concept art, visual prototypes, and artistic renderings
- Exploration of text-to-image synthesis for research or artistic purposes
### Examples
| Prompt | Image Description |
|-------|--------------------|
| "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" | Hyper-detailed astronaut surrounded by lush, muted jungle tones |
| "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K" | Vibrant, glowing cityscape with dynamic lighting effects |
| "A delicious ceviche cheesecake slice" | Highly detailed and realistic rendition of a culinary masterpiece |
---
## How to Use
```python
from diffusers import DiffusionPipeline
# Load the model
pipe = DiffusionPipeline.from_pretrained("miike-ai/skittles-v2")
pipe.to("cuda") # Ensure CUDA is available
# Generate an image
prompt = "An ultra-realistic image of a futuristic cityscape."
image = pipe(prompt, guidance_scale=3.5, num_inference_steps=28).images[0]
# Save the result
image.save("generated_image.png")
```
---
## Limitations and Biases
- The model may produce biased or stereotypical outputs based on the provided text prompts.
- Outputs are deterministic but rely heavily on the prompt quality. Results may vary with ambiguous descriptions.
- The model is not trained to handle NSFW content or harmful prompts.
---
## Acknowledgments
- Built on top of **FLUX.1 Schnell** by Black Forest Labs
- Contributions from **miike-ai**
- Integrated with Hugging Face Diffusers for seamless inference
---
## Citation
If you use **Skittles v2** in your work, please cite:
```bibtex
@misc{miike2024skittlesv2,
title={Skittles v2: A Merged Text-to-Image Generation Model},
author={miike-ai},
year={2024},
url={https://huggingface.co/miike-ai/skittles-v2},
}
```