|
--- |
|
base_model: |
|
- black-forest-labs/FLUX.1-schnell |
|
tags: |
|
- flux |
|
- diffusers |
|
pipeline_tag: text-to-image |
|
--- |
|
|
|
# Skittles v2 |
|
|
|
## Model Summary |
|
|
|
**Skittles v2** is a cutting-edge text-to-image generation model, created by merging components from the **FLUX.1 Schnell** architecture. By combining the precision of **FLUX.1 Schnell** with advanced tweaks, **Skittles v2** is designed to offer high-quality image outputs. |
|
|
|
- **Type**: Text-to-Image Generation |
|
- **Architecture**: Merged FLUX.1 Schnell with CFG capabilities |
|
- **Output Quality**: Seems to be on par with **FLUX.1 Dev** |
|
- **Performance**: Optimized for both image fidelity (speed is degraded (I'm looking into it)) |
|
|
|
--- |
|
|
|
## Features |
|
|
|
- **CFG Integration**: Skittles v2 unlocks CFG (Classifier-Free Guidance) capabilities, offering fine-grained control over image generation. |
|
- **High Fidelity**: Produces ultra-realistic and detailed images. |
|
- **Customizable Output**: Supports a wide range of prompts, styles, and configurations. |
|
|
|
--- |
|
|
|
## Model Details |
|
|
|
- **Base Model**: FLUX.1 Schnell |
|
- **Merge Approach**: The model was combined using a custom merging strategy, blending FLUX.1 Schnell’s architecture with optimized CFG decoding. |
|
- **Training Paradigm**: Not retrained, but restructured for improved inference performance. |
|
- **Output Size**: Supports resolutions up to 1024x1024 pixels. |
|
|
|
--- |
|
|
|
## Intended Use |
|
|
|
### Applications |
|
- Generating ultra-realistic images for creative projects |
|
- Creating concept art, visual prototypes, and artistic renderings |
|
- Exploration of text-to-image synthesis for research or artistic purposes |
|
|
|
### Examples |
|
|
|
| Prompt | Image Description | |
|
|-------|--------------------| |
|
| "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" | Hyper-detailed astronaut surrounded by lush, muted jungle tones | |
|
| "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K" | Vibrant, glowing cityscape with dynamic lighting effects | |
|
| "A delicious ceviche cheesecake slice" | Highly detailed and realistic rendition of a culinary masterpiece | |
|
|
|
--- |
|
|
|
## How to Use |
|
|
|
```python |
|
from diffusers import DiffusionPipeline |
|
|
|
# Load the model |
|
pipe = DiffusionPipeline.from_pretrained("miike-ai/skittles-v2") |
|
pipe.to("cuda") # Ensure CUDA is available |
|
|
|
# Generate an image |
|
prompt = "An ultra-realistic image of a futuristic cityscape." |
|
image = pipe(prompt, guidance_scale=3.5, num_inference_steps=28).images[0] |
|
|
|
# Save the result |
|
image.save("generated_image.png") |
|
``` |
|
|
|
--- |
|
|
|
## Limitations and Biases |
|
|
|
- The model may produce biased or stereotypical outputs based on the provided text prompts. |
|
- Outputs are deterministic but rely heavily on the prompt quality. Results may vary with ambiguous descriptions. |
|
- The model is not trained to handle NSFW content or harmful prompts. |
|
|
|
--- |
|
|
|
## Acknowledgments |
|
|
|
- Built on top of **FLUX.1 Schnell** by Black Forest Labs |
|
- Contributions from **miike-ai** |
|
- Integrated with Hugging Face Diffusers for seamless inference |
|
|
|
--- |
|
|
|
## Citation |
|
|
|
If you use **Skittles v2** in your work, please cite: |
|
|
|
```bibtex |
|
@misc{miike2024skittlesv2, |
|
title={Skittles v2: A Merged Text-to-Image Generation Model}, |
|
author={miike-ai}, |
|
year={2024}, |
|
url={https://huggingface.co/miike-ai/skittles-v2}, |
|
} |
|
``` |
|
|