	---
	base_model:
	- black-forest-labs/FLUX.1-schnell
	tags:
	- flux
	- diffusers
	pipeline_tag: text-to-image
	---

	# Skittles v2

	## Model Summary

	Skittles v2 is a text-to-image generation model derived from FLUX.1 Schnell. It merges FLUX.1 Schnell components with modifications that enable classifier-free guidance (CFG), with the goal of producing high-quality image outputs.

	- Type: Text-to-Image Generation
	- Architecture: Merged FLUX.1 Schnell with CFG capabilities
	- Output Quality: Appears to be on par with FLUX.1 Dev, based on the author's impression
	- Performance: Optimized for image fidelity; inference speed is currently degraded (under investigation)

	---

	## Features

	- CFG Integration: Skittles v2 unlocks CFG (Classifier-Free Guidance) capabilities, offering fine-grained control over image generation.
	- High Fidelity: Produces ultra-realistic and detailed images.
	- Customizable Output: Supports a wide range of prompts, styles, and configurations.
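
For readers unfamiliar with CFG, the following is a minimal sketch of the standard classifier-free guidance combination step, written with plain Python lists so it runs without dependencies. It illustrates the general technique only; how Skittles v2 applies guidance internally is not documented in this card, and the function name `apply_cfg` is hypothetical.

```python
from typing import List, Sequence


def apply_cfg(uncond: Sequence[float], cond: Sequence[float],
              guidance_scale: float) -> List[float]:
    """Blend unconditional and conditional predictions (standard CFG rule)."""
    if len(uncond) != len(cond):
        raise ValueError("predictions must have the same length")
    # guided = uncond + scale * (cond - uncond)
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]


# A scale of 1.0 returns the conditional prediction unchanged;
# larger scales push the result further from the unconditional one.
print(apply_cfg([0.0, 1.0], [1.0, 3.0], 1.0))  # [1.0, 3.0]
print(apply_cfg([0.0, 1.0], [1.0, 3.0], 3.5))  # [3.5, 8.0]
```

In a real pipeline the two inputs would be model predictions for an empty (or negative) prompt and for the actual prompt, and `guidance_scale` is the knob exposed to the user.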

	---

	## Model Details

	- Base Model: FLUX.1 Schnell
	- Merge Approach: The model was combined using a custom merging strategy, blending FLUX.1 Schnell’s architecture with optimized CFG decoding.
	- Training Paradigm: Not retrained, but restructured for improved inference performance.
	- Output Size: Supports resolutions up to 1024x1024 pixels.
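
Since the stated limit is 1024x1024, a requested size may need to be scaled down before it is passed to the pipeline. The helper below is a rough sketch of one way to do that while keeping the aspect ratio. The name `fit_resolution` is hypothetical, and rounding each side down to a multiple of 16 is an assumption about what FLUX-style pipelines expect, not something stated in this card.

```python
from typing import Tuple


def fit_resolution(width: int, height: int, max_side: int = 1024,
                   multiple: int = 16) -> Tuple[int, int]:
    """Scale a requested size to fit within max_side, keeping aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longest = max(width, height)
    if longest > max_side:
        # Integer arithmetic avoids floating-point rounding surprises.
        width = width * max_side // longest
        height = height * max_side // longest
    # Round each side down to the nearest multiple (assumed requirement),
    # but never below one multiple.
    width = max(multiple, width // multiple * multiple)
    height = max(multiple, height // multiple * multiple)
    return width, height


print(fit_resolution(1920, 1080))  # (1024, 576)
```

The returned values could then be supplied as the `width` and `height` arguments of the pipeline call, provided the loaded pipeline accepts them, as Diffusers text-to-image pipelines typically do.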

	---

	## Intended Use

	### Applications
	- Generating ultra-realistic images for creative projects
	- Creating concept art, visual prototypes, and artistic renderings
	- Exploration of text-to-image synthesis for research or artistic purposes

	### Examples

	| Prompt | Image Description |
	|--------|-------------------|
	| "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" | Hyper-detailed astronaut surrounded by lush, muted jungle tones |
	| "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K" | Vibrant, glowing cityscape with dynamic lighting effects |
	| "A delicious ceviche cheesecake slice" | Highly detailed and realistic rendition of a culinary masterpiece |
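
To try all three example prompts in one go, something like the following sketch could be used. It assumes an already-loaded pipeline object `pipe` (for instance, one created as in the How to Use section); the helper names `prompt_to_filename` and `generate_all` are hypothetical, and the default `guidance_scale` and step count simply mirror the values used elsewhere in this card.

```python
import re

EXAMPLE_PROMPTS = [
    "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k",
    "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K",
    "A delicious ceviche cheesecake slice",
]


def prompt_to_filename(prompt: str, index: int, max_len: int = 40) -> str:
    """Turn a prompt into a short, filesystem-friendly PNG filename."""
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower()).strip("-")
    return f"{index:02d}-{slug[:max_len].rstrip('-')}.png"


def generate_all(pipe, prompts=None, guidance_scale=3.5, num_inference_steps=28):
    """Run each prompt through the pipeline and save one image per prompt."""
    prompts = EXAMPLE_PROMPTS if prompts is None else prompts
    paths = []
    for i, prompt in enumerate(prompts, start=1):
        image = pipe(
            prompt,
            guidance_scale=guidance_scale,
            num_inference_steps=num_inference_steps,
        ).images[0]
        path = prompt_to_filename(prompt, i)
        image.save(path)
        paths.append(path)
    return paths
```

Calling `generate_all(pipe)` would write one numbered PNG per prompt into the current directory and return the list of paths.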

	---

	## How to Use

	```python
	from diffusers import DiffusionPipeline

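	# Load the model. The repo id below is the one given by the author; this card is
	# hosted under miike-ai/flux-schnell-cfg, so use that id if this one does not resolve.
	pipe = DiffusionPipeline.from_pretrained("miike-ai/skittles-v2")
	pipe.to("cuda")  # Requires a CUDA-capable GPU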

	# Generate an image
	prompt = "An ultra-realistic image of a futuristic cityscape."
	image = pipe(prompt, guidance_scale=3.5, num_inference_steps=28).images[0]

	# Save the result
	image.save("generated_image.png")
	```

	---

	## Limitations and Biases

	- The model may produce biased or stereotypical outputs based on the provided text prompts.
	- Outputs are reproducible only when the random seed is fixed, and they depend heavily on prompt quality. Results may vary with ambiguous descriptions.
	- The model is not trained to handle NSFW content or harmful prompts.
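
Because diffusion sampling starts from random noise, fixing the seed is the usual way to make a run repeatable. The sketch below assumes the loaded pipeline accepts a `generator` argument, as Diffusers pipelines generally do; the wrapper name `generate_seeded` is hypothetical. Even with a fixed seed, results may still differ across hardware or library versions.

```python
def generate_seeded(pipe, prompt, seed, device="cpu", **kwargs):
    """Generate one image with a fixed random seed for repeatable output."""
    # Imported lazily so this helper can be defined without torch installed.
    import torch

    # A seeded generator controls the initial noise used for sampling.
    generator = torch.Generator(device=device).manual_seed(seed)
    return pipe(prompt, generator=generator, **kwargs).images[0]
```

For example, `generate_seeded(pipe, "A futuristic cityscape at sunset", seed=0, guidance_scale=3.5, num_inference_steps=28)` should give the same image on repeated runs in the same environment.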

	---

	## Acknowledgments

	- Built on top of FLUX.1 Schnell by Black Forest Labs
	- Contributions from miike-ai
	- Integrated with Hugging Face Diffusers for seamless inference

	---

	## Citation

	If you use Skittles v2 in your work, please cite:

	```bibtex
	@misc{miike2024skittlesv2,
	title={Skittles v2: A Merged Text-to-Image Generation Model},
	author={miike-ai},
	year={2024},
	url={https://huggingface.co/miike-ai/skittles-v2},
	}
	```