Instructions to use krea/Krea-2-Turbo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use krea/Krea-2-Turbo with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("krea/Krea-2-Turbo", dtype=torch.bfloat16, device_map="cuda") prompt = "A small, dark-colored cat is captured mid-stride, walking down the center of a narrow, abandoned street. The street is paved and appears cracked and worn. On either side of the street are tall, dilapidated buildings with visible brickwork and windows. A street lamp stands on the right side. The entire image is rendered in a monochromatic blue, with a distinct halftone dot pattern overlaying the scene, giving it a retro or printed appearance. The focus is soft, and the lighting is diffused, creating a hazy, atmospheric effect. The perspective is from ground level, looking down the length of the street, which narrows into the distance., halftone texture" image = pipe(prompt).images[0] - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- Draw Things
- DiffusionBee
Training LoRA for Krea 2 Turbo w/Training Adapter with AI-Toolkit
I wanted to share my experience because I couldn't find many reports about training Krea 2 Turbo LoRAs on a 12GB GPU. This was my first successful Krea 2 Turbo LoRA after some trial and error.
GPU: RTX 5070 12GB VRAM
RAM: 64 GB
OS: Windows 11
dataset:
36 1024x1024 images
Realistic female character
Rank:32
Resolution: 1024
LoRA for Kerea 2 Turbo w/Training Adapter
cache_latents_to_disk: true
cache_text_embeddings: true
Initially training appeared to hang. It was actually creating latent cache and text embeddings. After waiting, training started normally.
Changing
lokr_full_rank: true
to
lokr_full_rank: false
reduced step time from 25.4 sec to 22.3 sec.
Training time: 18h45m
LoRA size: 223MB
Training completed successfully. Character identity is generally consistent across generations. Body consistency is very good. Face consistency is acceptable. Some generations still deviate from the target face, but complete identity drift is rare.
Initially I thought 12GB VRAM would be insufficient. However,
cache_latents_to_disk: true
cache_text_embeddings: true
lokr_full_rank: false
made training practical.
Hopefully this helps someone who is wondering whether 12GB VRAM is enough.
Thanks!
Thank you for sharing! curious as to why you chose to train on Turbo + Turbo LoRA instead of directly training on Raw ?
Thank you for your comment. Because, I didn't think 12GB of VRAM was sufficient for LoRA training based on raw. However, a recent Reddit post shows someone doing exactly it, so I will try it myself.