Update README.md
README.md
CHANGED
@@ -33,9 +33,7 @@ The GLIGEN model was created by researchers and engineers from [University of Wi
 The [`StableDiffusionGLIGENPipeline`] can generate photorealistic images conditioned on grounding inputs.
 
 Along with text and bounding boxes, if input images are given, this pipeline can insert objects described by text at the region defined by bounding boxes.
-Otherwise, it'll generate an image described by the caption/prompt and insert objects described by text at the region defined by bounding boxes.
-
-It's trained on COCO2014D and COCO2014CD datasets, and the model uses a frozen CLIP ViT-L/14 text encoder to condition itself on grounding inputs.
+Otherwise, it'll generate an image described by the caption/prompt and insert objects described by text at the region defined by bounding boxes. It's trained on COCO2014D and COCO2014CD datasets, and the model uses a frozen CLIP ViT-L/14 text encoder to condition itself on grounding inputs.
 
 This weights here are intended to be used with the 🧨 Diffusers library. If you want to use one of the official checkpoints for a task, explore the [gligen](https://huggingface.co/gligen) Hub organizations!
 
@@ -77,7 +75,6 @@ from diffusers.utils import load_image
 model_id = "masterful/gligen-1-4-inpainting-text-box"
 device = "cuda"
 
-
 pipe = StableDiffusionGLIGENPipeline.from_pretrained(model_id, variant="fp16", torch_dtype=torch.float16)
 pipe = pipe.to(device)
 
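
The README text describes inserting objects "at the region defined by bounding boxes" but the diff does not show how those boxes are written. As a minimal sketch, assuming the Diffusers convention that each box is `[xmin, ymin, xmax, ymax]` with every value scaled to the range 0 to 1, the helper below converts pixel coordinates into that form. The function name `normalize_boxes` and the 512x512 image size are hypothetical and are not part of this commit or of the library.

```python
from typing import List, Sequence, Tuple

PixelBox = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax) in pixels


def normalize_boxes(
    boxes_px: Sequence[PixelBox], width: int, height: int
) -> List[List[float]]:
    """Scale pixel boxes to [xmin, ymin, xmax, ymax] fractions of the image size.

    Hypothetical helper for illustration; not part of the Diffusers API.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    normalized: List[List[float]] = []
    for xmin, ymin, xmax, ymax in boxes_px:
        # Reject boxes that are empty, inverted, or outside the image.
        if not (0 <= xmin < xmax <= width and 0 <= ymin < ymax <= height):
            raise ValueError(f"invalid box: {(xmin, ymin, xmax, ymax)}")
        normalized.append(
            [xmin / width, ymin / height, xmax / width, ymax / height]
        )
    return normalized


if __name__ == "__main__":
    # One box covering the lower-middle part of an assumed 512x512 image.
    print(normalize_boxes([(128, 256, 384, 512)], width=512, height=512))
```

The returned list could then be passed as the `gligen_boxes` argument of the pipeline call, with one entry in `gligen_phrases` per box; check the pipeline documentation for the exact arguments your installed version accepts.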