Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ pipeline_tag: text-to-image
|
|
19 |
## Introduction
|
20 |
|
21 |
<!-- Provide a quick summary of what the model is/does. -->
|
22 |
-
Users can input one or
|
23 |
|
24 |
### Realistic results
|
25 |
|
@@ -38,7 +38,11 @@ More results can be found in our [project page](https://photo-maker.github.io/)
|
|
38 |
|
39 |
## Model Details
|
40 |
|
41 |
-
It mainly contains two parts:
|
|
|
|
|
|
|
|
|
42 |
|
43 |
|
44 |
## Usage
|
|
|
19 |
## Introduction
|
20 |
|
21 |
<!-- Provide a quick summary of what the model is/does. -->
|
22 |
+
Users can input one or a few face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.
|
23 |
|
24 |
### Realistic results
|
25 |
|
|
|
38 |
|
39 |
## Model Details
|
40 |
|
41 |
+
It mainly contains two parts corresponding to two keys in loaded state dict:
|
42 |
+
|
43 |
+
1. `id_encoder` includes finetuned OpenCLIP-ViT-H-14 and a few fuse layers.
|
44 |
+
|
45 |
+
2. `lora_weights` applies to all attention layers in the UNet, and the rank is set to 64.
|
46 |
|
47 |
|
48 |
## Usage
|