Text-to-Image
Diffusers
English
Paper99 commited on
Commit
366b9bf
1 Parent(s): 7ffbee7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -2
README.md CHANGED
@@ -19,7 +19,7 @@ pipeline_tag: text-to-image
19
  ## Introduction
20
 
21
  <!-- Provide a quick summary of what the model is/does. -->
22
- Users can input one or several face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.
23
 
24
  ### Realistic results
25
 
@@ -38,7 +38,11 @@ More results can be found in our [project page](https://photo-maker.github.io/)
38
 
39
  ## Model Details
40
 
41
- It mainly contains two parts:
 
 
 
 
42
 
43
 
44
  ## Usage
 
19
  ## Introduction
20
 
21
  <!-- Provide a quick summary of what the model is/does. -->
22
+ Users can input one or a few face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.
23
 
24
  ### Realistic results
25
 
 
38
 
39
  ## Model Details
40
 
41
+ It mainly contains two parts corresponding to two keys in loaded state dict:
42
+
43
+ 1. `id_encoder` includes finetuned OpenCLIP-ViT-H-14 and a few fuse layers.
44
+
45
+ 2. `lora_weights` applies to all attention layers in the UNet, and the rank is set to 64.
46
 
47
 
48
  ## Usage