TencentARC
/

PhotoMaker

Model card Files Files and versions Community

Paper99 commited on Jan 14

Commit

366b9bf

•

1 Parent(s): 7ffbee7

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ pipeline_tag: text-to-image
 ## Introduction
 <!-- Provide a quick summary of what the model is/does. -->
-Users can input one or several face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.
 ### Realistic results
@@ -38,7 +38,11 @@ More results can be found in our [project page](https://photo-maker.github.io/)
 ## Model Details
-It mainly contains two parts:
 ## Usage

 ## Introduction
 <!-- Provide a quick summary of what the model is/does. -->
+Users can input one or a few face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.
 ### Realistic results
 ## Model Details
+It mainly contains two parts corresponding to two keys in loaded state dict:
+1. `id_encoder` includes finetuned OpenCLIP-ViT-H-14 and a few fuse layers.
+2. `lora_weights` applies to all attention layers in the UNet, and the rank is set to 64.
 ## Usage