PhotoMaker / README.md
Paper99's picture
Update README.md
7ffbee7 verified
|
raw
history blame
No virus
2.76 kB
metadata
license: apache-2.0
language:
  - en
library_name: diffusers
pipeline_tag: text-to-image

PhotoMaker Model Card

Introduction

Users can input one or several face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA models.

Realistic results

image/jpeg

image/jpeg

Stylization results

image/jpeg

image/jpeg

More results can be found in our project page

Model Details

It mainly contains two parts:

Usage

You can directly download the model in this repository. You also can download the model in python script:

from huggingface_hub import hf_hub_download
photomaker_ckpt = hf_hub_download(repo_id="TencentARC/PhotoMaker", filename="photomaker-v1.bin", repo_type="model")

Then, please follow the instructions in our GitHub repository.

Limitations

  • The model's customization performance degrades on Asian male faces.
  • The model still struggles with accurately rendering human hands.

Bias

While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.

Citation

BibTeX:

@article{li2023photomaker,
  title={PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding},
  author={Li, Zhen and Cao, Mingdeng and Wang, Xintao and Qi, Zhongang and Cheng, Ming-Ming and Shan, Ying},
  booktitle={arXiv preprint arxiv:2312.04461},
  year={2023}
}