metadata

license: creativeml-openrail-m
language:
  - en
thumbnail: >-
  https://huggingface.co/Norod78/sd15-caricature-portraits-blip-captions/resolve/main/example/IMG_9234_cunet_anime_noise0.jpg
tags:
  - stable-diffusion
  - stable-diffusion-diffusers
  - text-to-image
datasets:
  - Norod78/caricature-portraits-blip-captions-512
inference: true

Caricature portraits diffusion model

Stable Diffusion v1.5 fine tuned on the 2D Caricature Dataset from 3D-CariGAN cropped to 512x512 and blip captioned

If you want more details on how to generate your own blip cpationed dataset see this colab

Training was done using this Hugging-Face's text to image training script

About

Put in a text prompt and generate caricatures

AUTOMATIC1111 webui checkpoint

The main folder contains a .safetensors file to be put in stable-diffusion-webui "stable-diffusion-webui/models/Stable-diffusion" folder and used to generate images

Dataset and Training

Finetuned for 10,000 iterations upon runwayml/stable-diffusion-v1-5 on BLIP captioned portraits portraits using 1xA5000 GPU on my home desktop computer

Trained by @Norod78

The original Caricature dataset citation credits

@article{ye2021caricature,
 author = {Ye, Zipeng and Xia, Mengfei and Sun, Yanan and Yi, Ran and Yu, Minjing and Zhang, Juyong and Lai, Yu-Kun and Liu, Yong-Jin},
 title = {3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos},
 journal = {IEEE Transactions on Visualization and Computer Graphics},
 year = {2021},
 doi={10.1109/TVCG.2021.3126659},
}