Norod78 commited on
Commit
acc2b97
1 Parent(s): 97e03fb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ language:
4
+ - en
5
+ thumbnail: "https://huggingface.co/Norod78/sd15-caricature-portraits-blip-captions/resolve/main/example/IMG_9234_cunet_anime_noise0.jpg"
6
+ tags:
7
+ - stable-diffusion
8
+ - stable-diffusion-diffusers
9
+ - text-to-image
10
+ datasets:
11
+ - Norod78/caricature-portraits-blip-captions-512
12
+ inference: true
13
+ ---
14
+
15
+ # Caricature portraits diffusion model
16
+ *Stable Diffusion v1.5 fine tuned on the 2D Caricature Dataset from [3D-CariGAN](https://github.com/qq775193759/3D-CariGAN) cropped to 512x512 and blip captioned
17
+
18
+ If you want more details on how to generate your own blip cpationed dataset see this [colab](https://colab.research.google.com/gist/Norod/ee6ee3c4bf11c2d2be531d728ec30824/buildimagedatasetwithblipcaptionsanduploadtohf.ipynb)
19
+
20
+ Training was done using this Hugging-Face's text to image training [script](https://github.com/huggingface/diffusers/blob/main/examples/text_to_image/train_text_to_image.py)
21
+
22
+ ## About
23
+
24
+ Put in a text prompt and generate caricatures
25
+
26
+ ## AUTOMATIC1111 webui checkpoint
27
+
28
+ The [main](https://huggingface.co/Norod78/sd15-caricature-portraits-blip-captions/tree/main) folder contains a .safetensors file to be put in [stable-diffusion-webui](https://github.com/AUTOMATIC1111/stable-diffusion-webui) "stable-diffusion-webui/models/Stable-diffusion" folder and used to generate images
29
+
30
+ ![Images generated by this sample code](https://huggingface.co/Norod78/sd15-caricature-portraits-blip-captions/resolve/main/example/2DF671DA-EF5A-401B-B7AB-45FCDBBB3676.jpg)
31
+
32
+ ## Dataset and Training
33
+
34
+ Finetuned for 10,000 iterations upon [runwayml/stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5) on [BLIP captioned portraits portraits](https://huggingface.co/datasets/Norod78/caricature-portraits-blip-captions-512) using 1xA5000 GPU on my home desktop computer
35
+
36
+ Trained by [@Norod78](https://twitter.com/Norod78)
37
+
38
+ ## Theb original Caricature dataset citation credits
39
+ ```
40
+ @article{ye2021caricature,
41
+ author = {Ye, Zipeng and Xia, Mengfei and Sun, Yanan and Yi, Ran and Yu, Minjing and Zhang, Juyong and Lai, Yu-Kun and Liu, Yong-Jin},
42
+ title = {3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos},
43
+ journal = {IEEE Transactions on Visualization and Computer Graphics},
44
+ year = {2021},
45
+ doi={10.1109/TVCG.2021.3126659},
46
+ }
47
+ ```
48
+
49
+ ![Images generated by this sample code](https://huggingface.co/Norod78/sd15-caricature-portraits-blip-captions/resolve/main/example/4A6C928A-5710-4124-8897-6740FCA23D86.jpg)
50
+