XCLiu committed on commit f26a78d · 1 parent: a34d31c

Update README.md

Files changed (1)
  1. README.md +50 -0
README.md CHANGED
@@ -1,3 +1,53 @@
  ---
  license: cc-by-nc-4.0
  ---
+
+ # InstaFlow: 2-Rectified Flow fine-tuned from Stable Diffusion v1.5
+
+ 2-Rectified Flow is a few-step text-to-image generative model fine-tuned from Stable Diffusion v1.5.
+
+ We use text-conditioned reflow as described in [our paper](https://arxiv.org/abs/2309.06380).
+
+ Reflow has interesting theoretical properties. For details, see [this ICLR paper](https://arxiv.org/abs/2209.03003) and [this arXiv paper](https://arxiv.org/abs/2209.14577).
+
+ ## Images Generated from Random Diffusion DB prompts
+
+ | 4-Step | 8-Step | 25-Step |
+ | --- | --- | --- |
+ | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646b0bbdec9a61e871799339/-6RX-t4ilNPwOy6POaH85.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646b0bbdec9a61e871799339/YX1hThlsMFXpPLllVgNse.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646b0bbdec9a61e871799339/uQBM25_BzLhFosXvF7y9S.png) |
+
+ ## Usage
+
+ Please refer to the [official GitHub repo](https://github.com/gnobitab/InstaFlow).
+
+ ## Training
+
+ Training pipeline:
+ 1. Reflow (Stage 1): We train the model using the text-conditioned reflow objective with a batch size of 64 for 70,000 iterations.
+    The model is initialized from the pre-trained SD 1.5 weights. (11.2 A100 GPU days)
+ 2. Reflow (Stage 2): We continue to train the model using the text-conditioned reflow objective with an increased batch size of 1,024 for 25,000 iterations. (64 A100 GPU days)
+
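The text-conditioned reflow objective named in both stages can be sketched roughly as follows. This is a minimal illustration and not the authors' training code: it assumes the usual rectified-flow formulation, in which the model regresses the straight-line direction `x1 - x0` at a point interpolated between a noise sample `x0` and its paired image `x1`. The names `interpolate`, `reflow_loss`, and `velocity_fn` are hypothetical, and averaging over dimensions is an arbitrary choice made here.

```python
def interpolate(x0, x1, t):
    """Point on the straight line from noise x0 (t=0) to image x1 (t=1)."""
    return [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]


def reflow_loss(velocity_fn, x0, x1, prompt, t):
    """Mean squared error between the predicted velocity and x1 - x0.

    velocity_fn is a hypothetical stand-in for the text-conditioned network;
    it takes (x_t, t, prompt) and returns a velocity of the same length as x_t.
    """
    x_t = interpolate(x0, x1, t)
    predicted = velocity_fn(x_t, t, prompt)
    target = [b - a for a, b in zip(x0, x1)]
    return sum((p - q) ** 2 for p, q in zip(predicted, target)) / len(target)


# Toy pair: in reflow, x1 would be produced from x0 by the previous model's ODE.
x0 = [0.0, 0.0]
x1 = [1.0, 2.0]

# A velocity function that always predicts zero, to show a non-zero loss.
zero_velocity = lambda x_t, t, prompt: [0.0 for _ in x_t]
loss = reflow_loss(zero_velocity, x0, x1, "a photo of a cat", t=0.3)
```

In an actual training loop, `t` would be drawn at random for each pair and the loss averaged over a batch; those details are omitted here.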
+ The final model is **2-Rectified Flow**.
+
+ **Total Training Cost:** It takes 75.2 A100 GPU days to get 2-Rectified Flow.
+
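As a quick sanity check, the total can be reproduced from the per-stage figures listed above. The snippet also derives the number of training samples processed per stage as batch size times iterations; that count is an inference from the stated numbers, not a figure reported by the authors.

```python
# Per-stage figures copied from the training pipeline above.
stages = [
    {"name": "Stage 1", "batch_size": 64, "iterations": 70_000, "gpu_days": 11.2},
    {"name": "Stage 2", "batch_size": 1024, "iterations": 25_000, "gpu_days": 64.0},
]

# Sum of A100 GPU days, rounded to one decimal to avoid float noise.
total_gpu_days = round(sum(s["gpu_days"] for s in stages), 1)

# Samples processed per stage (derived: batch size x iterations).
samples_seen = {s["name"]: s["batch_size"] * s["iterations"] for s in stages}

print(total_gpu_days)
print(samples_seen)
```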
+ ## Evaluation Results - Metrics
+
+ The following metrics for 2-Rectified Flow are measured on MS COCO 2017 using 5,000 images and a 25-step Euler solver.
+
+ FID-5k = 21.5, CLIP score = 0.315
+
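For readers unfamiliar with the term, an N-step Euler solver for a flow model can be sketched as below. This is a generic illustration under stated assumptions, not the evaluation code: `euler_sample` and `toy_velocity` are hypothetical names, time runs from 0 (noise) to 1 (image), and the toy velocity field stands in for the text-conditioned network.

```python
def euler_sample(velocity_fn, z0, num_steps):
    """Integrate dz/dt = velocity_fn(z, t) from t=0 to t=1 with fixed-size Euler steps."""
    z = list(z0)
    dt = 1.0 / num_steps
    for k in range(num_steps):
        t = k * dt
        v = velocity_fn(z, t)
        z = [zi + dt * vi for zi, vi in zip(z, v)]
    return z


# Toy velocity field whose trajectories are straight lines ending at TARGET.
TARGET = [1.0, -2.0]


def toy_velocity(z, t):
    return [(x - zi) / (1.0 - t) for x, zi in zip(TARGET, z)]


# The same start point, integrated with the step counts shown in the image table.
samples = {n: euler_sample(toy_velocity, [0.0, 0.0], n) for n in (4, 8, 25)}
```

Because the toy trajectories are perfectly straight, every step count reaches the same end point up to floating-point error. A learned model is only approximately straight, which is why the number of steps can affect sample quality.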
+ ## Evaluation Results - Impact of Guidance Scale
+
+ We
+
+ ## Citation
+ ```
+ @article{liu2023insta,
+   title={InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation},
+   author={Liu, Xingchao and Zhang, Xiwen and Ma, Jianzhu and Peng, Jian and Liu, Qiang},
+   journal={arXiv preprint arXiv:2309.06380},
+   year={2023}
+ }
+ ```