Update README.md
README.md
CHANGED
@@ -37,8 +37,10 @@ pip install transformers accelerate safetensors
 ```
 
 **Notes:**
-- The pipeline uses the `EDMEulerScheduler` scheduler
--
+- The pipeline uses the `EDMEulerScheduler` scheduler. It's an [EDM formulation](https://arxiv.org/abs/2206.00364) of the Euler scheduler.
+  - `guidance_scale=5.0` is a good default for this scheduler.
+- The pipeline also supports the `EDMDPMSolverMultistepScheduler` scheduler. It's an [EDM formulation](https://arxiv.org/abs/2206.00364) of the DPM-Solver multistep scheduler.
+  - `guidance_scale=3.0` is a good default for this scheduler.
 
 Then, run the following snippet:
 
@@ -66,21 +68,27 @@ This model card only provides a brief summary of our user study results. For ext
 
 We conducted studies to measure overall aesthetic quality, as well as the specific areas we aimed to improve with Playground v2.5, namely multi-aspect ratios and human preference alignment.
 
-
+#### Comparison to State-of-the-Art
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63855d851769b7c4b10e1f76/V7LFNzgoQJnL__ndU0CnE.png)
 
-
+The aesthetic quality of Playground v2.5 dramatically outperforms the current state-of-the-art open-source models SDXL and PIXART-α, as well as Playground v2. Because the performance differential between Playground v2.5 and SDXL was so large, we also tested our aesthetic quality against world-class closed-source models like DALL-E 3 and Midjourney 5.2, and found that Playground v2.5 outperforms them as well.
+
+#### Multi-aspect Ratios
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/636c0c4eaae2da3c76b8a9a3/xMB0r-CmR3N6dABFlcV71.png)
 
-
+Similarly, for multi-aspect ratios, we outperform SDXL by a large margin.
 
-
+#### Human-centric Study
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/636c0c4eaae2da3c76b8a9a3/7c-8Stw52OsNtUjse8Slv.png)
 
-
+Next, we benchmarked Playground v2.5 specifically on people-related images to test human preference alignment. We compared Playground v2.5 against two commonly used baseline models: SDXL and RealStock v2, a community fine-tune of SDXL that was trained on a realistic people dataset.
+
+Playground v2.5 outperforms both baselines by a large margin.
+
+### MJHQ-30K Benchmark
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/636c0c4eaae2da3c76b8a9a3/7tyYDPGUtokh-k18XDSte.png)
 
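The scheduler notes added in the first hunk pair each scheduler with a suggested `guidance_scale`. As a rough sketch of how that pairing might be applied in code: the helper names `default_guidance_scale` and `generate` are hypothetical and not part of the README, and `model_id` is left as a required argument because the diff does not state the repository id. Only the two scheduler class names and the two default values come from the notes. The sketch assumes a `diffusers` release that ships both EDM scheduler classes.

```python
# Defaults taken from the README notes in the diff; everything else here
# (function names, argument names) is a hypothetical illustration.
DEFAULT_GUIDANCE_SCALE = {
    "EDMEulerScheduler": 5.0,
    "EDMDPMSolverMultistepScheduler": 3.0,
}


def default_guidance_scale(scheduler_name: str) -> float:
    """Return the guidance_scale the notes suggest for a scheduler class name."""
    try:
        return DEFAULT_GUIDANCE_SCALE[scheduler_name]
    except KeyError:
        raise ValueError(f"No documented default for scheduler {scheduler_name!r}")


def generate(prompt: str, model_id: str,
             scheduler_name: str = "EDMEulerScheduler", device: str = "cuda"):
    """Load the pipeline, optionally swap the scheduler, and generate one image.

    Heavy imports are kept inside the function so the helper above can be
    used without torch or diffusers installed.
    """
    import torch
    import diffusers
    from diffusers import DiffusionPipeline

    pipe = DiffusionPipeline.from_pretrained(
        model_id, torch_dtype=torch.float16
    ).to(device)

    # Swap the scheduler only if the pipeline is not already using it,
    # reusing the existing scheduler config as the starting point.
    scheduler_cls = getattr(diffusers, scheduler_name)
    if not isinstance(pipe.scheduler, scheduler_cls):
        pipe.scheduler = scheduler_cls.from_config(pipe.scheduler.config)

    scale = default_guidance_scale(scheduler_name)
    return pipe(prompt=prompt, guidance_scale=scale).images[0]
```

Calling `generate(prompt, model_id, scheduler_name="EDMDPMSolverMultistepScheduler")` would then use `guidance_scale=3.0` without the caller having to remember the pairing. These values are starting points from the notes, not tuned settings.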