shi-labs
/

versatile-diffusion

VersatileDiffusionPipeline

image-variation

Model card Files Files and versions Community

JamesXu commited on Nov 22, 2022

Commit

13be151

•

1 Parent(s): 448e188

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -21,11 +21,11 @@ One single flow of Versatile Diffusion contains a VAE, a diffuser, and a context
   <img src="assets/figures/VD_framework.png" width="99%">
 </p>
-# Caution, Biases, and Content Acknowledgment
-We would like the raise the awareness of users of this demo of its potential issues and concerns. Like previous large foundation models, Versatile Diffusion could be problematic in some cases, partially due to the imperfect training data and pretrained network (VAEs / context encoders) with limited scope. In its future research phase, VD may do better on tasks such as text-to-image, image-to-text, etc., with the help of more powerful VAEs, more sophisticated network designs, and more cleaned data. So far, we keep all features available for research testing both to show the great potential of the VD framework and to collect important feedback to improve the model in the future. We welcome researchers and users to report issues with the HuggingFace community discussion feature or email the authors.
-Beware that VD may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. VD was trained on the LAION-2B dataset, which scraped non-curated online images and text, and may contained unintended exceptions as we removed illegal content. VD in this demo is meant only for research purposes.
 # Citation

   <img src="assets/figures/VD_framework.png" width="99%">
 </p>
+# Cautions, Biases, and Content Acknowledgment
+We would like the raise the awareness of users of this demo of its potential issues and concerns. Like previous large foundation models, Versatile Diffusion could be problematic in some cases, partially due to the imperfect training data and pretrained network (VAEs / context encoders) with limited scope. In its future research phase, VD may do better on tasks such as text-to-image, image-to-text, etc., with the help of more powerful VAEs, more sophisticated network designs, and more cleaned data. So far, we have kept all features available for research testing both to show the great potential of the VD framework and to collect important feedback to improve the model in the future. We welcome researchers and users to report issues with the HuggingFace community discussion feature or email the authors.
+Beware that VD may output content that reinforces or exacerbates societal biases, as well as realistic faces, pornography, and violence. VD was trained on the LAION-2B dataset, which scraped non-curated online images and text, and may contain unintended exceptions as we removed illegal content. VD in this demo is meant only for research purposes.
 # Citation