multimodalart HF staff commited on
Commit
184e314
1 Parent(s): 1216ada

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -92,6 +92,8 @@ Using the model to generate content that is cruel to individuals is a misuse of
92
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
93
  and is not fit for product use without additional safety mechanisms and
94
  considerations.
 
 
95
 
96
  ### Bias
97
  While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.
 
92
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
93
  and is not fit for product use without additional safety mechanisms and
94
  considerations.
95
+ - No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
96
+ The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
97
 
98
  ### Bias
99
  While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.