multimodalart HF staff commited on
Commit
26c6da0
1 Parent(s): 0b2e68a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -149,6 +149,8 @@ Using the model to generate content that is cruel to individuals is a misuse of
149
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
150
  and is not fit for product use without additional safety mechanisms and
151
  considerations.
 
 
152
 
153
  ### Bias
154
 
 
149
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
150
  and is not fit for product use without additional safety mechanisms and
151
  considerations.
152
+ - No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
153
+ The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
154
 
155
  ### Bias
156