multimodalart (HF staff) committed
Commit b2e8107
1 Parent(s): 2d6f1cd

Update README.md

  1. README.md +2 -0
README.md CHANGED
@@ -146,6 +146,8 @@ Using the model to generate content that is cruel to individuals is a misuse of
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
  and is not fit for product use without additional safety mechanisms and
  considerations.
+ - No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
+ The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
 
  ### Bias