Mitsua commited on
Commit
9e9ec83
1 Parent(s): f195a50

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -124,7 +124,7 @@ We pre-process with face-blurring.
124
  - Images license is either Public Domain, CC0, CC BY or CC BY-SA (varies by image).
125
  - Text license is either CC0 (from Wikidata and Wikimedia Commons structured data) or CC BY-SA 4.0 (from Wikipedia and Wikimedia Commons non-structured data).
126
  - Curated by ELAN MITSUA Project / Abstract Engine.
127
- - **All image attributions are found here.**
128
  - How we curate this dataset
129
  - **Problem statement** :
130
  - Our goal to build this dataset is to achieve both quality and copyright/privacy safety.
@@ -147,13 +147,13 @@ We pre-process with face-blurring.
147
  - Images and metadata collected from these museums open access. All images and metadata are shared under CC0 or Public Domain.
148
  - We created image caption only from these metadata.
149
  - [Smithsonian Open Access](https://www.si.edu/openaccess) (CC0)
150
- - Image attribution found here.
151
  - [The Metropolitan Museum of Art Open Access](https://github.com/metmuseum/openaccess) (CC0)
152
- - Image attribution found here.
153
  - [The Cleveland Museum of Art Open Access](https://github.com/ClevelandMuseumArt/openaccess) (CC0)
154
- - Image attribution found here.
155
  - [The Art Institute of Chicago Open Access](https://www.artic.edu/open-access/open-access-images) (CC0)
156
- - Image attribution found here.
157
  - Curated by ELAN MITSUA Project / Abstract Engine.
158
 
159
  * Even if the dataset itself is CC-licensed, we did not use it if the image contained in the dataset is not properly licensed, is based on unauthorized use of copyrighted works, or is based on the synthetic data output of other pretrained models.
 
124
  - Images license is either Public Domain, CC0, CC BY or CC BY-SA (varies by image).
125
  - Text license is either CC0 (from Wikidata and Wikimedia Commons structured data) or CC BY-SA 4.0 (from Wikipedia and Wikimedia Commons non-structured data).
126
  - Curated by ELAN MITSUA Project / Abstract Engine.
127
+ - [**All image attributions are found here.**](commons_ccpd_attribution_likes_CLIP.zip)
128
  - How we curate this dataset
129
  - **Problem statement** :
130
  - Our goal to build this dataset is to achieve both quality and copyright/privacy safety.
 
147
  - Images and metadata collected from these museums open access. All images and metadata are shared under CC0 or Public Domain.
148
  - We created image caption only from these metadata.
149
  - [Smithsonian Open Access](https://www.si.edu/openaccess) (CC0)
150
+ - [Image Attribution found here](Smithsonian_2024_attribution.csv).
151
  - [The Metropolitan Museum of Art Open Access](https://github.com/metmuseum/openaccess) (CC0)
152
+ - [Image Attribution found here](MET_2024_attribution.csv).
153
  - [The Cleveland Museum of Art Open Access](https://github.com/ClevelandMuseumArt/openaccess) (CC0)
154
+ - [Image Attribution found here](CMA_2024_attribution.csv).
155
  - [The Art Institute of Chicago Open Access](https://www.artic.edu/open-access/open-access-images) (CC0)
156
+ - [Image Attribution found here](artic_2024_attribution.csv).
157
  - Curated by ELAN MITSUA Project / Abstract Engine.
158
 
159
  * Even if the dataset itself is CC-licensed, we did not use it if the image contained in the dataset is not properly licensed, is based on unauthorized use of copyrighted works, or is based on the synthetic data output of other pretrained models.