3.17 GB
42 files
Updated about 1 month ago
| Name | Size |
| --- | --- |
| ATTRIBUTION/ | |
| .gitattributes | 2.46 kB |
| README.md | 5.08 kB |
| svgfind-CREATIVECOMMONS_00.jsonl.zst | 110 MB |
| svgfind-CREATIVECOMMONS_01.jsonl.zst | 107 MB |
| svgfind-CREATIVECOMMONS_02.jsonl.zst | 97.7 MB |
| svgfind-CREATIVECOMMONS_03.jsonl.zst | 105 MB |
| svgfind-CREATIVECOMMONS_04.jsonl.zst | 90.1 MB |
| svgfind-CREATIVECOMMONS_05.jsonl.zst | 109 MB |
| svgfind-CREATIVECOMMONS_06.jsonl.zst | 88.8 MB |
| svgfind-CREATIVECOMMONS_07.jsonl.zst | 92.3 MB |
| svgfind-CREATIVECOMMONS_08.jsonl.zst | 58.3 MB |
| svgfind-CREATIVECOMMONS_09.jsonl.zst | 51.9 MB |
| svgfind-CREATIVECOMMONS_10.jsonl.zst | 50.6 MB |
| svgfind-CREATIVECOMMONS_11.jsonl.zst | 57.2 MB |
| svgfind-CREATIVECOMMONS_12.jsonl.zst | 58.3 MB |
| svgfind-CREATIVECOMMONS_13.jsonl.zst | 81.4 MB |
| svgfind-CREATIVECOMMONS_14.jsonl.zst | 84.2 MB |
| svgfind-CREATIVECOMMONS_15.jsonl.zst | 79.8 MB |
| svgfind-CREATIVECOMMONS_16.jsonl.zst | 87.5 MB |
| svgfind-CREATIVECOMMONS_17.jsonl.zst | 115 MB |
| svgfind-CREATIVECOMMONS_18.jsonl.zst | 98.3 MB |
| svgfind-CREATIVECOMMONS_19.jsonl.zst | 89.8 MB |
| svgfind-CREATIVECOMMONS_20.jsonl.zst | 85.1 MB |
| svgfind-CREATIVECOMMONS_21.jsonl.zst | 83.7 MB |
| svgfind-CREATIVECOMMONS_22.jsonl.zst | 94.2 MB |
| svgfind-CREATIVECOMMONS_23.jsonl.zst | 98.1 MB |
| svgfind-CREATIVECOMMONS_24.jsonl.zst | 82.8 MB |
| svgfind-CREATIVECOMMONS_25.jsonl.zst | 72.7 MB |
| svgfind-CREATIVECOMMONS_26.jsonl.zst | 80.3 MB |
| svgfind-CREATIVECOMMONS_27.jsonl.zst | 68 MB |
| svgfind-CREATIVECOMMONS_28.jsonl.zst | 66.4 MB |
| svgfind-CREATIVECOMMONS_29.jsonl.zst | 70.5 MB |
| svgfind-CREATIVECOMMONS_30.jsonl.zst | 63 MB |
| svgfind-CREATIVECOMMONS_31.jsonl.zst | 75.8 MB |
| svgfind-CREATIVECOMMONS_32.jsonl.zst | 104 MB |
| svgfind-CREATIVECOMMONS_33.jsonl.zst | 68.8 MB |
| svgfind-CREATIVECOMMONS_34.jsonl.zst | 134 MB |
| svgfind-CREATIVECOMMONS_35.jsonl.zst | 84.9 MB |
| svgfind-CREATIVECOMMONS_36.jsonl.zst | 41.3 MB |
| svgfind-PUBLICDOMAIN.jsonl.zst | 7.96 MB |
README.md

Dataset Card for SVGFind Icons

Dataset Summary

This dataset contains a large collection of Scalable Vector Graphics (SVG) icons sourced from SVGFind.com. The icons cover a wide range of categories and styles and are suitable for user interfaces, web development, and presentations, and potentially for training vector graphics or icon classification models. Each icon is either released under a Creative Commons license or in the Public Domain, as indicated by the license field in its metadata. The SVG files have been minified with tdewolff/minify to reduce file size while preserving their visual appearance, and the data files are compressed with Zstandard.

Languages

The dataset metadata (titles, tags) is primarily in English:

  • English (en)

Dataset Structure

Data Files

The dataset consists of:

  • Metadata and SVG content stored in compressed JSONL format (.jsonl.zst) using Zstandard compression.
  • Data is split into separate files based on the license of the icons:
    • svgfind-CREATIVECOMMONS_00.jsonl.zst through svgfind-CREATIVECOMMONS_36.jsonl.zst (37 shards)
    • svgfind-PUBLICDOMAIN.jsonl.zst
  • The SVG files in this dataset have been minified using tdewolff/minify to reduce file size while preserving visual representation.
  • Attribution details are provided in markdown files within the ATTRIBUTION/ directory, organized by license. These attribution files are also compressed using Zstandard (.md.zst).
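
As a minimal sketch of how one of these files could be read, the snippet below streams records from a .jsonl.zst file line by line instead of decompressing it to disk first. It assumes the third-party zstandard package is installed and that the files are UTF-8 encoded (the card does not state the encoding); the helper names iter_jsonl and iter_shard are hypothetical.

```python
import io
import json
from typing import IO, Iterator


def iter_jsonl(text_stream: IO[str]) -> Iterator[dict]:
    """Yield one dict per non-empty line of a JSONL text stream."""
    for line in text_stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def iter_shard(path: str) -> Iterator[dict]:
    """Stream records from a .jsonl.zst file without writing a decompressed copy."""
    # Imported lazily so iter_jsonl stays usable without the dependency.
    import zstandard  # third-party: pip install zstandard

    with open(path, "rb") as raw:
        reader = zstandard.ZstdDecompressor().stream_reader(raw)
        # Assumption: the JSONL payload is UTF-8 text.
        with io.TextIOWrapper(reader, encoding="utf-8") as text:
            yield from iter_jsonl(text)
```

For example, `next(iter_shard("svgfind-PUBLICDOMAIN.jsonl.zst"))` would return the first record of the public domain file, provided the file has been downloaded to the working directory.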

Data Fields

Each record in the JSONL files contains the following fields:

  • id: Unique identifier for the icon on SVGFind.com.
  • title: Name or title of the icon.
  • data_pack: The collection or pack the icon belongs to on SVGFind.
  • tags: Array of strings representing tags associated with the icon.
  • license: The specific license under which the icon is distributed (e.g., "CREATIVECOMMONS", "PUBLICDOMAIN").
  • license_owner: The name of the entity (creator, company) specified as the license owner on SVGFind.
  • download_url: The original URL to download the SVG file from SVGFind.com.
  • svg_content: String containing the SVG markup for the icon.
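
The field list above can be mirrored in a small typed container, which makes missing fields fail early. This is only a sketch: the class name IconRecord is hypothetical, and the Python types are assumptions inferred from the descriptions (in particular, the card does not say whether id is a string or a number).

```python
from dataclasses import dataclass, fields
from typing import Any, Mapping


@dataclass(frozen=True)
class IconRecord:
    id: Any              # type not stated in the card
    title: str
    data_pack: str
    tags: list[str]
    license: str         # e.g. "CREATIVECOMMONS" or "PUBLICDOMAIN"
    license_owner: str
    download_url: str
    svg_content: str

    @classmethod
    def from_dict(cls, raw: Mapping[str, Any]) -> "IconRecord":
        """Build a record from a parsed JSONL line, ignoring unknown keys."""
        names = [f.name for f in fields(cls)]
        missing = [name for name in names if name not in raw]
        if missing:
            raise KeyError(f"record is missing fields: {missing}")
        return cls(**{name: raw[name] for name in names})
```

Unknown keys are ignored rather than rejected, so the sketch keeps working if the files carry fields that the card does not list.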

Data Splits

The dataset is organized into splits based on the license associated with each icon:

| Split | License Description | Number of Examples |
| --- | --- | --- |
| creativecommons | Creative Commons licenses | 3,645,444 |
| publicdomain | Public Domain | 10,366 |
| Total | | 3,655,810 |
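
Because the Creative Commons data is spread over numbered shards, it can help to map file names back to the two splits. The sketch below does that with a regular expression based on the file names in this repository and re-checks the total from the table; the names split_for_file and group_by_split are hypothetical, and treating the split name as the lowercased license keyword is an assumption.

```python
import re
from collections import defaultdict
from typing import Iterable, Optional

# Matches svgfind-CREATIVECOMMONS_NN.jsonl.zst and svgfind-PUBLICDOMAIN.jsonl.zst.
DATA_FILE_RE = re.compile(r"svgfind-(CREATIVECOMMONS|PUBLICDOMAIN)(?:_\d+)?\.jsonl\.zst")

# Example counts copied from the table above.
SPLIT_COUNTS = {"creativecommons": 3_645_444, "publicdomain": 10_366}
TOTAL_EXAMPLES = sum(SPLIT_COUNTS.values())


def split_for_file(filename: str) -> Optional[str]:
    """Return the split a data file belongs to, or None for other files."""
    match = DATA_FILE_RE.fullmatch(filename)
    return match.group(1).lower() if match else None


def group_by_split(filenames: Iterable[str]) -> dict[str, list[str]]:
    """Group data files by split, skipping files such as README.md."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for name in filenames:
        split = split_for_file(name)
        if split is not None:
            grouped[split].append(name)
    return {split: sorted(names) for split, names in grouped.items()}
```

With these counts, the public domain split makes up roughly 0.28% of all examples (10,366 of 3,655,810).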

License Information

Licensing Structure

This dataset aggregates icons that are either distributed under Creative Commons licenses or in the Public Domain. Each icon has exactly one license associated with it, recorded in the license field of its metadata; that license determines how the icon can be used, modified, and distributed.

Dataset License Overview and Attribution Files

The table below shows the distribution of licenses across the icons in this dataset and the corresponding files containing detailed attribution information. The filenames reflect the actual files present in the ATTRIBUTION/ directory.

| License | Works | Attribution File |
| --- | --- | --- |
| Creative Commons (CC) | 3,645,444 | CREATIVECOMMONS_Attribution.md |
| Public Domain (PD) | 10,366 | PUBLICDOMAIN_Attribution.md |

Note: Attribution files are provided for licenses typically requiring it or where creator information was available. Public Domain does not legally require attribution, but it's often appreciated.

Full attribution details, listing creators/owners and their works under each license, are located in the ATTRIBUTION/ directory, organized by license type.
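
When only a subset of icons is used, credit lines can also be assembled directly from the metadata fields. The following is a rough sketch, not the format used in the ATTRIBUTION/ files: the credit wording and the function names are hypothetical, and it conservatively assumes that every record not marked "PUBLICDOMAIN" should be credited.

```python
from collections import defaultdict
from typing import Iterable


def needs_attribution(record: dict) -> bool:
    # Assumption: anything that is not public domain gets a credit line.
    return record["license"] != "PUBLICDOMAIN"


def credit_line(record: dict) -> str:
    """Format one credit line (hypothetical wording) from the metadata fields."""
    return (
        f'"{record["title"]}" by {record["license_owner"]} '
        f'via SVGFind ({record["download_url"]})'
    )


def credits_by_license(records: Iterable[dict]) -> dict[str, list[str]]:
    """Collect credit lines for the records that need one, keyed by license."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for record in records:
        if needs_attribution(record):
            grouped[record["license"]].append(credit_line(record))
    return dict(grouped)
```

Public domain records are skipped here only because attribution is not legally required for them; crediting them as well is a one-line change in needs_attribution.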

Further License Information

For detailed information about each license type, please refer to their official sources:

  • Creative Commons (CC) Licenses: https://creativecommons.org/licenses/
  • Public Domain: Works in the public domain are free from copyright restrictions and can be used without permission.

Last updated: Jun 2