This directory includes a few sample datasets to get you started. | |
* `california_housing_data*.csv` is California housing data from the 1990 US | |
Census; more information is available at: | |
https://developers.google.com/machine-learning/crash-course/california-housing-data-description | |
* `mnist_*.csv` is a small sample of the | |
[MNIST database](https://en.wikipedia.org/wiki/MNIST_database), which is | |
described at: http://yann.lecun.com/exdb/mnist/ | |
* `anscombe.json` contains a copy of | |
[Anscombe's quartet](https://en.wikipedia.org/wiki/Anscombe%27s_quartet); it | |
was originally described in | |
Anscombe, F. J. (1973). 'Graphs in Statistical Analysis'. American | |
Statistician. 27 (1): 17-21. JSTOR 2682899. | |
and our copy was prepared by the | |
[vega_datasets library](https://github.com/altair-viz/vega_datasets/blob/4f67bdaad10f45e3549984e17e1b3088c731503d/vega_datasets/_data/anscombe.json). | |