Setting up a dataset
In principle, you can upload to Kaggle any data you can use (subject to limitations; see the legal caveats section below). At the time of writing, the limits are 20 GB per dataset and a 100 GB total quota. Keep in mind that the per-dataset size limit is calculated on the uncompressed data: uploading compressed versions speeds up the transfer but does not help you stay under the limit. You can find the most recent documentation for datasets at this link:
https://www.kaggle.com/docs/datasets
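Because the limit applies to the uncompressed size, it can be worth checking that number locally before starting a long upload. The snippet below is a minimal sketch of such a check, not part of any Kaggle tooling: the helper names uncompressed_size and fits_dataset_limit are our own, the 20 GB figure is the one quoted above, and treating 1 GB as 1024**3 bytes is an assumption.

```python
import zipfile
from pathlib import Path

# Per-dataset limit quoted in the text (20 GB).
# Assumption: 1 GB is taken as 1024**3 bytes; the platform may count differently.
DATASET_LIMIT_BYTES = 20 * 1024**3


def uncompressed_size(path):
    """Return the uncompressed size in bytes of a directory, zip archive, or file."""
    path = Path(path)
    if path.is_dir():
        # Sum the on-disk size of every regular file below the directory.
        # Archives inside the directory are counted at their compressed size.
        return sum(p.stat().st_size for p in path.rglob("*") if p.is_file())
    if zipfile.is_zipfile(path):
        # file_size is the size of each member after extraction.
        with zipfile.ZipFile(path) as archive:
            return sum(info.file_size for info in archive.infolist())
    return path.stat().st_size


def fits_dataset_limit(path, limit=DATASET_LIMIT_BYTES):
    """Check whether the uncompressed data stays within the given limit."""
    return uncompressed_size(path) <= limit
```

Calling fits_dataset_limit("my_data.zip") (with a hypothetical archive name) reports whether the extracted contents would stay within the limit, regardless of how small the archive itself is.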
Kaggle promotes itself as a “home of open data science”, and the impressive collection of datasets available from the site certainly lends some credence to that claim. Before uploading the data for your project into a dataset, make sure to check the existing content: for several popular applications, there is a chance it has already been stored there.

For the sake of this introduction, let us assume the kind of data you will be...