The dataset we are going to use now was created from data published by the city of New York. You can download it from the GitHub repository associated with this book, under ch5/data/mahattan_district.shp. More information about how this file was created can also be found at the same location.
The data file is in a shapefile format, a format very common among spatial data professionals. It contains the boundaries and name of each community district within Manhattan in New York. The following image shows the districts we are going to work with:
The 11 districts of the borough of Manhattan in New York. Image created using QGIS
Any GIS is able to decode the shapefile format, as is neo4j-spatial. The first step of our analysis with Neo4j is to import the data into our graph.