Data

 

  1. Vancouver Neighbourhood Boundary File: Some neighbourhood names did not match the neighbourhood names in the crime data. To perform several joins, spatial-joins, and other analyses, few of the names were manually corrected.
  2. Census Tract Boundary File: The original CT file contained boundaries for all of Canada. To enhance performance and visualization, the data was parsed to only contain Vancouver boundaries.
  3. Population, Income, Unemployment Data: The population census data had three fields – total population, population age 15-64, and population age 65+. For analyses that require normalization, population age 15-64 and population age 65+ was divided by the total population. From the several options of income data, median income was chosen as opposed to average income to avoid extremities. Average income can be often skewed from outliers of wealthy individuals.
  4. Crime Data: All data was divided into year 2019, 2020, 2021 and from each year, the five crime types in question (B&E Commercial, B&E Residential, Mischief, Auto Theft, Theft) was exported. From this five crime type per each year was filtered again to obtain each crimes for every year. The final result would have the crimes divided by each type and year, and a holistic five crime type per each year.