Data Collection
The gathering of data is an essential component in the study of crime prediction. The official crime records that are maintained by law enforcement agencies serve as the primary source of data collecting for the purpose of crime prediction. These documents provide details on instances involving criminal activity, such as the time, place, and nature of the offenses that were committed. The obtained data on criminal activity is put to use to educate and validate the prediction algorithms. The precision and exhaustiveness with which the crime data is collected will determine the quality of the prediction models.
Connected datasets for the city of Chicago may be collected from a variety of sources, including the Chicago Police Department (CPD), the Chicago Data Portal, and the Illinois State Police, for the purpose of doing analysis that is related to the prediction of crimes. The Chicago Police Department distributes extensive crime statistics to the public, which includes information on arrests, occurrences of crime, and police activities. The Chicago Data Portal offers freely accessible data on a wide variety of issues, one of which is incidences of crime, which may be analyzed to develop crime prediction models. Additionally, the Illinois State Police make crime data, such as crime statistics, available to the public for the purpose of conducting crime prediction analyses. These datasets are used in the development of prediction models that provide assistance to law enforcement authorities in the areas of crime prevention and the resolution of criminal cases.
Considering the scope of the project and the relationship analysis, the following datasets are considered for this project.
Historical Crime Records (2001 - 2018 )
Includes data on past crime trends and patterns, can provide insight into current crime rate.
Socio Economic Data
Includes data regarding Income, education level, poverty rate, etc. Economic status of the area, and how it may be affecting the crime rate.
Environmental Data
Includes data like population density, number of street lights, etc. can be used to understand the physical characteristics of an area and how they may be influencing crime rates.
Temperature
Includes data regarding Income, education level, poverty rate, etc. Economic status of the area, and how it may be affecting the crime rate.
All the API End Points used in Data Collection.
Historic Crime Data: https://data.cityofchicago.org/resource/crimes.json
Police Stations Data: https://data.cityofchicago.org/resource/z8bn-74gv.json
Demographic: https://data.cityofchicago.org/resource/kn9c-c2s2.csv
Data Fetch Approaches
Link to Datasets