ANL303: Fundamentals of Data Mining
Public safety is an important issue in every society. Crime analysis can help governments and law enforcement agencies understand crime patterns and prevent and resolve crimes effectively, so that citizens feel safe.
You are given a dataset, “Brazilian_Crimes.csv”, that contains 6,672 records of crime incidents that occurred in Brazil. The dataset is described in Table 1.
Table 1. Description of the dataset “Brazilian_Crimes.csv”
Import the dataset into IBM SPSS Modeler and answer Question 1.

Q1) The “Crime_Type” field contains values other than “ROBBERY” and “THEFT”. Prepare the dataset by recoding those values as “OTHERS” using IBM SPSS Modeler. Provide the necessary screenshot(s) to illustrate your data preparation steps.
Apart from the data quality issue mentioned above, identify one more data quality issue in the dataset. Propose a method to resolve it and give reason(s) for your choice. Then prepare the dataset in IBM SPSS Modeler accordingly. Provide the necessary screenshot(s) to support your answer.
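The recoding rule in the first part of Q1 can be illustrated outside the tool. The following is a minimal Python sketch of the rule only, not the IBM SPSS Modeler stream the question asks for. The sample rows and the helper name `recode_crime_type` are hypothetical, and the sketch assumes exact, case-sensitive matching, so blank or differently cased values would also become “OTHERS”.

```python
# Values to keep as they are; everything else is recoded.
KEEP = {"ROBBERY", "THEFT"}


def recode_crime_type(value):
    """Return the value unchanged if it is ROBBERY or THEFT, else "OTHERS"."""
    return value if value in KEEP else "OTHERS"


# Hypothetical sample rows; the actual values in Brazilian_Crimes.csv may differ.
rows = [
    {"Crime_Type": "ROBBERY"},
    {"Crime_Type": "THEFT"},
    {"Crime_Type": "FRAUD"},
    {"Crime_Type": ""},
]

# Build new rows with Crime_Type recoded, leaving any other fields untouched.
recoded = [
    {**row, "Crime_Type": recode_crime_type(row["Crime_Type"])}
    for row in rows
]

for row in recoded:
    print(row["Crime_Type"])
```

After recoding, the field should hold only three distinct values, which is a quick check that can also be made on the prepared data in IBM SPSS Modeler.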