Hi all and thanks in advance for the help.
I am trying to merge crime data from across the US, which I collected from different cities. While some cities use unified codes for the type of crime (NIBRS, NCIC) others just write in the crime.
Thankfully, most write it in rather similarly (e.g., dataset A: "Larceny" while dataset B: Larceny - Theft"), but I am struggling with how to unify these 'unique' datasets with each other. Right now I would think I need to go through the crimes manually and match them. Then assign a numeric to each type of crime. Then replace the string with a uniform value for that crime.
Does anyone have another way in mind?
I am trying to merge crime data from across the US, which I collected from different cities. While some cities use unified codes for the type of crime (NIBRS, NCIC) others just write in the crime.
Thankfully, most write it in rather similarly (e.g., dataset A: "Larceny" while dataset B: Larceny - Theft"), but I am struggling with how to unify these 'unique' datasets with each other. Right now I would think I need to go through the crimes manually and match them. Then assign a numeric to each type of crime. Then replace the string with a uniform value for that crime.
Does anyone have another way in mind?
Comment