Hello,
I have a very long string variable and I need to create a dummy if that variable contains a certain term. The term however is mispelled most of the time and I need to detect the different variations of it along the different observations. I was thinking about using some string distance command to detect the variations by using a threshold like in the strgroup command. Is there any way to do so in Stata?
Thank you!
I have a very long string variable and I need to create a dummy if that variable contains a certain term. The term however is mispelled most of the time and I need to detect the different variations of it along the different observations. I was thinking about using some string distance command to detect the variations by using a threshold like in the strgroup command. Is there any way to do so in Stata?
Thank you!
Comment