I am using the command reclink to match company names in two different datasets.
For this purpose, "Apple" and "APPLE" are the same. "Ltd" and "ltd" are the same.
But reclink considers them different.
Using "proper" command to turn "APPLE" to "Apple" does not help completely, because there are actual cases of company names containing consecutive capital letters (ABC). Those cases should not be turned into Abc.
Can I force reclink to become case insensitive?
Or is there any other command that does what reclink does but is case insensitive?
Here is a do file that can test case sensitivity of reclink.
For this purpose, "Apple" and "APPLE" are the same. "Ltd" and "ltd" are the same.
But reclink considers them different.
Using "proper" command to turn "APPLE" to "Apple" does not help completely, because there are actual cases of company names containing consecutive capital letters (ABC). Those cases should not be turned into Abc.
Can I force reclink to become case insensitive?
Or is there any other command that does what reclink does but is case insensitive?
Here is a do file that can test case sensitivity of reclink.
Code:
clear set obs 9 gen company_name = "Apple" gen company_number=_n replace company_name = "MICROSOFT" if _n==2 | _n==9 replace company_name = "Facebook" if _n==1 | _n==4 save "usingdata",replace clear set obs 10 gen company_name = "Apple" gen idmaster=_n replace company_name = "Microsoft" if _n==3 | _n==6 reclink company_name using "usingdata", idmaster(idmaster) idusing(company_number) gen(match_score)
Comment