Dear Statalisters,
I have two datasets that contain industry codes, I want to see how much overlap in between. Below is the first few observations from each data.
data one
industryA
1111A0
1111B0
111200
111335
1113A0
111400
111910
111920
1119A0
1119B0
112100
data two
industryB
111200
111335
111400
111910
111920
112100
112300
113300
114100
114200
115000
the number of observation is 470 for one and 450 for two.
How should I join the two data ?
after that, should i use
gen flag=0
replace flag=1 if industryB~=industryA
thanks,
Rochelle
I have two datasets that contain industry codes, I want to see how much overlap in between. Below is the first few observations from each data.
data one
industryA
1111A0
1111B0
111200
111335
1113A0
111400
111910
111920
1119A0
1119B0
112100
data two
industryB
111200
111335
111400
111910
111920
112100
112300
113300
114100
114200
115000
the number of observation is 470 for one and 450 for two.
How should I join the two data ?
after that, should i use
gen flag=0
replace flag=1 if industryB~=industryA
thanks,
Rochelle
Comment