Dear all,
I am currently working with a country-industry-year dataset, and have the following problem. For some countries, there is missing data for some industries. My solution to this problem is to use data from their parent industries, for example if industries within manufacturing have missing data, I want to use data for the aggregate manufacturing industry. This approach works as typically the more aggregate parent industries have data. An example of the dataset is added to hopefully make my problem clearer:
The variable ind_par denotes the parent industry of the respective industry, which is denoted by ind. So the parent industry of D01T03 is D01T99. Then ind_par2 goes one parent level higher if possible, etc. D01T99 is the highest level. As you can see, the industry "D28" has missing data, but parent industry D10T33 does have data. So what I would like to do is use the annual growth rate in the share of their parent industry to plug in values for the industries for the years where this data is missing, i.e. extrapolation. I unfortunately do not know how I can do this. If anyone has an idea on how I can do this, it would be much appreciated.
Best,
Satya
I am currently working with a country-industry-year dataset, and have the following problem. For some countries, there is missing data for some industries. My solution to this problem is to use data from their parent industries, for example if industries within manufacturing have missing data, I want to use data for the aggregate manufacturing industry. This approach works as typically the more aggregate parent industries have data. An example of the dataset is added to hopefully make my problem clearer:
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input str3 cou str7 ind int year str6(ind_par ind_par2 ind_par3) float x_share "ABC" "D01T99" 1990 "" "" "" 1 "ABC" "D01T03" 1990 "D01T99" "" "" .05566159 "ABC" "D90T99" 1990 "D01T99" "" "" .05768238 "ABC" "D69T82" 1990 "D01T99" "" "" .07298723 "ABC" "D05T09" 1990 "D01T99" "" "" .01224463 "ABC" "D84T88" 1990 "D01T99" "" "" .21284407 "ABC" "D28" 1990 "D10T33" "D01T99" "" . "ABC" "D41T43" 1990 "D01T99" "" "" .07382749 "ABC" "D45T47" 1990 "D45T56" "D01T99" "" .16468546 "ABC" "D23" 1990 "D19T23" "D10T33" "D01T99" .00676179 "ABC" "D68" 1990 "D01T99" "" "" .009778853 "ABC" "D10T12" 1990 "D10T33" "D01T99" "" .02407338 "ABC" "D16T18" 1990 "D10T33" "D01T99" "" .017090254 "ABC" "D10T33" 1990 "D01T99" "" "" .1407735 "ABC" "D35T39" 1990 "D01T99" "" "" .014827263 "ABC" "D49T53" 1990 "D45T56" "D01T99" "" .05526453 "ABC" "D31T33" 1990 "D10T33" "D01T99" "" .009405102 "ABC" "D24T25" 1990 "D10T33" "D01T99" "" .02544084 "ABC" "D19T23" 1990 "D10T33" "D01T99" "" .019596357 "ABC" "D64T66" 1990 "D01T99" "" "" .04571036 "ABC" "D29T30" 1990 "D10T33" "D01T99" "" .015155792 "ABC" "D58T63" 1990 "D01T99" "" "" .02584345 "ABC" "D26T27" 1990 "D10T33" "D01T99" "" . "ABC" "D45T56" 1990 "D01T99" "" "" .2778193 "ABC" "D55T56" 1990 "D45T56" "D01T99" "" .05786932 "ABC" "D13T15" 1990 "D10T33" "D01T99" "" .013555143 end
Best,
Satya
Comment