I have to create an aggregate variable/index (say new1 indicating policy stringency of a particular sector) out of 5 variables (say x1-x5) indicating policy stringency. I have panel data from 40 countries and 20 years. These 5 variables are scaled between 0-10 (already normalized), 0 being no policy in place and 10 being high stringency. Now i have 5 variables, each scaled between 0-10. If I want to create an aggregate variable/index (new1) out of these 5 variables (x1-x5), I get into several issues:
NB: Interpolation/extrapolation is not an option here since i have so many missing values
- There are some missing values in some variables. for example, variables x1,x2 have missing values for the year 2007 for a country
- There are 0 values in some variables. for example, variables x3 and x4 have 0 values for 2007 for the same country
- There is a positive value for x5 in the year 2007. Let's say the value is 1.25.
NB: Interpolation/extrapolation is not an option here since i have so many missing values
Comment