Hi,
I have compiled a cross-country panel dataset of various education indicators, Gini and Palma indices for my dissertation on inequality. After transforming this from wide to long to run my regression, there is a lot of missing values where Stata has auto-generated the missing years. I have done a Shapiro-Wilk test and found all my variables are of non-normal distribution (partly due to missing values?). How should I proceed with filling in the missing data? For instance, is it suitable to use multiple imputation or interpolation? I have not formally tested for MAR or MCAR but I know that none of my variables are MNAR.
I have compiled a cross-country panel dataset of various education indicators, Gini and Palma indices for my dissertation on inequality. After transforming this from wide to long to run my regression, there is a lot of missing values where Stata has auto-generated the missing years. I have done a Shapiro-Wilk test and found all my variables are of non-normal distribution (partly due to missing values?). How should I proceed with filling in the missing data? For instance, is it suitable to use multiple imputation or interpolation? I have not formally tested for MAR or MCAR but I know that none of my variables are MNAR.
Comment