Dear Teachers and Researchers,
I have several questions related to my data. I am currently in the stage of progress report for my Dissertation. I am investigating the institutional effects on emigration in developing countries.
The dataset which I am working on is with 5 years interval for the period between 1990 and 2010 for the regressand, while my predictor variables are yearly.
For instance, for 1990, I am taking the average for my predictor variables ( sum(1986;1987;1988;1989) / 4 ) The same technique was used for other respective years using the same method, as it is suggested by my some friends, hence our independent variable predicts the dependent variable.
So my questions are:
1) Is it correct to do regression analysis using the included method, or I have to include all 5 years and take the average ( sum(1986;1987;1988;1989;1990 / 5)?
2) Another question is related to sample selection bias. As I mentioned above I am focusing on developing countries. The data on emigration rate for developing countries is available for 128 countries, while the data for the explanatory variable of primary interest is available for 93 countries.
Do I need still to keep data for 35 countries, despite of no data available for them? Or is it acceptable to drop them?
Thank you for your answers in advance,
Wish you all good health,
Kind regards
I have several questions related to my data. I am currently in the stage of progress report for my Dissertation. I am investigating the institutional effects on emigration in developing countries.
The dataset which I am working on is with 5 years interval for the period between 1990 and 2010 for the regressand, while my predictor variables are yearly.
For instance, for 1990, I am taking the average for my predictor variables ( sum(1986;1987;1988;1989) / 4 ) The same technique was used for other respective years using the same method, as it is suggested by my some friends, hence our independent variable predicts the dependent variable.
So my questions are:
1) Is it correct to do regression analysis using the included method, or I have to include all 5 years and take the average ( sum(1986;1987;1988;1989;1990 / 5)?
2) Another question is related to sample selection bias. As I mentioned above I am focusing on developing countries. The data on emigration rate for developing countries is available for 128 countries, while the data for the explanatory variable of primary interest is available for 93 countries.
Do I need still to keep data for 35 countries, despite of no data available for them? Or is it acceptable to drop them?
Thank you for your answers in advance,
Wish you all good health,
Kind regards
Comment