Hi everyone. As you can see this is my first post. After reading a lot of posts in this forum I still have one question or issue regarding variable selection process in panel data analysis. There hasn't been many econometric studies in my field of research so I had to start from somewhere and spent a lot of time and effort to collect data on 30 potential predictors in my panel data research. One of the research questions in my dissertation is to examine wide range of variables. My question is: how to determine which predictors are statistically significant to keep them in my model?
a) Can I do it based on their p-values? To start with 30 predictors and eliminate predictor with highest p-value, and then do the process all over again until all predictors are significant (backward elimination). From what model to start in order to get p-values: pooled, fe or re?
b) Many experienced stata users in this forum suggested that selection based on p-values is not the best solution and that selection should be done with non-inferential statistics (like correlation coefficients). Can someone please explain that process and criteria?
Every advice is highly appreciated.
a) Can I do it based on their p-values? To start with 30 predictors and eliminate predictor with highest p-value, and then do the process all over again until all predictors are significant (backward elimination). From what model to start in order to get p-values: pooled, fe or re?
b) Many experienced stata users in this forum suggested that selection based on p-values is not the best solution and that selection should be done with non-inferential statistics (like correlation coefficients). Can someone please explain that process and criteria?
Every advice is highly appreciated.
Comment