Hello,
I am working with panel data containing 297177 firm observations and use the -reghdfe- command to estimate my model [ reghdfe lnEBIT Tax_rate c.Tax_rate#c.Bilateral_agreement Bilateral_agreement control variables, absorb(NEWID year industryyear) vce(cluster NEWID countryyear) ]. After running the regression, I am informed that 10420 singleton observations were dropped. I would like to identify them, so I tried to drop them manually with bysort NEWID: drop if _N==1, where NEWID groups one subsidiary and its parent firm observed over several years. This deleted 9649 observations. However, when I run the regression again, -reghdfe- still drops 771 singleton observations. Q1: Why is this so, and how can I identify them?
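A minimal sketch of how the dropped observations might be flagged, assuming -reghdfe- stores the estimation sample in e(sample) as other estimation commands do. The local macro controls and the variable names in_sample, n_missing and dropped_singleton are hypothetical and only serve the illustration. One possible reason for the remaining 771 is that singletons can also occur within year or industryyear groups, and removing one singleton can turn another group into a singleton, so a single pass over NEWID would not catch them all.

```stata
* Hypothetical placeholder: fill in the actual control variables here
local controls ""

reghdfe lnEBIT Tax_rate c.Tax_rate#c.Bilateral_agreement Bilateral_agreement `controls', ///
    absorb(NEWID year industryyear) vce(cluster NEWID countryyear)

* e(sample) equals 1 for observations used in the estimation
gen byte in_sample = e(sample)

* Count missing values across all variables that enter the command
egen n_missing = rowmiss(lnEBIT Tax_rate Bilateral_agreement `controls' ///
    NEWID year industryyear countryyear)

* Complete observations that were still excluded from the estimation
gen byte dropped_singleton = (n_missing == 0) & !in_sample

tabulate dropped_singleton
```

Observations with dropped_singleton equal to 1 have no missing values yet were left out of the estimation, which is what would be expected of the singletons.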
Q2 relates to the interaction term c.Tax_rate#c.Bilateral_agreement, where Bilateral_agreement is a dummy variable equal to 1 if the agreement exists and 0 if it does not. I checked the correlation matrix, and the correlation between Bilateral_agreement and c.Tax_rate#c.Bilateral_agreement is .9925. What should I do in such a case: drop Bilateral_agreement or center Tax_rate? When I center Tax_rate, the correlation between Bilateral_agreement and the interaction term drops to .2222.
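For reference, the centering step can be sketched as follows, assuming Tax_rate is centered at its overall sample mean. The variable names Tax_rate_c, inter_raw and inter_c are hypothetical, and the products are generated explicitly so that they can be passed to -correlate- as ordinary variables.

```stata
* Center Tax_rate at its sample mean
summarize Tax_rate, meanonly
gen double Tax_rate_c = Tax_rate - r(mean)

* Build the raw and the centered interaction terms explicitly
gen double inter_raw = Tax_rate * Bilateral_agreement
gen double inter_c   = Tax_rate_c * Bilateral_agreement

* Compare the correlation with the dummy before and after centering
correlate Bilateral_agreement inter_raw
correlate Bilateral_agreement inter_c
```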
Thank you,
An