An interesting issue: How do deal with multicollinearity, endogeneity and interpret the interaction term in STATA using a panel dataset (Can we use a mix of ivregress, xtdpdml, xtdpdsys?)
The model
ŷ = b0 + b1X1 + b2X2 + b3X1X2
ŷ =company financial performance metric
X1 = carbon emissions
X2 = carbon assurance
X1X2 = interaction term
The issues:
Let’s say:
For a 2SLS – would you useivregress
For a dynamic GMM would you use xtdpdml or xtdpdsys
Any help in addressing these issues would be greatly appreciated.
The model
ŷ = b0 + b1X1 + b2X2 + b3X1X2
ŷ =company financial performance metric
X1 = carbon emissions
X2 = carbon assurance
X1X2 = interaction term
The issues:
Let’s say:
- X1 + X2 are related (but there is no perfect multicollinearity)
- X2 is endogenous
- What is the main effect of X1 cause ŷ ?
- What is the moderating effect of X2 in the relationship between X1 & ŷ?
- What is the mediating effect of X2 in the relationship between X1 & ŷ?
- Can we use a 2SLS regression to deal with the issues by finding a Z variable for X2 that is not related with ŷ?
- Can we use a dynamic GMM model to deal with the endogeneity issues but will the dynamic GMM model deal with the multicollinearity between X1 + X2?
- Will either of the above solutions allow the researcher to interpret the coefficients against X1 and X1X2 appropriately?
For a 2SLS – would you useivregress
For a dynamic GMM would you use xtdpdml or xtdpdsys
Any help in addressing these issues would be greatly appreciated.