Hi, I am using a panel data set to estimate the gravity model using PPML. I have trade flow data y, with an endogenous regressor x (continuous), a set of exogenous regressors w (including exporter (32), importer (49), time (18) dummy variables).
I am trying to do two-stage regression. I set an endogenous variable (x) as a function of instruments and exporter, importer dummies in exogenous regressors (w).
Hence, in the first stage, I regress the endogenous variable (x) on instruments (z) and exogenous regressors (w) through OLS and then obtain predicted x.
In the second stage, I estimated y=exp(constant + predicted x + w) by using PPML proposed by Silva and Tenreyro (2006).
My code is:
My question is:
(1) is it possible to do estimation like this? (first stage OLS, second stage PPML)
(2) Is there anything that I need to care about?
(3) Is it better to use different methods? (such as ivpoission GMM)
I am trying to do two-stage regression. I set an endogenous variable (x) as a function of instruments and exporter, importer dummies in exogenous regressors (w).
Hence, in the first stage, I regress the endogenous variable (x) on instruments (z) and exogenous regressors (w) through OLS and then obtain predicted x.
In the second stage, I estimated y=exp(constant + predicted x + w) by using PPML proposed by Silva and Tenreyro (2006).
My code is:
Code:
*STAGE 1 reg x w z predict yhat if e(sample) *STAGE 2 ppml y w yhat, cluster(dist)
(1) is it possible to do estimation like this? (first stage OLS, second stage PPML)
(2) Is there anything that I need to care about?
(3) Is it better to use different methods? (such as ivpoission GMM)
Comment