Hi everyone,
I am trying to estimate a GMM panel dynamic model using a database of 31 countries (European countries) and 10 years.
My dependent variable is log_evshare (electric vehicles registrations over total vehicle registrations), regressed using fixed effects (country-based) over (1) a continuous variable monetaryincentive, (2) log_elecdieselratio, aka the ratio between the price of electricity and the price of diesel, and (3) a trend variable year.
The basic characteristics of my data/model revolve around two considerations:
Thank you very much and sorry for the long post.
I am trying to estimate a GMM panel dynamic model using a database of 31 countries (European countries) and 10 years.
My dependent variable is log_evshare (electric vehicles registrations over total vehicle registrations), regressed using fixed effects (country-based) over (1) a continuous variable monetaryincentive, (2) log_elecdieselratio, aka the ratio between the price of electricity and the price of diesel, and (3) a trend variable year.
The basic characteristics of my data/model revolve around two considerations:
- The dependent variable is dynamic (as confirmed by running autocorrelation tests for residuals after performing a simple xtreg, fe using the variables above).
- A FE model might suffer from omitted variable bias from not including charging, aka the number of charging stations present in the country. I then thought of using l.charging as additional control variable in my FE model to limit omitted variable bias but, at the same time, avoid introducing reverse causality.
- First stage: l2.charging explains l1.log_evshare, which in turn explains log_evshare
- Exclusion restriction: respected, since it's assumed that l2.charging affects log_evshare only through l1.log_evshare
- How do I choose how many lags to include? The reason I wanted to limit lags to gmmstyle (..., lags(0,1)) is that I then also have ivstyle(l2.charging, eq(l)). I worry that if I include more lags of the depvar in gmmstyle, my IV might not respect the exclusion restriction.
- How do I chose if I should include monetaryincentive and log_elecdieselratio inside the gmmstyle specifications? In case I add more controls (e.g. unemployment rate), should they always be included in the gmmstyle form too?
- Based on what do I include or not the trend year? I supposed modelling the depvar as dynamic makes it less necessary to include a trend. Is that correct?
- As a consequence of the previous three point, I did not find the correct specification yet to decide whether to use diff-GMM or system GMM. However, the following Pooled OLS regression suggests an estimate for l.log_evshare of 0.73. Is it correct to consider that diff-GMM could be a correct model, since log_evshare does not exhibit a random walk (beta -> 1)? Or should I move to system-GMM considering my small sample size?
Code:
regress log_evshare l.log_evshare monetaryincentive log_elecdieselratio l.charging year, vce(cluster countryid)
Comment