PPML Gravity Model help requested

majid lateef

Join Date: Apr 2016

Posts: 9
#16

01 May 2017, 02:10

Resepected Joao,
Really thankful to you for your help.

Best Regards,
Majid Lateef
Comment
Diana Abdullah

Join Date: Sep 2018

Posts: 3
#17

24 Dec 2018, 00:12

Originally posted by Joao Santos Silva View Post

Dear Majid,

Thank you for sending the data. Those variables are dropped because other variables have missing values when they are equal to 1. So, after dropping the missings, those dummies have no variation. In a next update of -ppml- I'll try to find a way of providing more helpful warnings in these cases.

Best wishes,

Joao

Dear Prof Joao,

I'm having the same problem as Majid, Could you advise me on what should I do to include this dummy (BHP) as its an important variable for my study.

. ppml ExportUSDmil imp_time lGDPmj lGDPCmj ldist ler lhc CNTG BHP BMP, clu(DIST)
note: checking the existence of the estimates
WARNING: imp_time has very large values, consider rescaling or recentering
WARNING: lGDPmj has very large values, consider rescaling or recentering
WARNING: lGDPCmj has very large values, consider rescaling or recentering
note: starting ppml estimation
note: ExportUSDmil has noninteger values

Iteration 1: deviance = 52928.9
Iteration 2: deviance = 43613.09
Iteration 3: deviance = 43333.87
Iteration 4: deviance = 43333.28
Iteration 5: deviance = 43333.28
Iteration 6: deviance = 43333.28

Number of parameters: 9
Number of observations: 144
Number of observations dropped: 0
Pseudo log-likelihood: -22240.056
R-squared: .9268023
(Std. Err. adjusted for 18 clusters in DIST)

Robust
ExportUSDmil Coef. Std. Err. z P>z [95% Conf. Interval]

imp_time .0035293 .0029416 1.20 0.230 -.0022361 .0092947
lGDPmj .5784232 .1735275 3.33 0.001 .2383157 .9185308
lGDPCmj -.1038585 .082395 -1.26 0.207 -.2653498 .0576327
ldist -1.652412 .1819653 -9.08 0.000 -2.009058 -1.295767
ler -.0616003 .0376378 -1.64 0.102 -.1353691 .0121685
lhc .1135039 .091901 1.24 0.217 -.0666187 .2936266
CNTG -2.904657 .5367196 -5.41 0.000 -3.956608 -1.852706
BMP 1.007709 .3596526 2.80 0.005 .3028032 1.712616
_cons 6.042533 3.719994 1.62 0.104 -1.248522 13.33359

Number of regressors dropped to ensure that the estimates exist: 1
Dropped variables: BHP
Option strict is off

-Diana-
Comment
Joao Santos Silva

Join Date: Apr 2014

Posts: 2782
#18

24 Dec 2018, 08:33

Dear Diana Abdullah,

In your sample, it is not possible to estimate the coefficient of that variable. I suggest you use a larger sample.

Best wishes,

Joao
Comment
Diana Abdullah

Join Date: Sep 2018

Posts: 3
#19

26 Dec 2018, 20:20

Dear prof,

Noted, I will try to add more samples.

Thanks
Comment
Vutha Hing

Join Date: Apr 2019

Posts: 5
#20

04 Apr 2019, 01:02

Dear Joao,

I follow many of your comment and advice on ppml estimation with gravity and thank for such a useful contribution.
Yet, I still have problem of excluded aggressors similar to Majid. Let me brief my study and problem and appreciate your advice:

I am conduct a research on the "impacts of human capital on value-added trade for East Asia economies". I employ gravity model to estimate the coefficients. I have 11 East Asia economies as exporters and 54 partner countries for 2005-2015 period. I regard it as panel data by xtset year. I use two estimation methods: OLS fixed effects and PPML fixed effect accounting for exporter-time and importer-time varying for both estimators. My dependent variable is value-added export (vae) and key independent variables are mean year of schooling (mys) and quality of education (edu_qual). Below are stata command for both estimation methods:

1. OLS fixed effects:
xtreg ln_vae ln_output_o ln_output_d ln_distw contig comlang_off fta_wto ln_mys ln_edu_qual ///
ln_tariff_face ln_infra ln_export_time EXPORTER_TIME_FE* IMPORTER_TIME_FE*, robust fe

1. PPML fixed effects:
ppml vae ln_output_o ln_output_d ln_distw contig comlang_off fta_wto ln_mys ln_edu_qual ///
ln_tariff_face ln_infra ln_export_time EXPORTER_TIME_FE* IMPORTER_TIME_FE*, cluster (pair_id)

I conduct RESET test as suggested by your 'the Log of Gravity' page and the result favor ppml estimator.. The big problem, though, is that my key aggressors (mys and edu_qual) are excluded along with ln_infra. Below is the outputs from stata:

Number of regressors excluded to ensure that the estimates exist: 535
Excluded regressors: ln_year_schl ln_pisa_score ln_infra

I tried collinearity diagnosis and drop ln_infra in the estimation; yet the calculation still exclude my main aggressors (mys and edu_qual).
My questions are follows:
(1) is there any solution that can fixed this issue?
(2) do i use the appropriate fixed effects and right stata command for ppml?

Thanks so much

Vutha
Comment
Joao Santos Silva

Join Date: Apr 2014

Posts: 2782
#21

04 Apr 2019, 08:03

Dear Vutha Hing,

I suggest you try the new ppmlhdfe command; it is available from SSC.

Best wishes,

Joao
Comment
Vutha Hing

Join Date: Apr 2019

Posts: 5
#22

04 Apr 2019, 16:30

Dear Joao,

Thank so much for quick feedback. I will try that new command and revert back if necessary.
Regards,

Vutha
Comment
Vutha Hing

Join Date: Apr 2019

Posts: 5
#23

04 Apr 2019, 18:25

Dear Joao,

As advised, i run the same specification with ppmlhdfe command as below:
ppmlhdfe dva ln_output_o ln_output_d ln_distw contig comlang_off fta_wto ln_year_schl ln_pisa_score ///
ln_inc_gap ln_tariff_face ln_infra ln_export_time, a(EXPORTER_TIME_FE* IMPORTER_TIME_FE*) cluster (pair_id)

There result turn out that a lot more of aggressors are excluded as per show below:

(warning: absorbing 704 dimensions of fixed effects; check that you really want that)
note: 7 variables omitted because of collinearity: ln_output_o ln_output_d ln_year_schl ln
> _pisa_score ln_tariff_face ln_infra ln_export_time

I still want to stick to this specification as it is explained reasonably by theory YET I relaxed the fixed effect a bit.
I changed exporter-time importer-time fixed effects to just exporter and importer fixed effect and run the following estimation and it works NO drop of aggressors.

ppmlhdfe dva output_o ln_output_d ln_distw contig comlang_off fta_wto ln_year_schl ln_pisa_score ///
ln_inc_gap ln_tariff_face ln_infra, a(EXP_FE* IMP_FE*) cluster (pair_id)

My concern is I am not sure if accounting for only exporter importer fixed effects can capture the real effect or not.
Appreciate your comment and advice on this.

Regards,

Vutha
Comment
Joao Santos Silva

Join Date: Apr 2014

Posts: 2782
#24

05 Apr 2019, 02:21

Dear Vutha Hing,

Those variables are dropped because they are collinear with the fixed effects. So, it is up to you to decide whether you include time-varying fixed effects and do not estimate the coefficients on those variables, or include just Exp and Imp fixed effects and estimate those coefficients.

Best wishes,

Joao
Comment
Vutha Hing

Join Date: Apr 2019

Posts: 5
#25

07 Apr 2019, 18:41

Dear Joao,

Thank so much for your advice which give me confidence and useful options to choose.
Regards,

Vutha
Comment
Tom Zylkin

Join Date: Nov 2016

Posts: 185
#26

08 Apr 2019, 10:09

Hi Vutha Hing ,

The "absorbing 704 dimensions of fixed effects; check that you really want that" message suggests that maybe you are specifying the fixed effects in a way other than intended. It sounds like you have 704 variables in your data set that start with either 'EXPORTER_TIME_FE" or "IMPORTER_TIME_FE". But really, all you need here is two variables, one with a unique ID for each exporter-year and one with a unique ID for each importer-year.

Here is a simple example you may be able to follow:

egen exp_time = group(exporter year)
egen imp_time = group(importer year)

ppmlhdfe dva ln_output_o ln_output_d ln_distw contig comlang_off fta_wto ln_year_schl ln_pisa_score ///
ln_inc_gap ln_tariff_face ln_infra ln_export_time, a(exp_time imp_time) cluster (pair_id)

where "exporter" and "importer" should be replaced by whatever variables you are using to identify the exporter and importer. However, as Joao rightly says, the variables "ln_output_o" and "ln_output_d" look like they should be collinear with your exporter-time and importer-time fixed effects. If you instead want to have exporter and importer fixed effects only, you should be able to type

ppmlhdfe dva output_o ln_output_d ln_distw contig comlang_off fta_wto ln_year_schl ln_pisa_score ///
ln_inc_gap ln_tariff_face ln_infra, a(exporter importer) cluster (pair_id)

Again, this is assuming you have two numerical ID variables called "exporter" and "importer" that respectively identify the exporter and importer countries.

Regards,
Tom
Comment
Vutha Hing

Join Date: Apr 2019

Posts: 5
#27

08 Apr 2019, 20:19

Dear Tom,

Thank you so much for precise elaboration and suggestion on my puzzle on top of Joao's advise.
I will closely follow your suggested command and will feedback later on the results.

Regards,

Vutha
Comment
Diana Moreira

Join Date: Apr 2019

Posts: 2
#28

11 Apr 2019, 04:38

Dear Statalist users,

I am currently completing my Master's dissertation on the impact of the reform of the rules of origin on EU imports from African EBA countries. I am using panel data, with imports from 34 African countries to the EU, over the course of 17 years - each country has around 26 000 products per year, because I am using imports at the HS6 level.

I am attempring to use the new command ppmlhdfe in order to conduct my robustness check, but I keep getting the following error:

my regression:

ppmlhdfe ln_imports after_reform, a(id)

my country fixed effect:

egen id=group(origin)

the error:

remove_collinears(): 3499 selectindex() not found
GLM::init_variables(): - function returned error
<istmt>: - function returned error

Apparently, the "specified variable or function could not be found".

Could someone please help me figure out this issue?
Comment
Tom Zylkin

Join Date: Nov 2016

Posts: 185
#29

11 Apr 2019, 06:01

Hi Diana,

The error you're receiving is referring to a mata function called "selectindex" that has only been made available starting with Stata 13. If you are using an older version of Stata, you will need to follow the procedure outlined in this post.

Regards,
Tom
Comment
Diana Moreira

Join Date: Apr 2019

Posts: 2
#30

11 Apr 2019, 07:29

Dear Tom,

Thank you so much, that did the trick!
Comment

Announcement

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment