Plot qreg2 + question about comparing confidence intervals in quantile regression with panel data

Verena Balke

Join Date: Jul 2018

Posts: 1
#1

Plot qreg2 + question about comparing confidence intervals in quantile regression with panel data

11 Jul 2018, 07:59

For my thesis, I am conducting quantile regression for panel data using the qreg2 command in Stata 13.0. The Parente-Santos Silva test shows intra-cluster correlation.
To visualize my results I would like to plot the confidence interval obtained from the quantile regression, but I am also not sure what to plot it against.
I have two questions:

1. To plot the results from qreg2 I have tried to use the grqreg command, but that does not work. Is there a similar command for plotting my results from qreg2?

2. Also, usually the OLS confidence interval is included in the plot to visualize differences between the confidence intervals. Would it not be better to include the confidence interval from a fixed effects regression (I have conducted both LSDV and within estimator). However, in the papers about panel data quantile regression I have read so far OLS confidence intervals are used to make a comparison with the confidence intervals obtained from quantile regression. Could someone explain to me why that is?
Tags: graph, panel data, qreg2, Quantile Regression

Scott Merryman

Join Date: Mar 2014
Posts: 895

11 Jul 2018, 10:04

Regarding question one, here is one way:

Code:

sysuse auto,clear
tempname memhold
tempfile results
postfile `memhold' q b cilow cihigh using `results'
forv i = .1(.1).9 {
    qui qreg2 price mpg, q(`i')
    local b = _b[mpg]
    local cilow  = _b[mpg]-_se[mpg]*invttail(e(df_r),.025)    
    local cihigh  = _b[mpg]+_se[mpg]*invttail(e(df_r),.025)    
    post `memhold' (`i') (`b') (`cilow') (`cihigh')
}
postclose `memhold'
use `results'
twoway rarea cih cil q, color(gs14)  || line b q,  lc(blue) || , legend(off) xlabel(.1(.1).9)

Comment

Michael Foreman

Join Date: Mar 2018
Posts: 54

23 Mar 2020, 10:25

Originally posted by Scott Merryman View Post

Regarding question one, here is one way:

Code:

sysuse auto,clear
tempname memhold
tempfile results
postfile `memhold' q b cilow cihigh using `results'
forv i = .1(.1).9 {
qui qreg2 price mpg, q(`i')
local b = _b[mpg]
local cilow = _b[mpg]-_se[mpg]*invttail(e(df_r),.025)
local cihigh = _b[mpg]+_se[mpg]*invttail(e(df_r),.025)
post `memhold' (`i') (`b') (`cilow') (`cihigh')
}
postclose `memhold'
use `results'
twoway rarea cih cil q, color(gs14) || line b q, lc(blue) || , legend(off) xlabel(.1(.1).9)

Dear Scott,

I've been having the same question, so thank you for the solution! When I tested the code, however, the graph has no confidence intervals (please see below). Do you know what the problem might be?
It looks like the confidence intervals are not generated properly

Code:

. su

    Variable |        Obs        Mean    Std. Dev.       Min        Max
-------------+---------------------------------------------------------
           q |          9          .5    .2738613         .1         .9
           b |          9   -168.4324    118.5083  -391.5217        -64
       cilow |          0
      cihigh |          0

Thank you once again for your help!

Click image for larger version

Name: image_17415.png
Views: 1
Size: 18.3 KB
ID: 1542491

Last edited by Michael Foreman; 23 Mar 2020, 10:30.

Comment

Joao Santos Silva

Join Date: Apr 2014

Posts: 3011
#4

23 Mar 2020, 13:25

Dear All,

Thanks Scott Merryman for providing the code; I am often asked this question and now I can just point people to your code.
Michael Foreman, I have tested the code in Stata 16 and in Stata 11 and it worked like a charm; maybe you need to update Stata?
Finally, Verena Balke, if you are using panel data, you may want to consider using xtqreg, which allows for fixed effects.

Best wishes,

Joao
1 like
Comment

Michael Foreman

Join Date: Mar 2018
Posts: 54

23 Mar 2020, 13:53

Dear Joao,

Thank you for your reply. My Stata 13.1 is up to date, relatively speaking (please see the code below, it produces the graphs without CIs as I posted earlier). Is there something else I might be doing wrong?

Code:

. update

Update status
    Last check for updates:  23 Mar 2020
    New update available:    none         (as of 23 Mar 2020)
    Current update level:    16 Dec 2016  (what's new)

Possible actions

    Check for available updates           (or type -update query-)

. sysuse auto,clear
(1978 Automobile Data)

. 
. tempname memhold

. 
. tempfile results

. 
. postfile `memhold' q b cilow cihigh using `results'

. 
. forv i = .1(.1).9 {
  2. 
.     qui qreg2 price mpg, q(`i')
  3. 
.     local b = _b[mpg]
  4. 
.     local cilow  = _b[mpg]-_se[mpg]*invttail(e(df_r),.025)    
  5. 
.     local cihigh  = _b[mpg]+_se[mpg]*invttail(e(df_r),.025)    
  6. 
.     post `memhold' (`i') (`b') (`cilow') (`cihigh')
  7. 
. }

. postclose `memhold'
. use `results'
. twoway rarea cih cil q, color(gs14)  || line b q,  lc(blue) || , legend(off) xlabel(.1(.1).9)

Comment

Joao Santos Silva

Join Date: Apr 2014

Posts: 3011
#6

24 Mar 2020, 01:57

Dear Michael Foreman,

Does the code generate the cih, cil, and q variables? If so, maybe you can try different plots or different colours.

Best wishes,

Joao
Comment

Michael Foreman

Join Date: Mar 2018
Posts: 54

24 Mar 2020, 18:11

Dear Joao,

Thank you for your advice. It looks like the code does not generate confidence intervals (please see below). I am really perplexed because I just copied and pasted exactly the same code. It has something to do with " *invttail(e(df_r),.025) " part. if I replace "e(df_r)" with a value, say "73", the code generates confidence intervals, but if i keep "e(df_r)" as in the original code, i get missing values. Would you have any idea why this might be the case with my Stata? Are there some setting that could have been manually and now cause this issue? Apprently, "e(df_r)" does not work in my case. I am really perplexed by why it works on other machines but not on mine.

Code:

. sysuse auto,clear
(1978 Automobile Data)

. tempname memhold

. tempfile results

. postfile `memhold' q b cilow cihigh using `results'

. forv i = .1(.1).9 {
  2.     qui qreg2 price mpg, q(`i')
  3.     local b = _b[mpg]
  4.     local cilow  = _b[mpg]-_se[mpg]*invttail(e(df_r),.025)    
  5.     local cihigh  = _b[mpg]+_se[mpg]*invttail(e(df_r),.025)    
  6.     post `memhold' (`i') (`b') (`cilow') (`cihigh')
  7. }

. postclose `memhold'

. use `results'

. list

     +---------------------------------+
     |  q           b   cilow   cihigh |
     |---------------------------------|
  1. | .1   -75.71429       .        . |
  2. | .2         -64       .        . |
  3. | .3         -84       .        . |
  4. | .4        -107       .        . |
  5. | .5   -135.6667       .        . |
     |---------------------------------|
  6. | .6   -120.6667       .        . |
  7. | .7   -202.5833       .        . |
  8. | .8   -334.7391       .        . |
  9. | .9   -391.5217       .        . |
     +---------------------------------+

Comment

Joao Santos Silva

Join Date: Apr 2014

Posts: 3011
#8

25 Mar 2020, 01:23

Dear Michael Foreman,

I should have asked this before: do you have the latest version of qreg2?

Best wishes,

Joao
Comment
Michael Foreman

Join Date: Mar 2018

Posts: 54
#9

25 Mar 2020, 07:45

Dear Joao,

Following your advice, I reinstalled qreg2 and now I am getting the confidence intervals! It is a mystery to me what the problem was.. Thank you for your help!
Comment
Joao Santos Silva

Join Date: Apr 2014

Posts: 3011
#10

25 Mar 2020, 08:27

Dear Michael Foreman,

An earlier version of qreg2 did not save the degrees of freedom.

Best wishes,

Joao
Comment
Michael Foreman

Join Date: Mar 2018

Posts: 54
#11

25 Mar 2020, 11:15

Thank you, Joao, this clarifies it all!
Comment

Suna Marasli

Join Date: Oct 2018
Posts: 6

#12

04 Jun 2020, 06:23

Originally posted by Scott Merryman View Post

Regarding question one, here is one way:

Code:

sysuse auto,clear
tempname memhold
tempfile results
postfile `memhold' q b cilow cihigh using `results'
forv i = .1(.1).9 {
qui qreg2 price mpg, q(`i')
local b = _b[mpg]
local cilow = _b[mpg]-_se[mpg]*invttail(e(df_r),.025)
local cihigh = _b[mpg]+_se[mpg]*invttail(e(df_r),.025)
post `memhold' (`i') (`b') (`cilow') (`cihigh')
}
postclose `memhold'
use `results'
twoway rarea cih cil q, color(gs14) || line b q, lc(blue) || , legend(off) xlabel(.1(.1).9)

Dear all,
I tried applying this suggestion to visualize my estimation, like below:

Code:

tempname memhold
tempfile results
postfile `memhold' q b cilow cihigh using `results'
forv i = .1(.1).9 {
    xi: qreg2 lavprod zip $general i.country i.yy logdeg_t logsz, cluster(new_account_id) quantile(.75) wlsiter(10) q(`i')
    local b = _b[zip]
    local cilow  = _b[zip]-_se[zip]*invttail(e(df_r),.025)    
    local cihigh  = _b[zip]+_se[zip]*invttail(e(df_r),.025)    
    post `memhold' (`i') (`b') (`cilow') (`cihigh')
}
postclose `memhold'
use `results'
twoway rarea cih cil q, color(gs14)  || line b q,  lc(blue) || , legend(off) xlabel(.1(.1).9)

But I get an error about the q option, as "option q() not allowed". I thought it may be related to using dummy variables and "xi", but I couldn't solve the problem.

I am using Stata 16 on a Windows pc, and installed qreg2 just yesterday, I guess it is up to date.

Best regards,
Suna

Last edited by Suna Marasli; 04 Jun 2020, 06:39.

Comment

Scott Merryman

Join Date: Mar 2014

Posts: 895
#13

04 Jun 2020, 07:10

You have specified the quantile(#) twice:

Code:

...quantile(.75) wlsiter(10) q(`i')
Comment

Suna Marasli

Join Date: Oct 2018
Posts: 6

#14

04 Jun 2020, 07:30

Thank you Scott Merryman! Such a beginners' mistake.

Yet, I still cannot make it work. My code is as below:

Code:

tempname memhold
tempfile results
postfile `memhold' q b cilow cihigh using `results'
forv i = .1(.1).9 {
    qreg2 lavprod zip $general i.country i.yy logdeg_t logsz, cluster(new_account_id) wlsiter(10) q(.50)
    local b = _b[zip]
    local cilow  = _b[zip]-_se[zip]*invttail(e(df_r),.025)  
    local cihigh  = _b[zip]+_se[zip]*invttail(e(df_r),.025)  
    post `memhold' (0.50) (`b') (`cilow') (`cihigh')
}
postclose `memhold'
use `results'
twoway rarea cih cil q, color(gs14)  || line b q,  lc(blue) || , legend(off) xlabel(.1(.1).9)

After the line " use `results' " I get an error like:
"no; dataset in memory has changed since last saved"

I am not experienced with coding graphics in Stata. I cannot make sense of this error. Help greatly appreciated.

Best regards,
Suna

Last edited by Suna Marasli; 04 Jun 2020, 07:50.

Comment

FernandoRios

Join Date: Apr 2014
Posts: 2469

#15

04 Jun 2020, 08:37

Hi Suna,
perhaps the following code may help with what yo need to do

Code:

 clear all
 use http://fmwww.bc.edu/RePEc/bocode/o/oaxaca.dta, clear
 
 forvalues i=.05(.05).95 {
  qui:qreg lnwage educ exper tenure female, q(`i')
  matrix aux=r(table)
  matrix bqreg=nullmat(bqreg)\aux["b",....]
  matrix llqreg=nullmat(llqreg)\aux["ll",....]
  matrix uuqreg=nullmat(uuqreg)\aux["ul",....]
  qui:qreg2 lnwage educ exper tenure female, q(`i')
  matrix aux=r(table)
  matrix bqreg2=nullmat(bqreg2)\aux["b",....]
  matrix llqreg2=nullmat(llqreg2)\aux["ll",....]
  matrix uuqreg2=nullmat(uuqreg2)\aux["ul",....]
 }
 ssc install lbsvmat
 lbsvmat bqreg
 lbsvmat llqreg
 lbsvmat uuqreg

 lbsvmat llqreg2
 lbsvmat uuqreg2
 
 gen q=_n*5 if  _n*5<=95
 
 two rarea llqreg1 uuqreg1 q, color(green%50)  ||  ///
    rarea llqreg21 uuqreg21 q, color(blue%50) || line bqreg1 q, ///
    legend(order( 1 "qreg CI" 2 "qreg2 CI" 3 "Point estimate"))

 two rarea llqreg2 uuqreg2 q, color(green%50)  ||  ///
    rarea llqreg22 uuqreg22 q, color(blue%50) || line bqreg2 q, ///
    legend(order( 1 "qreg CI" 2 "qreg2 CI" 3 "Point estimate"))

HTH

Announcement