Meaning of the Oaxaca_rif result?

Jang JinHee

Join Date: Jan 2019

Posts: 14
#1

Meaning of the Oaxaca_rif result?

11 Mar 2019, 01:27

Thank you for your answer.
The following results were obtained using oaxaca_rif.

---------------------------------------------------
overall
group_1 | .2900839
group_2 | .2913806
difference | -.0005414
explained | -.0574823
unexplained | .0569409
----------------------------------------------------

I understand what the following example means.
- for example difference 0.05, explained 0.02, unexplained 0.03 (unexplained ratio = 0.03/0.05*100)
But I do not know what it means to be a negative number.
How can I get the unexplained ratio? (0.569/-0.0005414 *100 ?)

I'm sorry for starters...
I hope to help you
Tags: None
Liu Qiang

Join Date: Jun 2018

Posts: 135
#2

11 Mar 2019, 02:48

Dear Jang JinHee, to be frank, I had the same puzzling question days ago and consulted one of the professional experts in this field.
The result is negative maybe partly due to the model specification error.
hope this thread could help you understand it
https://www.statalist.org/forums/for...-decomposition

2B or not 2B, that's a question!
Comment
FernandoRios

Join Date: Apr 2014

Posts: 2479
#3

11 Mar 2019, 06:37

HI Jang,
In addition to Liu's advice i think you have to rephrase your problem/interpretation of the results.
You are correct in saying that often, for OB type of decomposition, people interpret the components as shares or proportion of the raw gap. This is not the only way, specially for a case like yours.
Think about the following example. You run a OB decomposition, using wages as depend variables, and education as the ONLY explanatory variable. YOu get the following results
Group 1. Wage=0+5*educ avg education =3
group 2 Wage=0+3*Educ avg education =5

If you compare their wages, you will see, like in your example, that the RAW gap is zero.This is hiding the fact that there is discrimination (or at least differences in wage structures). Group 2 returns to education is much lower than groups 1. Group 2 however has more education, which counteracts the coefficients impact.

In your case, the raw gap (mean gap?) but the decomposition actually reveals differences in characteristics and returns to those characteristics.
HTH
Fernando
Comment
Liu Qiang

Join Date: Jun 2018

Posts: 135
#4

11 Mar 2019, 07:09

Originally posted by FernandoRios View Post

HI Jang,
In addition to Liu's advice i think you have to rephrase your problem/interpretation of the results.
You are correct in saying that often, for OB type of decomposition, people interpret the components as shares or proportion of the raw gap. This is not the only way, specially for a case like yours.
Think about the following example. You run a OB decomposition, using wages as depend variables, and education as the ONLY explanatory variable. YOu get the following results
Group 1. Wage=0+5*educ avg education =3
group 2 Wage=0+3*Educ avg education =5

If you compare their wages, you will see, like in your example, that the RAW gap is zero.This is hiding the fact that there is discrimination (or at least differences in wage structures). Group 2 returns to education is much lower than groups 1. Group 2 however has more education, which counteracts the coefficients impact.

In your case, the raw gap (mean gap?) but the decomposition actually reveals differences in characteristics and returns to those characteristics.
HTH
Fernando

thank you for your nice answer. I once again learn a lot

Code:

clear all set obs 20000 gen u=invnorm(uniform()) gen group=. replace group=1 in 1/10000 replace group=2 in 10001/l gen edu=. replace edu=invnorm(uniform())+3 in 1/10000 replace edu=invnorm(uniform())+5 in 10001/l gen wage=. replace wage=5*edu+u in 1/10000 replace wage=3*edu+u in 10001/l reg wage edu if group==1 reg wage edu if group==2 oaxaca wage edu, by(group)

This is a simulation based on Fernando's advice.
If I understand it correctly, there may still exist discrimination although the wage difference between two different groups is zero. In this way, you cannot interpret the results using the "share" perspective any longer.

2B or not 2B, that's a question!
Comment
Jang JinHee

Join Date: Jan 2019

Posts: 14
#5

11 Mar 2019, 18:00

FernandoRios and Liu Qiang, Thank you for your advice.
But I still did not understand it properly. T_T)
I want to find the share of discrimination in the total wage gap.
Studies show that the unexplained part is negative, or the total wage gap is negative.
So, should I consider other calculations regardless of the oaxaca_rif overall result?
Or should I do additional work after estimating oaxaca_rif?
I understand what you told me, but it's hard to apply to me.
I'm sorry I do not know.
Please let me know a little more detail.
Comment
FernandoRios

Join Date: Apr 2014

Posts: 2479
#6

11 Mar 2019, 18:53

Hi jang
the bottom line is that using shares will give you weird results and I would personally not recommend it.
on the other hand you haven’t said anything about your data, outcome variable or what is what you are trying to analyze
also Liu that was a nice replication
Fernando
Comment
Liu Qiang

Join Date: Jun 2018

Posts: 135
#7

11 Mar 2019, 19:38

Originally posted by Jang JinHee View Post

FernandoRios and Liu Qiang, Thank you for your advice.
But I still did not understand it properly. T_T)
I want to find the share of discrimination in the total wage gap.
Studies show that the unexplained part is negative, or the total wage gap is negative.
So, should I consider other calculations regardless of the oaxaca_rif overall result?
Or should I do additional work after estimating oaxaca_rif?
I understand what you told me, but it's hard to apply to me.
I'm sorry I do not know.
Please let me know a little more detail.

hi Jang, I may offer a new insight that may help you understand what I am saying. Let's say there are two different individuals A and B. A has a higher education level with 5 years whereas B only has 3 years. But what we can observe is that they have the same hourly wage，both 15 dollars/hour for example. One may argue that wage is not solely affected by education and you have to include other covariates, a model specification error. But if we assume the model is correctly set, we should have observed A's wage higher than B's but in fact we don't. This means there's still discrimination since A has endowment advantage that is not rewarded in his salary. But you cannot say how much of the difference is caused by discrimination because there's no total difference at all.
Hope I put it clearly
Best Regards

Last edited by Liu Qiang; 11 Mar 2019, 19:40.

2B or not 2B, that's a question!
Comment
Jang JinHee

Join Date: Jan 2019

Posts: 14
#8

11 Mar 2019, 22:18

Thank you for your advice to me that I do not know.
I understand your comments.
Nevertheless, I will ask you one more question.

I did the following :

1) The wage gap decomposition of women with children and women without children.

2) use 'oaxaca_rif'
------------------------------------------------------------------
local i = 10
while `i' <= 90 {
oaxaca_rif lnhwage age age2 dum_pt2 dum_union2-dum_union3 dum_stat2 dum_fsize2-dum_fsize4 dum_ind2-dum_ind17 dum_edu2-dum_edu3 dum_job2-dum_job9 if gender==1, by(fm) wgt(1) rif(q(`i'))
local i = `i'+10
}
----------------------------------------------------------------------
fm : women with children = 1, women without children = 0

3) As a result, I obtained these results in several quantiles.
-----------------------------------------------------------------------
example 1)

RIF : q(70)
Group 1: fm = 0 N of obs 1 = 8473
Group c: x2*b1 N of obs C = .
Group 2: fm = 1 N of obs 2 = 37546

lnhwage Coef. Std. Err. z P>z [95% Conf. Interval]

overall
group_1 .2908392 .0064496 45.09 0.000 .2781982 .3034802
group_2 .2913806 .0046469 62.70 0.000 .2822729 .3004884
difference -.0005414 .0079493 -0.07 0.946 -.0161217 .0150389
explained -.0574823 .0165785 -3.47 0.001 -.0899756 -.024989
unexplained .0569409 .0173246 3.29 0.001 .0229853 .0908964

---------------------------------------------------------------------------
example 2)

RIF : q(40)
Group 1: fm = 0 N of obs 1 = 8473
Group c: x2*b1 N of obs C = .
Group 2: fm = 1 N of obs 2 = 37546

lnhwage Coef. Std. Err. z P>z [95% Conf. Interval]

overall
group_1 -.0557652 .0062168 -8.97 0.000 -.0679499 -.0435806
group_2 -.121906 .0028722 -42.44 0.000 -.1275354 -.1162767
difference .0661408 .0068482 9.66 0.000 .0527186 .079563
explained .0641534 .0160752 3.99 0.000 .0326466 .0956602
unexplained .0019874 .0163487 0.12 0.903 -.0300554 .0340302
---------------------------------------------------------------------------------

4) In the example 2, it can be easily seen that pay discrimination is 0.3%. (unexplained/difference*100 = 0.0661408/0.0019874)
But the example 1 is somewhat difficult.
Because the difference part is negative(-0.005414).
In this case, the same calculation as the example 1 is (0.0569409/-0.0005414 = ?)
Is not calculation impossible?

5) What is the interpretation of the unexplained part in this case(q70)?
In this quantile(q70), the wages of women without children are higher than those with children?

I am so confused now.
Please help me.

p.s Do you need my data for an answer?
I will link the data if necessary.

Last edited by Jang JinHee; 11 Mar 2019, 22:25.
Comment

Announcement

Meaning of the Oaxaca_rif result?

Comment

Comment

Comment

Comment

Comment

Comment

Comment