Hello,
I am a little bit confused over my model. I am regressing log wages on being an Immigrant or not. I have splitted the Immigrant Population into different arrival waves. Now, I dont know whether I have to use dummies for each immigrant arrival wave or can I used just the categorical variable with the values of immigrant arrival waves and natives? Is there a difference between these two following models:
1. Model: Variable arrival has native (=9999) as the reference group
2. Model: Making Dummies for each Immigrant arrival wave from the Variable -arrival- such that the intercept is the native reference group
I see that the coefficients are different, but I don't see why since the reference group in both are natives.
I am a little bit confused over my model. I am regressing log wages on being an Immigrant or not. I have splitted the Immigrant Population into different arrival waves. Now, I dont know whether I have to use dummies for each immigrant arrival wave or can I used just the categorical variable with the values of immigrant arrival waves and natives? Is there a difference between these two following models:
1. Model: Variable arrival has native (=9999) as the reference group
Code:
svy: regress lnhourlyw_w i.ib9999.arrival if year==2004
(running regress on estimation sample)
Survey: Linear regression
Number of strata = 1 Number of obs = 10,726
Number of PSUs = 10,726 Population size = 1,317,293
Design df = 10,725
F( 6, 10720) = 99.52
Prob > F = 0.0000
R-squared = 0.0279
Linearized
lnhourlyw_w Coef. Std. Err. t P>t [95% Conf. Interval]
arrival
pre 1980 -.1686351 .0151419 -11.14 0.000 -.198316 -.1389543
1980-84 -.1678049 .0202635 -8.28 0.000 -.207525 -.1280847
1985-89 -.2158353 .0165672 -13.03 0.000 -.2483101 -.1833604
1990-94 -.2542113 .0122076 -20.82 0.000 -.2781405 -.2302822
1995-99 -.1508089 .0222109 -6.79 0.000 -.1943463 -.1072715
2000-04 -.0889885 .0228124 -3.90 0.000 -.133705 -.044272
_cons 3.689774 .0057737 639.07 0.000 3.678457 3.701092
Code:
svy: regress lnhourlyw_w i.arvpre1980 i.arv1980 i.arv1985 i.arv1990 i.arv1995 i.arv2000 if year==2004
(running regress on estimation sample)
Survey: Linear regression
Number of strata = 1 Number of obs = 10,726
Number of PSUs = 10,726 Population size = 1,317,293
Design df = 10,725
F( 6, 10720) = 99.52
Prob > F = 0.0000
R-squared = 0.0279
Linearized
lnhourlyw_w Coef. Std. Err. t P>t [95% Conf. Interval]
1.arvpre1980 .0406217 .0139277 2.92 0.004 .0133207 .0679226
1.arv1980 .0414519 .0174006 2.38 0.017 .0073435 .0755603
1.arv1985 -.0065785 .0148694 -0.44 0.658 -.0357253 .0225684
1.arv1990 -.0449545 .0120761 -3.72 0.000 -.0686259 -.0212832
1.arv1995 .0584479 .0187726 3.11 0.002 .0216502 .0952457
1.arv2000 .1202683 .0192005 6.26 0.000 .0826317 .1579048
_cons 3.480518 .0087417 398.15 0.000 3.463382 3.497653

Comment