Hello,
I am a little bit confused over my model. I am regressing log wages on being an Immigrant or not. I have splitted the Immigrant Population into different arrival waves. Now, I dont know whether I have to use dummies for each immigrant arrival wave or can I used just the categorical variable with the values of immigrant arrival waves and natives? Is there a difference between these two following models:
1. Model: Variable arrival has native (=9999) as the reference group
2. Model: Making Dummies for each Immigrant arrival wave from the Variable -arrival- such that the intercept is the native reference group
I see that the coefficients are different, but I don't see why since the reference group in both are natives.
I am a little bit confused over my model. I am regressing log wages on being an Immigrant or not. I have splitted the Immigrant Population into different arrival waves. Now, I dont know whether I have to use dummies for each immigrant arrival wave or can I used just the categorical variable with the values of immigrant arrival waves and natives? Is there a difference between these two following models:
1. Model: Variable arrival has native (=9999) as the reference group
Code:
svy: regress lnhourlyw_w i.ib9999.arrival if year==2004 (running regress on estimation sample) Survey: Linear regression Number of strata = 1 Number of obs = 10,726 Number of PSUs = 10,726 Population size = 1,317,293 Design df = 10,725 F( 6, 10720) = 99.52 Prob > F = 0.0000 R-squared = 0.0279 Linearized lnhourlyw_w Coef. Std. Err. t P>t [95% Conf. Interval] arrival pre 1980 -.1686351 .0151419 -11.14 0.000 -.198316 -.1389543 1980-84 -.1678049 .0202635 -8.28 0.000 -.207525 -.1280847 1985-89 -.2158353 .0165672 -13.03 0.000 -.2483101 -.1833604 1990-94 -.2542113 .0122076 -20.82 0.000 -.2781405 -.2302822 1995-99 -.1508089 .0222109 -6.79 0.000 -.1943463 -.1072715 2000-04 -.0889885 .0228124 -3.90 0.000 -.133705 -.044272 _cons 3.689774 .0057737 639.07 0.000 3.678457 3.701092
Code:
svy: regress lnhourlyw_w i.arvpre1980 i.arv1980 i.arv1985 i.arv1990 i.arv1995 i.arv2000 if year==2004 (running regress on estimation sample) Survey: Linear regression Number of strata = 1 Number of obs = 10,726 Number of PSUs = 10,726 Population size = 1,317,293 Design df = 10,725 F( 6, 10720) = 99.52 Prob > F = 0.0000 R-squared = 0.0279 Linearized lnhourlyw_w Coef. Std. Err. t P>t [95% Conf. Interval] 1.arvpre1980 .0406217 .0139277 2.92 0.004 .0133207 .0679226 1.arv1980 .0414519 .0174006 2.38 0.017 .0073435 .0755603 1.arv1985 -.0065785 .0148694 -0.44 0.658 -.0357253 .0225684 1.arv1990 -.0449545 .0120761 -3.72 0.000 -.0686259 -.0212832 1.arv1995 .0584479 .0187726 3.11 0.002 .0216502 .0952457 1.arv2000 .1202683 .0192005 6.26 0.000 .0826317 .1579048 _cons 3.480518 .0087417 398.15 0.000 3.463382 3.497653
Comment