Hi,
Say I am trying to see the effects of being from a different ethnicity on wages.
In my initial dataset ethnicities are coded from 1-4; white as 1, black as 2, indian as 3, and other as 4.
If I drop all missing values for ethnicity, and then create dummies manually using the code:
gen black = (race==2)
gen indian = (race==3)
gen other = (race==4)
And then run a regression of wage against these, would the interpretation of the coefficient of e.g. being Indian be compared to the base group of being white, or being indian compared to white, black or other?
Similarly, if I include age in the regression, would the interpretation for that coefficient relate to someone white?
I understand these questions might seem very basic but I want to check I have coded my dummies correctly to interpret them against the base group of only white rather than the base group of all other races.
Finally, if I included other categorical dummies in the equation, such as married and divorced with never married as the based group, would the coefficient on "Indian" be compared to a never married white individual or a white individual with the same marital status?
Say I am trying to see the effects of being from a different ethnicity on wages.
In my initial dataset ethnicities are coded from 1-4; white as 1, black as 2, indian as 3, and other as 4.
If I drop all missing values for ethnicity, and then create dummies manually using the code:
gen black = (race==2)
gen indian = (race==3)
gen other = (race==4)
And then run a regression of wage against these, would the interpretation of the coefficient of e.g. being Indian be compared to the base group of being white, or being indian compared to white, black or other?
Similarly, if I include age in the regression, would the interpretation for that coefficient relate to someone white?
I understand these questions might seem very basic but I want to check I have coded my dummies correctly to interpret them against the base group of only white rather than the base group of all other races.
Finally, if I included other categorical dummies in the equation, such as married and divorced with never married as the based group, would the coefficient on "Indian" be compared to a never married white individual or a white individual with the same marital status?
Comment