Question regarding Interpreting Coefficients

Ignacio Lunghi

Join Date: Sep 2022
Posts: 4

Question regarding Interpreting Coefficients

08 Jul 2023, 09:11

Hi,

The problem I have is the following. I have ran two regressions in Stata:

(1) “reg income i.treated##i.male i.experiment”

(2) “reg income i.treated i.experiment if male==1”

Where treated and male are two dummies and experiment is a categorical variable. The income variable is continuous.

I thought that the sum of the coefficients "_b[1.p treated] + _b[1.ptreated#1.man]" of the first regression and the coefficient "_b[1.ptreated]" of the second were the same, but they are not.

What explanation is there for this?

Additional information: the sum of the coefficients is different only when I incorporate dummies per experiment (I had to add them since we are grouping different experiment which were done using Qualtrics). They are equal when i have ran the following regressions.

(3) “reg income i.treated##i.male”

(4) “reg income i.treated if male==1”

Is there a way I can perform a regression like the first one but where the sum of "_b[1.ptreated] +_b[1.ptreated#1.male]" and the coefficient "_b[1.ptreated]" are equal between regressions?

Here are the results of the regressions:

	(1)	(2)	(3)	(4)
Variables	income	income	income	income

1.treated	0.0402	0.0370	0.0774*	0.0699
	(0.0438)	(0.0511)	(0.0434)	(0.0494)
1.male	0.121**		0.125**
	(0.0490)		(0.0493)
1.treated#1.male	-0.00707		-0.00750
	(0.0639)		(0.0642)
Experiment 2	-0.0920**	-0.0798
	(0.0361)	(0.0561)
Experiment 3	-0.285***	-0.263***
	(0.0441)	(0.0671)
Constant	-0.0515	0.0576	-0.172***	-0.0469
	(0.0422)	(0.0543)	(0.0333)	(0.0381)

Observations	3,771	1,731	3,771	1,731
R-squared	0.016	0.010	0.005	0.001
Standard errors in parentheses
* p<0.01, p<0.05, * p<0.1

Last edited by Ignacio Lunghi; 08 Jul 2023, 09:23.

Tags: None

Clyde Schechter

Join Date: Apr 2014

Posts: 30165
#2

08 Jul 2023, 09:25

I thought that the sum of the coefficients "_b[1.p treated] + _b[1.ptreated#1.man]" of the first regression and the coefficient "_b[1.ptreated]" of the second were the same, but they are not.

What explanation is there for this?

Additional information: the sum of the coefficients is different only when I incorporate dummies per experiment (I had to add them since we are grouping different experiment which were done using Qualtrics). They are equal when i have ran the following regressions.

You have actually answered your own question.

Stratified regression and interacting a single regression with the stratified variable are not exactly equivalent. They do produce the same result when only a single explanatory variable is present in the model. But when the regression involves multiple predictors and you interact only with one, you will get different results. To get the same results from interaction as with stratification you must interact with all of the explanatory variables. In your case:

Code:

reg income i.treated i.experiment if male == 1 reg income i.male##(i.treated i.experiment)

will be have the way you expect.

The reason -reg income i.male##i.treated i.experiment- doesn't work the way you expected it to is that when you separately regress for males and females, the experiment variable is allowed to have different values for males and females (and, except in extraordinary coincidental circumstances, it almost always will). By contrast when you interact male with treated, but not with experiment, you constrain the coefficient of treated to be the same for males and females, while that for treated can (and almost always will) differ. But unless treated and experiment are exactly independent variables, anything that changes the coefficient of one of them will also change the coefficient of the other.
1 like
Comment
Ignacio Lunghi

Join Date: Sep 2022

Posts: 4
#3

08 Jul 2023, 09:34

Ok, thank you very much for the answer

I also understand, based on what you told me, that the most "correct" way in this experiment would then be to add those interactions between the "experiment" variable and my sex and treatment variables, right?
Comment
Clyde Schechter

Join Date: Apr 2014

Posts: 30165
#4

08 Jul 2023, 11:40

Well, I didn't say that, and I don't know what the most "correct" approach is here. What I said is that adding those interactions will cause the interaction model coefficients to equal those you get from doing separate regressions for each sex.

But it is a substantive, not a statistical, question whether the correct approach is to have the effect of experiment differ between men and women or constrain it to be the same. That would depend on what the experiments are, and whether predicts that those effects should be the same or different. It's really no different from any other situation where you are trying to decide whether or not to include an interaction term in a model: it depends on what is understood about the data generating process and whether it distinguishes those groups or not. The question you are posing in #3 requires advice from somebody with expertise in the subject matter of your research--it is not a statistical question.

I can't tell from your posts what discipline you are working in: on the one hand you are at least considering sex-specific differences, which is mostly characteristic of the biomedical disciplines, but you are also modeling income as the outcome variable, suggesting an economics related discipline. I'm an epidemiologist and I wouldn't pretend to advise you on an economics-related question. If your work is sufficiently closely related to areas I have worked in, I might be able to advise you if you provided more detail about what you are studying and what these experiments involve.

Last edited by Clyde Schechter; 08 Jul 2023, 11:43.
1 like
Comment
Ignacio Lunghi

Join Date: Sep 2022

Posts: 4
#5

08 Jul 2023, 11:55

Originally posted by Clyde Schechter View Post

Well, I didn't say that, and I don't know what the most "correct" approach is here. What I said is that adding those interactions will cause the interaction model coefficients to equal those you get from doing separate regressions for each sex.

But it is a substantive, not a statistical, question whether the correct approach is to have the effect of experiment differ between men and women or constrain it to be the same. That would depend on what the experiments are, and whether predicts that those effects should be the same or different. It's really no different from any other situation where you are trying to decide whether or not to include an interaction term in a model: it depends on what is understood about the data generating process and whether it distinguishes those groups or not. The question you are posing in #3 requires advice from somebody with expertise in the subject matter of your research--it is not a statistical question.

I can't tell from your posts what discipline you are working in: on the one hand you are at least considering sex-specific differences, which is mostly characteristic of the biomedical disciplines, but you are also modeling income as the outcome variable, suggesting an economics related discipline. I'm an epidemiologist and I wouldn't pretend to advise you on an economics-related question. If your work is sufficiently closely related to areas I have worked in, I might be able to advise you if you provided more detail about what you are studying and what these experiments involve.

Yes, I thought that the answer was related more to the data generating process. Thank you, I'm going to look at it with more detail. I greatly appreciate your help!
Comment

Announcement

Question regarding Interpreting Coefficients

Comment

Comment

Comment

Comment