Dear all,
I am currently trying to understand how to run the poisson / negative binomial regression model with interactions between continuous variables for my thesis.
I have a count variable as DV, which I call "trans" (my Y) and several continuous IVs with X1 = ln_rpat_rRD (measure for rival innovation as patents per unit spending), X2= rel_size (as measure of relative firm size), X3 = rel_ROA (measure of relative firm performance), X4 = sthom (measure for strategic homogeneity among rival firms), X5 = dummy variable ... to just name the most relevant ones. I have hypothesized for X2, X3, and X4 to moderate the relationship between X1 and Y (with X2 and X3 to have a negative moderating effect and X4 to have a positive moderating effect). Furthermore, I have hypothesized the relationship between X1 and Y to be positive and significant.
I have understood that it seems to make sense to mean center all my moderating IVs first in order to give the 0 value of a variable a substantive meaning. This, I have done.
I am currently still facing two questions that I unfortunately could neither find an answer to on the internet nor in this forum. In case I have missed the respective post, I apologize for duplicating the question and would greatly appreciate if you could hint me towards it.
I'm afraid I do not yet fully understand the process for conducting my regression analysis with interaction terms as I have read that the "main effect" (i.e. the significance and the coefficient of my X1 variable ln_rpat_rRD) cannot be easily interpreted anymore once interaction terms are included in the model.
Thus, my question is whether I need to pursue a step-wise process, or whether the entire analysis can be done in one model only.
By this, I mean the following:
1) Step-wise process: I first need to identify one regression model where I have a significant and positive relationship between X1 and Y with no interaction term included but rather including all relevant IVs as control variables only (I would assume I need this somehow as this positive and significant correlation between X1 and Y builds the basis for my entire analysis). Second, once I have identified this significant and positive correlation, I then separately add an interaction term to the model and analyze its significance and respective coefficient. In this way, I would test all three different interaction effects that I have hypothesized separately and never in combination in one model.
2) One full model: I include all three interaction terms in one model directly without having to go through this step-wise process. If this is the recommended approach, I would really need help in understanding
a) Whether to run my regressions and exclude non-significant interaction terms until I end up only having significant interaction terms in the model anymore. In different contexts, I have read differing opinions, with one stating that it might make sense to exclude them, and one stating to rather keep them in as they theoretically make sense.
b) How do I interpret the main effect of my X1-Y relationship. As I mentioned before, I need this relationship to be significant in order for the interpretation of the interaction terms to even make sense. Yet, I have no experience with regard to plotting and interpreting graphs in STATA as some might suggest, so if this is the case, I would greatly appreciate a more extensive answer.
I would be very happy if someone could help me.
I am currently trying to understand how to run the poisson / negative binomial regression model with interactions between continuous variables for my thesis.
I have a count variable as DV, which I call "trans" (my Y) and several continuous IVs with X1 = ln_rpat_rRD (measure for rival innovation as patents per unit spending), X2= rel_size (as measure of relative firm size), X3 = rel_ROA (measure of relative firm performance), X4 = sthom (measure for strategic homogeneity among rival firms), X5 = dummy variable ... to just name the most relevant ones. I have hypothesized for X2, X3, and X4 to moderate the relationship between X1 and Y (with X2 and X3 to have a negative moderating effect and X4 to have a positive moderating effect). Furthermore, I have hypothesized the relationship between X1 and Y to be positive and significant.
I have understood that it seems to make sense to mean center all my moderating IVs first in order to give the 0 value of a variable a substantive meaning. This, I have done.
I am currently still facing two questions that I unfortunately could neither find an answer to on the internet nor in this forum. In case I have missed the respective post, I apologize for duplicating the question and would greatly appreciate if you could hint me towards it.
I'm afraid I do not yet fully understand the process for conducting my regression analysis with interaction terms as I have read that the "main effect" (i.e. the significance and the coefficient of my X1 variable ln_rpat_rRD) cannot be easily interpreted anymore once interaction terms are included in the model.
Thus, my question is whether I need to pursue a step-wise process, or whether the entire analysis can be done in one model only.
By this, I mean the following:
1) Step-wise process: I first need to identify one regression model where I have a significant and positive relationship between X1 and Y with no interaction term included but rather including all relevant IVs as control variables only (I would assume I need this somehow as this positive and significant correlation between X1 and Y builds the basis for my entire analysis). Second, once I have identified this significant and positive correlation, I then separately add an interaction term to the model and analyze its significance and respective coefficient. In this way, I would test all three different interaction effects that I have hypothesized separately and never in combination in one model.
2) One full model: I include all three interaction terms in one model directly without having to go through this step-wise process. If this is the recommended approach, I would really need help in understanding
a) Whether to run my regressions and exclude non-significant interaction terms until I end up only having significant interaction terms in the model anymore. In different contexts, I have read differing opinions, with one stating that it might make sense to exclude them, and one stating to rather keep them in as they theoretically make sense.
b) How do I interpret the main effect of my X1-Y relationship. As I mentioned before, I need this relationship to be significant in order for the interpretation of the interaction terms to even make sense. Yet, I have no experience with regard to plotting and interpreting graphs in STATA as some might suggest, so if this is the case, I would greatly appreciate a more extensive answer.
I would be very happy if someone could help me.
Comment