Hey all,
The question I have is not so much a Stata-specific question, but I did not know where to ask elsewhere with this empirical kind of question (if you have suggestions about where to find such a forum, feel free to do so!).
I'm working on an assignment and I have used a binary logistic regression model. Now I'm writing down the model for estimation, but I was wondering if I should include reference categories in the description of my regression equation. For instance, I have innovation (m), a categorical variable (1 = less than 25% (reference group); 2 = 25-50%; 3= 75-90%; 4= more than 90%), as one of my independent variables, but I'm not sure whether I should include the reference group in describing my model. Likewise, I have an independent variable age (k) with 5 age categories (1 = age 18-25 (reference group); 2 = 25-45; 3= 45-65; 4= older than 65).
Right now, I've written down the following in my chapter about the empirical model: "m is the index of four categories of innovation (m = 1, 2, 3, 4) and k is the index of five categories of age (k = 1, 2, 3, 4, 5)". These two represent option A, so to speak. I would like to know whether this is correct or if, because the first innovation and age category are my the reference groups, I should write something like this: "m is the index of three categories of innovation (m = 2, 3, 4) and k is the index of four categories of age (2, 3, 4, 5)". These represent option B.
Now my question is: should I include my reference group in the description of my model (like in option A) or should I remove it (like in option B) for both innovation and age?
Hopefully my small problem is clear and someone can help me, thanks in advance!
Tim
The question I have is not so much a Stata-specific question, but I did not know where to ask elsewhere with this empirical kind of question (if you have suggestions about where to find such a forum, feel free to do so!).
I'm working on an assignment and I have used a binary logistic regression model. Now I'm writing down the model for estimation, but I was wondering if I should include reference categories in the description of my regression equation. For instance, I have innovation (m), a categorical variable (1 = less than 25% (reference group); 2 = 25-50%; 3= 75-90%; 4= more than 90%), as one of my independent variables, but I'm not sure whether I should include the reference group in describing my model. Likewise, I have an independent variable age (k) with 5 age categories (1 = age 18-25 (reference group); 2 = 25-45; 3= 45-65; 4= older than 65).
Right now, I've written down the following in my chapter about the empirical model: "m is the index of four categories of innovation (m = 1, 2, 3, 4) and k is the index of five categories of age (k = 1, 2, 3, 4, 5)". These two represent option A, so to speak. I would like to know whether this is correct or if, because the first innovation and age category are my the reference groups, I should write something like this: "m is the index of three categories of innovation (m = 2, 3, 4) and k is the index of four categories of age (2, 3, 4, 5)". These represent option B.
Now my question is: should I include my reference group in the description of my model (like in option A) or should I remove it (like in option B) for both innovation and age?
Hopefully my small problem is clear and someone can help me, thanks in advance!
Tim
Comment