Dear everyone,
I am trying to learn about logit model by myself. However, I have got some problems that I can't find answers anywhere.
I am interested in the factors that influence whether a person choose to buy product A or product B. The outcome (response) variable Y is binary (0/1); product A or product B. The predictor variables of interest are the amount of money spent on that product per year (X1) and some characteristic variables (such as: age, gender,...) (X2, X3,...)
The problem is that there are some people bought both products. For example, person coded P0001 have two lines in dataset which have differences only in Y and X1:
Line 1: Y = 1, X1 = 1000, same X2, X3, ...
Line 2: Y = 0, X1 = 3000, same X2, X3, ...
And I tried to run 3 models with 3 different dataset:
Model 1: I kept all observations
Model 2: I dropped observations which bought both products.
Model 3: I kept observations which have larger value in X1. For example, for person coded P0001, I will drop line 1.
Could you please tell me which model is correct ?
I would really appreciate your help ! Sorry for my bad english.
Best regards,
Vinh
I am trying to learn about logit model by myself. However, I have got some problems that I can't find answers anywhere.
I am interested in the factors that influence whether a person choose to buy product A or product B. The outcome (response) variable Y is binary (0/1); product A or product B. The predictor variables of interest are the amount of money spent on that product per year (X1) and some characteristic variables (such as: age, gender,...) (X2, X3,...)
The problem is that there are some people bought both products. For example, person coded P0001 have two lines in dataset which have differences only in Y and X1:
Line 1: Y = 1, X1 = 1000, same X2, X3, ...
Line 2: Y = 0, X1 = 3000, same X2, X3, ...
And I tried to run 3 models with 3 different dataset:
Model 1: I kept all observations
Model 2: I dropped observations which bought both products.
Model 3: I kept observations which have larger value in X1. For example, for person coded P0001, I will drop line 1.
Could you please tell me which model is correct ?
I would really appreciate your help ! Sorry for my bad english.
Best regards,
Vinh
Comment