I am trying to calculate a Heckman adjustment because there is sample selection bias in my data. My sample includes all convicted defendants. My dependent variable is a binary outcome comparing defendants who were sentenced to prison (coded 1) versus defendants who were not sentenced to prison (coded 0). I have tried to calculate the Heckman multiple times. When I include most of my independent variables, I receive the error message: Dependent variable could not censor because of selection and it could be reduced to OLS regression. I was warned that there may be multicollinearity between some of the measures in my data set. To deal with this issue, I attempted to build the most basic Heckman model with just a couple of my independent variables, trying to get the model to run at all. Below is the syntax that I used.
heckman prissent SEX_R, select (prissent= SEX_R FELONY1) twostep
I receive the following error message: prissent collinear with _cons
My sample is all convicted offenders, so it makes sense that prison sentence and convicted would be related, but I feel like something else is going on. I do not have any missing values. I also checked my binary outcome (prison sentence) and I have variation. I have tried suppressing the constant term but I receive the same message.
Would anyone have a suggestion on what I might be doing wrong? Granted the syntax above only includes two independent variables (defendant sex and felony 1 offense), but I continue to receive the same error message regardless of the combination of IVs I include.
Thank you!
heckman prissent SEX_R, select (prissent= SEX_R FELONY1) twostep
I receive the following error message: prissent collinear with _cons
My sample is all convicted offenders, so it makes sense that prison sentence and convicted would be related, but I feel like something else is going on. I do not have any missing values. I also checked my binary outcome (prison sentence) and I have variation. I have tried suppressing the constant term but I receive the same message.
Would anyone have a suggestion on what I might be doing wrong? Granted the syntax above only includes two independent variables (defendant sex and felony 1 offense), but I continue to receive the same error message regardless of the combination of IVs I include.
Thank you!
Comment