Dear Statalist,
I have data from two years. I want to use matching to compare individuals from Wave 2 with the individuals from the Wave 1. Therefore my treatment variable is Wave. I am mainly following the steps described by Garrido et al (2014) ‘Methods for Constructing and Assessing Propensity Scores’ to create propensity scores and then use kernel matching.
The first step is ‘Choice of Variables to Include in the Propensity Score’, where authors recommend running a probit regression with treatment as the outcome variable and the potential confounders as independent variables. However, the paper also says that when a variable is related to the outcome but not the treatment, is should be included as it should reduce bias. In contrary when the variable is related only to the treatment but not to the outcome are not beneficial for the calculation of the propensity scores.
From this, I understand that I should include the variables that are statistically significant when I run a probit regression with my original outcome variable.
My question is: Do I need to include Wave 2 (which is a dummy for wave) in this regression? Because some additional variables become significant when I control for Wave 2 in the probit regression. So I am not sure which one to use to calculate the propensity scores.
Best wishes,
Mirjana
I have data from two years. I want to use matching to compare individuals from Wave 2 with the individuals from the Wave 1. Therefore my treatment variable is Wave. I am mainly following the steps described by Garrido et al (2014) ‘Methods for Constructing and Assessing Propensity Scores’ to create propensity scores and then use kernel matching.
The first step is ‘Choice of Variables to Include in the Propensity Score’, where authors recommend running a probit regression with treatment as the outcome variable and the potential confounders as independent variables. However, the paper also says that when a variable is related to the outcome but not the treatment, is should be included as it should reduce bias. In contrary when the variable is related only to the treatment but not to the outcome are not beneficial for the calculation of the propensity scores.
From this, I understand that I should include the variables that are statistically significant when I run a probit regression with my original outcome variable.
My question is: Do I need to include Wave 2 (which is a dummy for wave) in this regression? Because some additional variables become significant when I control for Wave 2 in the probit regression. So I am not sure which one to use to calculate the propensity scores.
Best wishes,
Mirjana
Comment