Hello All !
I want to model the determinants of the occupation status of individuals in the labor market.
To avoid bias selection issues I opted for the heckman two step model : the first model estimating the participation in labor force (Yes or No) first and the second one being the model of performance (indepedent variable is categorical - 5 groups : Protected wage earner, Unprotected wage earner, Self-employed, Non-paid, Unemployed).
I wish to run the two-step model to get the inverse Mills' ratio so as to include it in the performance model, and this is where it gets tricky, the two models should be run for two different samples : the first estimation is run on a sample of 158974, including those who are active and non active in the labor market whereas the second model is run only for those who are participating in the labor force (79 902).
Is there any command to specify the samples on which every equation is run in order to obtain an accurate inverse Mills' ratio?
Thanks in advance for any kind of help
I want to model the determinants of the occupation status of individuals in the labor market.
To avoid bias selection issues I opted for the heckman two step model : the first model estimating the participation in labor force (Yes or No) first and the second one being the model of performance (indepedent variable is categorical - 5 groups : Protected wage earner, Unprotected wage earner, Self-employed, Non-paid, Unemployed).
I wish to run the two-step model to get the inverse Mills' ratio so as to include it in the performance model, and this is where it gets tricky, the two models should be run for two different samples : the first estimation is run on a sample of 158974, including those who are active and non active in the labor market whereas the second model is run only for those who are participating in the labor force (79 902).
Is there any command to specify the samples on which every equation is run in order to obtain an accurate inverse Mills' ratio?
Thanks in advance for any kind of help
Comment