Dear all,
I'm a student and a new user of Stata. I have some problems with the following regression. The purpose of my analysis is to see what is the relationship between age and the probability of being employed during the Great Recession. For this reason, I have created a binary variable called "young" and have used several control variables in a probit model. Basically, when I use the marginsplot command I can see a difference in the probability of being employed between those who are young and those who are not, particularly high for the lower weekly earnings values. However, when I create a new dummy variable "poor" and insert the interaction term "youngpoor" in another regression, the interaction coefficient is not statistically significant (both if I use a probit model both if I use a linear regression model) and I can't understand why.
I attach the files that describe the variables used and a short do-file to be able to replicate my results. The data file contains data on 5412 workers who were survey in the April 2008 Current Population Survey and reported that they were employed. The data file contains their employment status in April 2009, one year later, along with some additional variables.
Thanks in advance for your reply
I'm a student and a new user of Stata. I have some problems with the following regression. The purpose of my analysis is to see what is the relationship between age and the probability of being employed during the Great Recession. For this reason, I have created a binary variable called "young" and have used several control variables in a probit model. Basically, when I use the marginsplot command I can see a difference in the probability of being employed between those who are young and those who are not, particularly high for the lower weekly earnings values. However, when I create a new dummy variable "poor" and insert the interaction term "youngpoor" in another regression, the interaction coefficient is not statistically significant (both if I use a probit model both if I use a linear regression model) and I can't understand why.
I attach the files that describe the variables used and a short do-file to be able to replicate my results. The data file contains data on 5412 workers who were survey in the April 2008 Current Population Survey and reported that they were employed. The data file contains their employment status in April 2009, one year later, along with some additional variables.
Thanks in advance for your reply
Comment