Dear Stata users,
Today I am writing with a question that has been troubling me. Here is a health-related example: suppose one has balanced panel data on a population, containing health status (a dummy variable equal to 1 if the individual is sick and 0 otherwise) and several other variables such as employment, age, city of residence, whether the individual has health insurance, and so on.
Not everyone in the population has the same risk of being sick, and my interest here is to define a subgroup of the population that can be considered at risk of "success" (i.e. of developing one specific illness).
I am wondering what sort of strategy can be used to define this subpopulation from the information available on the individuals who experience success.
My dataset is a two-wave balanced panel of 2000 individuals, of whom 33 experience "success" (dummy = 1 for health status).
The only strategy I have thought of so far is:
- Use a propensity score: estimate it from the available covariates, then use the scores of the sick individuals to define a propensity score range within which individuals can be considered at risk.
With Stata code such as:
pscore seek age origin employment_status city_residence insurance smoker ..., pscore(pscore) blockid(block) detail logit level(0.1) numblo(5)
The range is then defined by two extreme values: the lowest and the highest propensity score among the individuals who do experience success.
I haven't found much information about the validity of this approach, so I am writing to ask for advice on this specific issue.
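To make the range definition concrete, here is a minimal sketch of what I have in mind. It assumes the pscore command above has already run and created the variable pscore; the names ps_lo, ps_hi and at_risk are only illustrative.

```stata
* Assumes the variable pscore exists and seek is the 0/1 health dummy
summarize pscore if seek == 1

* Store the lowest and highest score among individuals with seek == 1
scalar ps_lo = r(min)
scalar ps_hi = r(max)

* Flag everyone whose score falls inside that range (hypothetical variable name)
generate byte at_risk = inrange(pscore, scalar(ps_lo), scalar(ps_hi)) if !missing(pscore)

* Cross-tabulate the flag against observed health status
tabulate at_risk seek
```

By construction every individual with seek == 1 and a non-missing score falls inside the range, so the cross-tabulation only shows how many of the others are flagged as at risk.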
Best regards,