Dear StataListers,
I am working with panel data on outcomes after landowners complete time in conservation programs. My data can be characterized as multiple-failure data, because a subset of landowners can enter/exit the program more than once and can fail more than once after exiting (failure = discontinuation of conservation action). The failures are ordered. The Stata manual and Mario Cleves's piece (https://www.stata.com/support/faqs/statistics/multiple-failure-time-data/) have been very helpful in setting up my data for analysis.
But could you help with the intuition behind the different "stset" options in Mario Cleves's piece? Specifically, sections "3.2.3 The conditional risk set model (time from entry)" and "3.2.4 The conditional risk set model (time from the previous event)" both discuss alternatives for working with ordered failure data. I can set up my data both ways, though the second makes more sense to me (resetting the clock). The difference is that in the first case the id() option is specified in stset, whereas in the second id() is not specified and the standard errors are instead clustered on the id variable. The Stata manual notes "Specifying id() never hurts". I think I understand why it is necessary to specify id() in the first case, but I am not 100% sure why it is omitted from stset in the second alternative. Is it redundant?
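For concreteness, here is a minimal sketch of the two setups as I understand them. The variable names are hypothetical placeholders of mine (id = landowner, t0 and t = start and end of each record on the landowner's clock, d = 1 if the record ends in failure, spell = order of the failure within landowner, x = a covariate), and the exact options used in the FAQ may differ:

```stata
* Hypothetical variables (my own names, not taken from the FAQ):
*   id    landowner identifier
*   t0 t  start and end of each record, measured from first entry into risk
*   d     1 if the record ends in a failure, 0 if censored
*   spell order of the failure within landowner (1, 2, ...)
*   x     covariate

* Time from entry: records keep their place on the landowner's own clock.
* id() links the records of one landowner; exit(time .) keeps the landowner
* at risk after the first failure.
stset t, failure(d) id(id) time0(t0) exit(time .)
stcox x, efron strata(spell) vce(robust)

* Time from the previous event: the clock restarts at zero for every record.
* No id() in stset here; standard errors are clustered on id in stcox instead.
generate gap = t - t0
stset gap, failure(d)
stcox x, efron strata(spell) vce(cluster id)
```

In the first block I am relying on vce(robust) clustering on the id() variable once it has been stset; in the second block the clustering has to be requested explicitly.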
Thank you for your time.