Dear Stata Users,
I am puzzled by the Stata output for the command stsum. I am working with multi-episode data (split as suggested in the stsum documentation) split by category and would like to determine within-category percentiles for survival time for my dataset. Additionally, I have a variable spellLength which captures the difference between the end of the spell (endDate) and the start of the spell (startDate) in days. However using the summary command with the by() option to find the percentiles for survival using spellLength gives different percentiles than stsum also using the by option. I have performed the Kaplan-Meier calculation in Excel and get the same results as the summary command in Stata.
Could someone please help me understand what may be leading to this discrepancy?
I am running Stata 15.1 on a Unix machine.
Thank you!
Jonathan Gomez Martinez
I am puzzled by the Stata output for the command stsum. I am working with multi-episode data (split as suggested in the stsum documentation) split by category and would like to determine within-category percentiles for survival time for my dataset. Additionally, I have a variable spellLength which captures the difference between the end of the spell (endDate) and the start of the spell (startDate) in days. However using the summary command with the by() option to find the percentiles for survival using spellLength gives different percentiles than stsum also using the by option. I have performed the Kaplan-Meier calculation in Excel and get the same results as the summary command in Stata.
Could someone please help me understand what may be leading to this discrepancy?
I am running Stata 15.1 on a Unix machine.
Thank you!
Jonathan Gomez Martinez
Comment