Question regarding discrepant results using strate(cluster)

Leo Martinez

Join Date: May 2015
Posts: 82

Question regarding discrepant results using strate(cluster)

17 Aug 2018, 11:32

Hello,

I am doing a survival analysis using ~20 or so datasets. I would like to estimate the rate of my outcome at different time periods of my follow-up.
I have a question about the use of strate and stsplit using my data.

I have used stsplit to split up my follow-up time as seen using the code below:

Code:

stsplit time_split_merge, at(1,2,5,10)

Then I run strate adjusting for the clustering of each cohort as seenn below:

Code:

. strate time_split_mer if age_mer<19 & prosp==1 , per(100) graph cluster(study)

When i do this I get this output:

HTML Code:

. strate time_split_mer if age_mer<19 & prosp==1 , per(100) graph cluster(study)

         failure _d:  inc_dis_merged
   analysis time _t:  survival_tyrs_merged
                 id:  unique_id

Estimated rates (per 100) and lower/upper bounds of 95% confidence intervals
(30960 records included in the analysis)
The jackknife was used to calculate confidence intervals
41     total clusters identified by study and time_split_merge

  +------------------------------------------------------------+
  | time_s~e     D          Y       Rate      Lower      Upper |
  |------------------------------------------------------------|
  |        0   309   144.5146   2.138193   1.090785   5.851070 |
  |        1   100    70.3475   1.421515   0.780483   2.733694 |
  |        2    11    49.3407   0.222940   0.049534   0.903587 |
  |        5     0     0.2739   0.000000          .          . |
  +------------------------------------------------------------+
Note: Jackknife confidence intervals are missing because of an insufficient
      number of failures in the dataset.

However, when I do something (at least intuitively to me) seems like it should give me the same result I get vastly different results. For this I run strate and truncate the follow-up time to only include below the first year. This should (I would think) be equal to the first row above but instead I get vastly higher rates (11.65 rate versus 2.13).
Please see below code and output:

Code:

. strate if age_mer<19 & prosp==1 & survival_tyrs_mer<=1, per(100) graph cluster(study)

HTML Code:

. strate if age_mer<19 & prosp==1 & survival_tyrs_mer<=1, per(100) graph cluster(study)

         failure _d:  inc_dis_merged
   analysis time _t:  survival_tyrs_merged
                 id:  unique_id

Estimated rates (per 100) and lower/upper bounds of 95% confidence intervals
(3519 records included in the analysis)
The jackknife was used to calculate confidence intervals
13     clusters identified by study
Warning: graphical output suppressed - no groups to plot

  +--------------------------------------------+
  |   D         Y      Rate    Lower     Upper |
  |--------------------------------------------|
  | 309   26.5146   11.6540   6.5145   23.7337 |
  +--------------------------------------------+

Can anyone explain why the rates are so different? Am I doing or assuming something inaccurately about strate?

Thank you for all your help.
Leo

Tags: None

Announcement

Question regarding discrepant results using strate(cluster)