Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Question regarding discrepant results using strate(cluster)

    Hello,

    I am doing a survival analysis using ~20 or so datasets. I would like to estimate the rate of my outcome at different time periods of my follow-up.
    I have a question about the use of strate and stsplit using my data.

    I have used stsplit to split up my follow-up time as seen using the code below:
    Code:
    stsplit time_split_merge, at(1,2,5,10)
    Then I run strate adjusting for the clustering of each cohort as seenn below:
    Code:
    . strate time_split_mer if age_mer<19 & prosp==1 , per(100) graph cluster(study)
    When i do this I get this output:
    HTML Code:
    . strate time_split_mer if age_mer<19 & prosp==1 , per(100) graph cluster(study)
    
             failure _d:  inc_dis_merged
       analysis time _t:  survival_tyrs_merged
                     id:  unique_id
    
    Estimated rates (per 100) and lower/upper bounds of 95% confidence intervals
    (30960 records included in the analysis)
    The jackknife was used to calculate confidence intervals
    41     total clusters identified by study and time_split_merge
    
      +------------------------------------------------------------+
      | time_s~e     D          Y       Rate      Lower      Upper |
      |------------------------------------------------------------|
      |        0   309   144.5146   2.138193   1.090785   5.851070 |
      |        1   100    70.3475   1.421515   0.780483   2.733694 |
      |        2    11    49.3407   0.222940   0.049534   0.903587 |
      |        5     0     0.2739   0.000000          .          . |
      +------------------------------------------------------------+
    Note: Jackknife confidence intervals are missing because of an insufficient
          number of failures in the dataset.

    However, when I do something (at least intuitively to me) seems like it should give me the same result I get vastly different results. For this I run strate and truncate the follow-up time to only include below the first year. This should (I would think) be equal to the first row above but instead I get vastly higher rates (11.65 rate versus 2.13).
    Please see below code and output:
    Code:
    . strate if age_mer<19 & prosp==1 & survival_tyrs_mer<=1, per(100) graph cluster(study)

    HTML Code:
    . strate if age_mer<19 & prosp==1 & survival_tyrs_mer<=1, per(100) graph cluster(study)
    
             failure _d:  inc_dis_merged
       analysis time _t:  survival_tyrs_merged
                     id:  unique_id
    
    Estimated rates (per 100) and lower/upper bounds of 95% confidence intervals
    (3519 records included in the analysis)
    The jackknife was used to calculate confidence intervals
    13     clusters identified by study
    Warning: graphical output suppressed - no groups to plot
    
      +--------------------------------------------+
      |   D         Y      Rate    Lower     Upper |
      |--------------------------------------------|
      | 309   26.5146   11.6540   6.5145   23.7337 |
      +--------------------------------------------+
    Can anyone explain why the rates are so different? Am I doing or assuming something inaccurately about strate?

    Thank you for all your help.
    Leo
Working...
X