Hi all,
I've data from a longitudinal study, where for each patient I have several observations, one every time they got exposed. The outcome of interest is leukaemia, and the risk of the patient developing leukaemia increases with accumulated exposure. Once the patient is diagnosed with leukaemia or reaches the end of the follow-up is out of the study.
Initially, I modelled it using Cox proportional hazards for multiple observations, creating a final dummy observation with no dose associated where it is stated if the patient has leukaemia or not at the time of exiting the study. I created also a cumulative exposure variable, where the exposure is added up observation after observation.
Find below an excerpt of the snapshot database:
An abbreviated version of what I did was:
I snapspanned the data by:
. snapspan Idpatient age leuk, generate(t0_age) replace
the result was:
and then I stsetted the data by:
. stset age , failure(leuk) id(Idpatient) origin (t0_age)
and run a Cox model with it.
stcox cum_dose
Is the Cox proportional hazard a correct approach for cumulative exposures if we are interested in modelling how the leukaemia hazard proportionally scales by the cumulative dose? I've found lots of examples in the stata guide for different designs but nothing quite like this.
Any help on this will be highly appreciated.
Thank you very much,
Ma
I've data from a longitudinal study, where for each patient I have several observations, one every time they got exposed. The outcome of interest is leukaemia, and the risk of the patient developing leukaemia increases with accumulated exposure. Once the patient is diagnosed with leukaemia or reaches the end of the follow-up is out of the study.
Initially, I modelled it using Cox proportional hazards for multiple observations, creating a final dummy observation with no dose associated where it is stated if the patient has leukaemia or not at the time of exiting the study. I created also a cumulative exposure variable, where the exposure is added up observation after observation.
Find below an excerpt of the snapshot database:
Idpatient | leuk | age | dose | cum_dose |
1 | 0 | 2 | 0,3 | 0,3 |
1 | 0 | 3 | 0,6 | 0,9 |
1 | 0 | 5 | 0,8 | 1,7 |
1 | 0 | 7 | 0,9 | 2,6 |
1 | 0 | 8 | 0,4 | 2,7 |
1 | 1 | 12 | . | 2,7 |
2 | 0 | 5 | 0,6 | 0,6 |
2 | 0 | 7 | 0,8 | 1,4 |
2 | 0 | 8 | . | 1,4 |
3 | 0 | 1 | 0,1 | 0.1 |
3 | 0 | 2 | 0,3 | 0,4 |
3 | 0 | 3 | 0,5 | 0,9 |
3 | 0 | 7 | 0,8 | 1,7 |
3 | 1 | 8 | . | 1,7 |
I snapspanned the data by:
. snapspan Idpatient age leuk, generate(t0_age) replace
the result was:
Idpatient | leuk | t0_age | age | dose | cum_dose |
1 | 0 | . | 2 | . | . |
1 | 0 | 2 | 3 | 0,3 | 0,3 |
1 | 0 | 3 | 5 | 0,6 | 0,9 |
1 | 0 | 5 | 7 | 0,8 | 1,7 |
1 | 0 | 7 | 8 | 0,9 | 2,6 |
1 | 1 | 8 | 12 | 0,4 | 2,7 |
2 | 0 | . | 5 | . | . |
2 | 0 | 5 | 7 | 0,6 | 0,6 |
2 | 0 | 7 | 8 | 0,8 | 1,4 |
3 | 0 | . | 1 | . | . |
3 | 0 | 1 | 2 | 0,1 | 0.1 |
3 | 0 | 2 | 3 | 0,3 | 0,4 |
3 | 0 | 3 | 7 | 0,5 | 0,9 |
3 | 1 | 7 | 8 | 0,8 | 1,7 |
and then I stsetted the data by:
. stset age , failure(leuk) id(Idpatient) origin (t0_age)
and run a Cox model with it.
stcox cum_dose
Is the Cox proportional hazard a correct approach for cumulative exposures if we are interested in modelling how the leukaemia hazard proportionally scales by the cumulative dose? I've found lots of examples in the stata guide for different designs but nothing quite like this.
Any help on this will be highly appreciated.
Thank you very much,
Ma
Comment