Imputation advice for panel data with goal of survival analysis

Leonardo Guizzetti

Join Date: Jul 2016

Posts: 2406
#1

25 Jun 2023, 10:22

This is a statistics question for the group. I am looking for recommendations or literature to guide what I should do.

The data I have are panel data for people who come to a clinic, start a treatment, and are followed up at regular intervals (every 3 months for up to 1 year). The data are observational and have already been collected, so there's not much I can do about the design. In terms of dimensions, I have 5 timepoints (one prior to starting treatment and up to 4 follow-up visits), and N >> T.

Some people will have missing follow-up visits, for reasons I don't know. Given the time frame, it is not unreasonable that some missed visits are due to the pandemic, while others may be informative, depending on how patients perceive the treatment to be working (or not). Some will miss certain follow-up visits but show up to others. As a result, every possible pattern of missing data occurs.

Though the data arise from repeated follow-up visits, the goal is to infer time to initial response to therapy. For this purpose, at some point I would need to examine the duration in a time-to-event analysis (probably using discrete-time methods, since visits occur at nominal intervals).
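To make the discrete-time setup concrete, here is a minimal sketch in plain Python of the life-table hazard at each follow-up visit, before any missing-data handling. The `(last_visit, responded)` record layout and the function name `discrete_hazards` are illustrative assumptions, not something taken from the actual dataset.

```python
from collections import Counter

def discrete_hazards(records, n_visits=4):
    """Life-table estimate of the discrete-time hazard of first response.

    records: list of (last_visit, responded) pairs, one per person
             (hypothetical layout). last_visit is the follow-up visit
             (1..n_visits) at which response was first seen, or the last
             visit attended without response (censored).
    """
    at_risk = Counter()
    events = Counter()
    for last_visit, responded in records:
        # A person contributes to the risk set at every visit up to last_visit.
        for t in range(1, last_visit + 1):
            at_risk[t] += 1
        if responded:
            events[last_visit] += 1
    # Hazard at visit t = first responses at t / people still at risk at t.
    return {t: events[t] / at_risk[t]
            for t in range(1, n_visits + 1) if at_risk[t] > 0}

example = [(1, True), (2, True), (2, False), (4, False)]
hazards = discrete_hazards(example)
```

In practice the same person-period layout would feed a logit or complementary log-log model rather than raw proportions; the sketch only shows how the risk sets are built.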

This got me thinking about what the recommended strategies for imputation are in this kind of scenario. I am certain that some of the missing data are missing at random (MAR), driven by things like symptom burden, while some may be missing not at random (MNAR) if the person perceives the therapy to be ineffective or effective.

Right now I am considering some sensitivity analyses where I assume a best case and a worst case for the first missing values. These aren't perfect, because they are certain to over- or underestimate the hazards, but they might be useful to "bracket" the plausible range of estimates.
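As a rough illustration of that bracketing idea, the sketch below fills missing visits with "response" (best case) or "no response" (worst case) and derives the time to first response under each. The data layout (one list of 0/1/None per person), the function names, and the choice to fill every missing visit rather than only the first one are simplifying assumptions for illustration.

```python
def first_response(visits, fill):
    """Return (time, event) for one person.

    visits: follow-up outcomes in visit order; 1 = response,
            0 = no response, None = missed visit (hypothetical coding).
    fill:   value substituted for every missed visit (1 = best case,
            0 = worst case).
    """
    filled = [fill if v is None else v for v in visits]
    for t, v in enumerate(filled, start=1):
        if v == 1:
            return t, True          # first response at visit t
    return len(filled), False       # no response: censored at last visit

def bracket(panel):
    """Best-case and worst-case (time, event) pairs for a whole panel."""
    best = [first_response(v, 1) for v in panel]
    worst = [first_response(v, 0) for v in panel]
    return best, worst

panel = [
    [0, None, 1, 1],           # missed visit 2, responded by visit 3
    [None, None, None, None],  # never came back
    [0, 0, None, 0],           # missed visit 3 only
]
best, worst = bracket(panel)
```

Fitting the same discrete-time model to the `best` and `worst` versions gives the two ends of the bracket; neither is a plausible truth on its own, which is the limitation already noted above.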

I would appreciate any thoughts or pointers to literature that considers this type of scenario.
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17735
#2

25 Jun 2023, 11:49

Leonardo:
1) I would support the idea of sensitivity analysis;
2) I'd take a look at Stef van Buuren's entries on PubMed (I'm away from my desk at the moment and cannot provide you with the links to his articles). If I'm not mistaken, Stef and colleagues published an article in Statistics in Medicine in 1999 about handling MAR and MNAR missing values in blood pressure data collected among a group of elderly people in Leiden (The Netherlands).

Kind regards,
Carlo
(Stata 19.0)
Leonardo Guizzetti

Join Date: Jul 2016

Posts: 2406
#3

25 Jun 2023, 12:58

Thank you, Carlo, for both recommendations. That info is enough for me to google the article.
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17735
#4

26 Jun 2023, 00:38

Leonardo:
this is the link to one of my favourite papers by Stef van Buuren on missing values management: Multiple imputation of missing blood pressure covariates in survival analysis - PubMed (nih.gov)

Last edited by Carlo Lazzaro; 26 Jun 2023, 00:47.

Kind regards,
Carlo
(Stata 19.0)
Leonardo Guizzetti

Join Date: Jul 2016

Posts: 2406
#5

26 Jun 2023, 09:04

Thank you for the link, Carlo.