Hello.
I would please require some methodological advice about how to test the impact of COVID-19 regulations on a nationwide human cohort. It would be very kind if you could advice me which method would be appropriate in this context.
Let M be the index month when a disease of interest is detected. M is a function M(i) of the individual human i. For each index month M(i) ranging from January 2011 to December 2022, my cohort contains an average of about 10000 new cases subject. Each human case subject is matched to a human control subject by age and gender.
Hence there are nearly 3 millions individual humans in this whole cohort. I hope calling people "individual humans", "human case subject" and "human control subject" is not offensive. I apologize if it is offensive.
The follow up for each individual human starts on January of the year before the year of the index month of the case subject and stops in December 2022. For instance, if the index month is May 2013, the follow up starts on January 2012. We observe nearly half the cases subjects and a third of the control subjects die during this follow up.
For each individual human i, whether case or control subject, we have the monthly observation of 9 binary variables X1 to X9. Let x1(t,i) to x9(t,i) be the binary values observed for the binary variables X1 to X9 at the month t for the individual human i, if the individual human i is still alive at month t. Previous studies have shown that the values took by X1 to X9 at month t have a non-linear relation with the time elapsed between the index month M and the current month t.
Here are the statistical hypothesis we would like to test. Null hypothesis H0: the odd ratio OR(X1|M,X1,...,X9) of X1 for cases subjects relatively to their matched control, given the index month M and the observed values of the binary variables X2 to X9, is independent from the current month t. Alternative hypothesis H1: the odd ratio OR(X1|M,X1,...,X9) of X1 for cases subjects relatively to their matched control, given the index month M and the observed values of the binary variables X2 to X9, is significantly bigger with 95% confidence when the current month t is greater or equal to March 2020 than when the current month t is strictly inferior to March 2020.
We don't know what test statistic we shall use. It would be very kind if you could advice us which method would be appropriate for this subject. I was thinking of Cox model with shared frailty on the index month M(i) as a qualitative variable, including in the model a binary time dependent variable indicating if the current month is greater or equal to March 2020. Would you do the same? Would you know a better solution?
Best regards.
Axel Renoux, biostatistician, Toulouse university hospital
I would please require some methodological advice about how to test the impact of COVID-19 regulations on a nationwide human cohort. It would be very kind if you could advice me which method would be appropriate in this context.
Let M be the index month when a disease of interest is detected. M is a function M(i) of the individual human i. For each index month M(i) ranging from January 2011 to December 2022, my cohort contains an average of about 10000 new cases subject. Each human case subject is matched to a human control subject by age and gender.
Hence there are nearly 3 millions individual humans in this whole cohort. I hope calling people "individual humans", "human case subject" and "human control subject" is not offensive. I apologize if it is offensive.
The follow up for each individual human starts on January of the year before the year of the index month of the case subject and stops in December 2022. For instance, if the index month is May 2013, the follow up starts on January 2012. We observe nearly half the cases subjects and a third of the control subjects die during this follow up.
For each individual human i, whether case or control subject, we have the monthly observation of 9 binary variables X1 to X9. Let x1(t,i) to x9(t,i) be the binary values observed for the binary variables X1 to X9 at the month t for the individual human i, if the individual human i is still alive at month t. Previous studies have shown that the values took by X1 to X9 at month t have a non-linear relation with the time elapsed between the index month M and the current month t.
Here are the statistical hypothesis we would like to test. Null hypothesis H0: the odd ratio OR(X1|M,X1,...,X9) of X1 for cases subjects relatively to their matched control, given the index month M and the observed values of the binary variables X2 to X9, is independent from the current month t. Alternative hypothesis H1: the odd ratio OR(X1|M,X1,...,X9) of X1 for cases subjects relatively to their matched control, given the index month M and the observed values of the binary variables X2 to X9, is significantly bigger with 95% confidence when the current month t is greater or equal to March 2020 than when the current month t is strictly inferior to March 2020.
We don't know what test statistic we shall use. It would be very kind if you could advice us which method would be appropriate for this subject. I was thinking of Cox model with shared frailty on the index month M(i) as a qualitative variable, including in the model a binary time dependent variable indicating if the current month is greater or equal to March 2020. Would you do the same? Would you know a better solution?
Best regards.
Axel Renoux, biostatistician, Toulouse university hospital
Comment