Hey there, I'm sure this has a simple answer, but as someone new to Stata I'm finding getting the answer a little difficult.
I'm doing a CEA for a small pilot study which uses survey data to collect costs and resources, and unfortunately there's a little bit of missingness among survey responders (about 3%) and about a 8% survey non-response rate. My PI has asked me to do a complete case analysis (which is available for 94% of survey responders, and obviously 0% among non-responders) and the crux of my questions... a MI analysis. I knew next to nothing about MI, but after reviewing a few textbooks and the literature I have planned a MICE strategy based on the pattern of missingness and number of variables with missing data.
My problem is that because we are imputing at the disaggregate level (so individual cost and resource variables, these are important in our analysis so imputing at the aggregate doesn't work) and then summing these variables together (in a micro-costing approach) I'm not sure how Stata deals with this sorta simple summation across the level of the individual to generate a total cost (there are about 9 variables that are summed to generate the final costs). Beyond that, because some variables that are imputed will be resources consumed, they will need to be multiplied by a "unit-cost" (e.g. # of hospital nights*the cost of a night in hospital specific to the hospital for the individual) before the summing to total costs.
It's my understanding that basically this will require a fair bit of registering variables as passive to generate "intermediate" costs from resources*unit-cost and also summing across individuals for "total costs". But is there anything I'm missing? Is it as simple as generating an imputation model (PMM for costs, ordinal reg for resources etc.) and then using mi xeq to register a passive variable named "total costs" as the sum of the individual costs? (obviously with the background legwork for MCAR/MAR/MNAR, patterns of missingness etc. already done)
And once these are summed to a total cost, how does Stata use the new total costs in descriptive or regression analyses (as the independent or dependent variable)? Does everything get combined using Rubin's rules for the analysis?
Thanks!
Sebby
I'm doing a CEA for a small pilot study which uses survey data to collect costs and resources, and unfortunately there's a little bit of missingness among survey responders (about 3%) and about a 8% survey non-response rate. My PI has asked me to do a complete case analysis (which is available for 94% of survey responders, and obviously 0% among non-responders) and the crux of my questions... a MI analysis. I knew next to nothing about MI, but after reviewing a few textbooks and the literature I have planned a MICE strategy based on the pattern of missingness and number of variables with missing data.
My problem is that because we are imputing at the disaggregate level (so individual cost and resource variables, these are important in our analysis so imputing at the aggregate doesn't work) and then summing these variables together (in a micro-costing approach) I'm not sure how Stata deals with this sorta simple summation across the level of the individual to generate a total cost (there are about 9 variables that are summed to generate the final costs). Beyond that, because some variables that are imputed will be resources consumed, they will need to be multiplied by a "unit-cost" (e.g. # of hospital nights*the cost of a night in hospital specific to the hospital for the individual) before the summing to total costs.
It's my understanding that basically this will require a fair bit of registering variables as passive to generate "intermediate" costs from resources*unit-cost and also summing across individuals for "total costs". But is there anything I'm missing? Is it as simple as generating an imputation model (PMM for costs, ordinal reg for resources etc.) and then using mi xeq to register a passive variable named "total costs" as the sum of the individual costs? (obviously with the background legwork for MCAR/MAR/MNAR, patterns of missingness etc. already done)
And once these are summed to a total cost, how does Stata use the new total costs in descriptive or regression analyses (as the independent or dependent variable)? Does everything get combined using Rubin's rules for the analysis?
Thanks!
Sebby
Comment