Hi everyone
I am currently using logistic regression for my analysis. Given that the NHANES (the data set am using) recommended that if a variable is missing less than 10% of values, it usually normal to carry on with the analysis without any further adjustment. However, while my individual predictor variables met this criteria (with number of respondents who had missing data and those who reported "don't know" in each variable amounting to less than 10%), including all the predictor in the multiple logistic regression model resulted in more than 10% of the missing data (see number of observation in the unadjusted and adjusted model below. Is it possible that this missing values may have altered the statistical significance of the final model? if so, how do I resolved it given that I am using survey data.

Secondly, my secondary outcome variable had upto 1098 respondents coded as missing (that is "dot" in the survey data with little or no information about the nature of the missing) out of 7765 participants - which is more than 10%. Since the outcome variable is based on cut-off value, how do I manage this?
I am currently using logistic regression for my analysis. Given that the NHANES (the data set am using) recommended that if a variable is missing less than 10% of values, it usually normal to carry on with the analysis without any further adjustment. However, while my individual predictor variables met this criteria (with number of respondents who had missing data and those who reported "don't know" in each variable amounting to less than 10%), including all the predictor in the multiple logistic regression model resulted in more than 10% of the missing data (see number of observation in the unadjusted and adjusted model below. Is it possible that this missing values may have altered the statistical significance of the final model? if so, how do I resolved it given that I am using survey data.
Secondly, my secondary outcome variable had upto 1098 respondents coded as missing (that is "dot" in the survey data with little or no information about the nature of the missing) out of 7765 participants - which is more than 10%. Since the outcome variable is based on cut-off value, how do I manage this?
Comment