imputation using multilevel data with additional info for the aggregate level

Will Hauser

Join Date: Apr 2015

Posts: 32
#1

imputation using multilevel data with additional info for the aggregate level

16 Jul 2015, 16:43

The basic issue is that I have data at two levels, both levels have missing data, but I have 'more' cases at the macro level whose information I want to incorporate into the imputation of the missing macro level data.

The data are survey data combined with macro level data on the county where R resides. At the individual level I have about 1,000 cases nested in about 400 counties. The full macro consists of about 3,600 counties. There is some missing data at the county level that I want to impute. There is also missing data at the individual level that I don't think I want to impute - I think reviewers will be more comfortable with the aggregate imputation then the imputation of survey responses (this is just a hunch and it's secondary to the basic issue).

So I could throw out all counties where there is no respondent, that leaves me with about 400 counties, and then impute county level data using data only from those 400 counties. But I want to make use of all 3,600 counties to impute the county level data and then run the actual models using just the 1,000 respondents and 400 counties in the original data.

I registered the survey variables as regular (they do have some missing values but again, I don't want to impute them at this time). I then register the county-level data as imputed. Then I run the imputation. Data are in flong format and variables to be imputed are continuous. Code is as follows:

mi mvn countyvar1...countyvarN = surveyvar1.....surveyvarN

I get warning that "the imputed data contain missing values" and the process halts.
I assume that this is because the survey data contain missing responses (which I don't want imputed) and because many of the counties are missing all the survey level data (because no respondents to the survey lived in those counties). I can force it, but I'm reluctant.

I'm not sure I'm going about this the right way at all. Even if I impute all the survey data as well as the county level data, the problem won't go away because most of the counties used for the imputation don't have any survey data attached. To be clear, I don't want to impute survey data for counties where there were no individuals selected to take the survey - that would be silly - but I want to make use of all the county level data I have to get a precise imputation of the missing county level data.

Surely there must be a way to do the imputation and incorporate the entire county level data in the prediction even though I will ultimately use about 400 of the counties in the analysis. I appreciate any advice on the matter.

Will

edited to add: I'm not actually looking to run a multi-level model. There aren't enough individuals in each county for that, but the data are technically nested and that fact is at the heart of the problem.

Last edited by Will Hauser; 16 Jul 2015, 16:50.
Tags: multiple imputation
Will Hauser

Join Date: Apr 2015

Posts: 32
#2

16 Jul 2015, 19:58

I thought maybe I could work this out using the hard missing codes but that does not seem to be the case. For my predictor variables that have missing values and for which I do not wish to do any imputing I set the missing values (".") as hard missing (".a") and I still get the error code that " missing imputed values produced. This may occur when imputation variables are used as independent variables or when independent variables contain missing values. You can specify option force if you wish to proceed anyway."

Does anyone have any advice? I'm feeling like I've opened a can of worms.
Comment
Mohammad Mansour

Join Date: Jan 2021

Posts: 21
#3

23 Jan 2021, 15:39

Hello, I am trying to use collapse with MI data. I am imputing categorical variables (dependent variable is a count). I am trying to aggregate the data across all five imputations, but collapse wont work. Any idea on hoe I can perform collapse manually using mi xeq? I want to aggregate data by time and a stateID and by time alone........
Comment
daniel klein

Join Date: Mar 2014

Posts: 3885
#4

23 Jan 2021, 15:42

Will you stop posting into every thread that has some relation to MI (or no, as the Wishlist), please!? You are not doing yourself any favor here. I have responded substantially here.
2 likes
Comment
Mohammad Mansour

Join Date: Jan 2021

Posts: 21
#5

23 Jan 2021, 16:51

I am really sorry, as I did not know how to create a post and I am new to this. May you direct me please no how to post? Regards
Comment

Announcement

imputation using multilevel data with additional info for the aggregate level

Comment

Comment

Comment

Comment