Stata2Mplus Error: variables have a variance greater than the maximum allowed of 1000000.*

Yawo Kokuvi

Join Date: May 2015

Posts: 137
#1

Stata2Mplus Error: variables have a variance greater than the maximum allowed of 1000000.*

12 Apr 2017, 20:51

Hello,

I am trying to convert a dataset from STATA to Mplus. I used the stata2mplus program and converted the data. However, when I read it into Mplus, I get an error: One or more variables have a variance greater than the maximum allowed of 1000000.

The variables that are generating this error are those that have unique IDs and contain system weights, as in the following examples.

nhispid = 20000548840101
nhishid = 2000054884
serial = 32851
perwgt = 2735
sampwgt = 6881

one of the responses to my earlier posts suggested that Mplus usually have some difficulty dealing with long IDs. Could this be the case here?

I've also posted same question to the Mplus discussion list, but thought I will do same here, in case there is something I can do to change my input file generated by STATA.

Thanks again, cY.
Tags: None
Sergiy Radyakin

Join Date: Apr 2014

Posts: 1867
#2

13 Apr 2017, 10:40

The shortest answer is probably create a new set of IDs: 1,2,3,4,5 and drop the original IDs.
Then convert to Mplus again and see what happens.
1 like
Comment
Yawo Kokuvi

Join Date: May 2015

Posts: 137
#3

13 Apr 2017, 15:50

Thanks very much, Sergly. I agree, a new set of IDs would be in the order.

What should I do about the weight variables, particularly, perwgt and sampwgt, which are really critical for taking account of my complex sampling structure. Wouldn't I be messing with my sample weights if I replace those with unique IDs?

thanks - cY
Comment
wbuchanan

Join Date: Mar 2014

Posts: 1362
#4

14 Apr 2017, 05:38

You probably could handle it by specifying the necessary arguments to the usevariables parameters in the Mplus syntax and/or specifying those values directly as weights. There are similar parameters in Mplus to identify ID variables as well.
Comment
Sergiy Radyakin

Join Date: Apr 2014

Posts: 1867
#5

14 Apr 2017, 09:15

Yawo,

if some of your observations are 1,000,000 more times more influential than others, you probably will not miss much if you drop the observations with the smallest weights. On the other hand it is a reason to double check where the data is coming from, as it is not common at least in the world of the household surveys.

Best, Sergiy
Comment
Yawo Kokuvi

Join Date: May 2015

Posts: 137
#6

14 Apr 2017, 21:43

Thanks very much, folks. I was able to resolve the issue.

it turned out that there were a number of weight (household weight, annual weight) and ID (household, family, etc) variables in the dataset that I do not need. Upon carefully reading of the NHIS technical manuals, I deleted / removed all those I do not need, with the exception of "sampleweight", and used the IDVARIABLE ooption to specify the IDs (in this case, nhispid). This helped resolved the issue.

I can now proceed with further work on the dataset.

thanks again, and happy easter to you all !!

rgds, cY.
Comment

Announcement

Stata2Mplus Error: variables have a variance greater than the maximum allowed of 1000000.*

Comment

Comment

Comment

Comment

Comment