Hello Stata,
I have 38,420 observations and 12 variables. Each variable is a numerical attribute of observation, the higher, the better. I want to select observations based on these variables by first throwing out dominated observations. That is, if observation 2 is lower than observation 1 in all attributes, I delete observation 2. Given the number of observations in my data, assuming that I do the comparison two at a time, I will be doing 15 rounds of comparisons, or 38,419 comparisons in total.
Is there any Stata command or package that can help me do it? If not, do you have any recommendations on the algorithm I should use?
Thank you so much.
I have 38,420 observations and 12 variables. Each variable is a numerical attribute of observation, the higher, the better. I want to select observations based on these variables by first throwing out dominated observations. That is, if observation 2 is lower than observation 1 in all attributes, I delete observation 2. Given the number of observations in my data, assuming that I do the comparison two at a time, I will be doing 15 rounds of comparisons, or 38,419 comparisons in total.
Is there any Stata command or package that can help me do it? If not, do you have any recommendations on the algorithm I should use?
Thank you so much.
Comment