I am merging a dataset with ~2.2 million students to a dataset with ~5,900 schools. I am merging on a school ID (called idschool) and the year in which data were collected.
If I run this command, I get 1.9 million matches:
merge m:1 year idschool using schools, nogenerate keep(match)
But if I run the same command with the update option --
merge m: 1 year idschool using schools, nogenerate keep(match) update
-- I only get ~1.7 million matches.
I don't get it. Why would the update option affect the number of matches?
Many thanks,
Paul
If I run this command, I get 1.9 million matches:
merge m:1 year idschool using schools, nogenerate keep(match)
But if I run the same command with the update option --
merge m: 1 year idschool using schools, nogenerate keep(match) update
-- I only get ~1.7 million matches.
I don't get it. Why would the update option affect the number of matches?
Many thanks,
Paul
Comment