Hey guys,
I have two datasets: one with 38,62 data points, and the other 25,000.
I need to merge the large dataset into the small dataset (1:1).
Neither data sets have unique ids. However, they have three variables (region, commune, and village) in common. Three variables together make the data point unique, so I would like to create a unique id based on the three variables.
Now, after I create the unique ids for the smaller dataset, how do I merge these unique ids into the larger dataset, using the 3 key variables?
In other words, how do I make sure that the data points in my large datasets have the same unique ids asthe data points in the smaller dataset, if they have the same region, commune, and village names?
In addition, the large dataset has many more data points than the smaller one. I only need a consistent list of unique ids for both datasets, so I can merge the large one into the smaller one.
Thanks!
I have two datasets: one with 38,62 data points, and the other 25,000.
I need to merge the large dataset into the small dataset (1:1).
Neither data sets have unique ids. However, they have three variables (region, commune, and village) in common. Three variables together make the data point unique, so I would like to create a unique id based on the three variables.
Now, after I create the unique ids for the smaller dataset, how do I merge these unique ids into the larger dataset, using the 3 key variables?
In other words, how do I make sure that the data points in my large datasets have the same unique ids asthe data points in the smaller dataset, if they have the same region, commune, and village names?
In addition, the large dataset has many more data points than the smaller one. I only need a consistent list of unique ids for both datasets, so I can merge the large one into the smaller one.
Thanks!
Comment