In my dataset, I have the variable list: year cusip conm dlc dvt lt oancf revt teq xpr xsga xstf;
when I run duplicates report cusip year
I got about 5000 obs that have 2 copies.
I found that for those copies, the variables have differing missing values (below is an example)
year cusip conm dlc dvt lt oancf revt teq xpr xsga xstf
1998 G9618E107 WHITE MTNS INS GROUP LTD 748.5 13.1 2534.2 704.3 3.8 130.2
1998 G9618E107 WHITE MTNS INS GROUP LTD 806.2 13.1 2534.2 28.8 624.8
I want to merge the copies to minimize the number of missing values for each obs, and delete duplicates. How can I do it?
Thank you
when I run duplicates report cusip year
I got about 5000 obs that have 2 copies.
I found that for those copies, the variables have differing missing values (below is an example)
year cusip conm dlc dvt lt oancf revt teq xpr xsga xstf
1998 G9618E107 WHITE MTNS INS GROUP LTD 748.5 13.1 2534.2 704.3 3.8 130.2
1998 G9618E107 WHITE MTNS INS GROUP LTD 806.2 13.1 2534.2 28.8 624.8
I want to merge the copies to minimize the number of missing values for each obs, and delete duplicates. How can I do it?
Thank you
Comment