Hi all,
I currently have 3 data sets.
1) Deal level data on M&A's.
2) Patent data for the target firms.
3) Patent data for the acquiring firms in the dataset.
The datasets 2 and 3 are just snippets of the patent data. Each target and acquiring firm have a compustat name and unique assignee code (asscode). I guess I have to do a merge twice, once by ass code, and once by name, then I want to remove the duplicates.
My aim is to have the M&A deal dataset (1) with all the patents of the acquiring and target firm (after that I plan to exploit the duplicates to see the overlaps). For example, all the patents corresponding to acquirorname = "ABBOT LABORATORIES" as well as all the patents acquirorass = 735 from dataset 3), as well as all the patents from dataset 2) corresponding to targetname = "KOS PHARMACEUTICALS INC" and targetass = "695573". There are more variables not in the dataex snippet from dataset 2) and 3) (such as number of forward citations, date, etc) but I left those out for simplicity sake.
I generated an id_deal, and also a unique ID for each acquiror (id_acquiror) as well as an id for the target id_target. I wasn't sure how to proceed from here.
Any help would be greatly appreciated!
I currently have 3 data sets.
1) Deal level data on M&A's.
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input float(id_deal id_acquiror) str30 acquirorname_compu long acquirorass float id_target str46 targetname long targetass 1 1 "ABBOTT LABORATORIES" 735 1 "KOS PHARMACEUTICALS INC" 695573 2 1 "ABBOTT LABORATORIES" 735 2 "Solvay Pharmaceuticals SA" 734307 3 3 "ASTRAZENECA PLC" 770646 3 "Novexel SA" 896419 4 3 "ASTRAZENECA PLC" 770646 4 "KuDOS Pharmaceuticals" 783835 5 3 "ASTRAZENECA PLC" 770646 5 "Cambridge Antibody Tech Grp" 739243 6 3 "ASTRAZENECA PLC" 770646 6 "NPS PHARMACEUTICALS INC" 719353 7 3 "ASTRAZENECA PLC" 770646 7 "MEDIMMUNE INC" 773774 8 3 "ASTRAZENECA PLC" 770646 8 "Arrow Therapeutics Ltd" 801361 9 9 "Abraxis BioScience Inc" 918063 9 "Shimoda Biotech (Pty) Ltd" 862380 10 10 "Actavis Group hf" 933768 10 "Zhejiang Chiral Medicine Chem" 835576 11 10 "Actavis Group hf" 933768 11 "Sindan" 1026564 12 12 "Aeterna Laboratories Inc" 729290 12 "Zentaris AG" 791707 13 13 "Akela Pharma Inc" 1033688 13 "Nventa Biopharmaceuticals Corp" 893508 14 14 "Allergan Inc" 19170 14 "EndoArt SA" 856456 15 15 "Alliance Pharmaceutical Corp" 19355 15 "Molecular Biosystems Inc" 382425 16 16 "American Pharm Partners Inc" 823833 16 "American BioScience Inc" 762274 17 17 "Amgen Inc" 651581 17 "Abgenix Inc" 755139 18 17 "Amgen Inc" 651581 18 "TULARIK INC" 714795 19 17 "Amgen Inc" 651581 19 "Alantos Pharm Hldg Inc" 909253 20 17 "Amgen Inc" 651581 20 "IMMUNEX CORP" 267870 end
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input double patent float asscode str30 targetname 5714160 22715 "ALZA CORP" 5997527 22715 "" 6465008 22715 "" 7390780 22715 "" 6276512 22715 "ALZA CORP" 6153678 22715 "ALZA CORP" 6195582 22715 "" 6283953 22715 "" 7338663 22715 "" 5698119 22715 "ALZA CORP" 6975902 22715 "" 6749575 22715 "" 8383149 22715 "" 6066619 22715 "" 6508808 22715 "ALZA CORP" 6057374 22715 "ALZA CORP" 6764697 22715 "ALZA CORP" 5985324 22715 "ALZA CORP" 6224908 22715 "ALZA CORP" 6974589 22715 "" end
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input double patent float asscode str30 acquirorname_compu 7365193 735 "" 6693136 735 "ABBOTT LABORATORIES" 8213001 735 "ABBOTT LABORATORIES" 6207380 735 "ABBOTT LABORATORIES" 7037927 735 "ABBOTT LABORATORIES" 7897332 735 "" 8071145 735 "" 7108710 735 "ABBOTT LABORATORIES" 6579882 735 "ABBOTT LABORATORIES" 7014864 735 "" 8455212 735 "" 8338466 735 "" 5883236 735 "ABBOTT LABORATORIES" 6379671 735 "" 6093179 735 "ABBOTT LABORATORIES" 6521790 735 "ABBOTT LABORATORIES" 6596302 735 "ABBOTT LABORATORIES" 8137618 735 "ABBOTT LABORATORIES" 8480691 735 "" 6437106 735 "" end
The datasets 2 and 3 are just snippets of the patent data. Each target and acquiring firm have a compustat name and unique assignee code (asscode). I guess I have to do a merge twice, once by ass code, and once by name, then I want to remove the duplicates.
My aim is to have the M&A deal dataset (1) with all the patents of the acquiring and target firm (after that I plan to exploit the duplicates to see the overlaps). For example, all the patents corresponding to acquirorname = "ABBOT LABORATORIES" as well as all the patents acquirorass = 735 from dataset 3), as well as all the patents from dataset 2) corresponding to targetname = "KOS PHARMACEUTICALS INC" and targetass = "695573". There are more variables not in the dataex snippet from dataset 2) and 3) (such as number of forward citations, date, etc) but I left those out for simplicity sake.
I generated an id_deal, and also a unique ID for each acquiror (id_acquiror) as well as an id for the target id_target. I wasn't sure how to proceed from here.
Any help would be greatly appreciated!
Comment