Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Merging two datasets based on firm name

    Dear reader,

    I am very new to stata, and this is my first post!

    My master files consists of 1500 companies from the Russell 3000 index. The dataset that should be matched consists of all the companies that have been involved in an alliance between 2015 - 2020 (more firms than in the master file)
    I only want to match the companies that are in the master file, how can i do this?

    master file
    [CODE]
    * Example generated by -dataex-. For more info, type help dataex
    clear
    input str46 firm_name
    "firm_name"
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3D Systems Corp."
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"
    "3M Corporation"



    * Example generated by -dataex-. For more info, type help dataex
    clear
    input str30 firm_name float n
    "01 Sys" 1
    "1 Ltd" 1
    "1&1 Ionos SE" 1
    "1-800-Flowers.com Inc" 2
    "10-4 Systems Inc" 1
    "1000watt LLC" 1
    "100tv.com" 1
    "10art-ni Corp" 1
    "10zig Technology Inc" 2
    "123ID Inc" 2
    "128 Technology Inc" 1
    "170 Systems Inc" 1
    "1776" 1
    "180bytwo LLC" 1
    "1901 Group LLC" 1
    "1933 Industries Inc" 1
    "1EDISource Inc" 1
    "1LINK (Guarantee) Ltd" 1
    "1TouchSoftware Solutions Inc" 1
    "1Verge Info Tech (Beijing) Co" 1
    "1WorldSync Holdings Inc" 1
    "1nce GmbH" 1
    "1oT OU" 1
    "1seo.Com" 1
    "1st Group Ltd" 1
    "1stpoint Communications LLC" 1
    "1world Online Inc" 1
    "20-20 Technologies Inc" 1
    "2020 Advisors LLC" 1
    "2021.ai ApS" 2
    "21LADY Co Ltd" 1
    "21Vianet Group Inc" 3
    "21st Century Fox Inc" 1
    "21st Century Technologies Ltd" 2
    "24/7 Customer" 1
    "24/7 Customer Inc" 1
    "24/7 Real Media Inc" 2
    "24by7sec Inc" 1
    "2618249 Ontario Corp" 6
    "2B Wireless Inc" 1
    "2C Media" 1
    "2CRSI SA" 2
    "2Hz Inc" 1
    "2bPrecise LLC" 1
    "2bcreative entertainment" 1
    "2ergo Group PLC" 1
    "360buy Jingdong Mall" 1
    "360factors Inc" 1
    "3CInteractive Corp" 1
    "3Com Corp" 1

  • #2
    See options in merge.

    Code:
    help merge

    To keep observations in master dataset and matches, e.g., in a many-to-one merge

    Code:
    merge 1:m firm_name using "using dataset", keep(master match)
    To keep only matches

    Code:
    merge 1:m firm_name using "using dataset", keep(match)

    Comment


    • #3
      This code will work, but in my opinion, merging on strings can be a risky business. Be sure the company names are consistent down to the spaces.


      And thank you for using dataex on your first post. Welcome to Statalist.

      Comment


      • #4
        Thank you for the responses and warm welcome! I keep getting a r (459) error. Any idea who I can solve this? Furthermore, the names do not match completely (e.g. master data = 3D Systems Corp. and using data = 3D Systems Corp --> no ".")

        Comment


        • #5
          Originally posted by Sanne Bergh View Post
          Furthermore, the names do not match completely (e.g. master data = 3D Systems Corp. and using data = 3D Systems Corp --> no ".")

          merge is for perfect matches. If the only issues are capitalizations and periods, then those are easy to fix. However, if you have different word placements, abbreviations and omissions, then you have a project on your hands. You need to look into fuzzy matching. There are quite a few threads in the forum that illustrate this, just search with the keywords "fuzzy matching". Here are a couple:

          https://www.statalist.org/forums/for...s-of-companies
          https://www.statalist.org/forums/for...order-to-group

          Comment


          • #6
            Thank you!! will dive into it

            Comment


            • #7
              Here is a start. You need to follow the code step-by-step so as to make any necessary changes, e.g., in terms of choosing a cutoff for the fuzzy matching command matchit (from SSC). Issues can arise if other variables in the using dataset are not consistent across imperfect name matches. You will have to verify this as you go. I have rigged your data example in #1 to create some imperfect matches. The color coding shows matches across the datasets.

              Code:
              * Example generated by -dataex-. For more info, type help dataex
              clear
              input str30 firm_name float capitalization
              "01 Sys" 1000
              "1 Ltd" 2000
              "1&1 Ionos SE" 3000
              "128 Technology Inc" 4000
              "3D Sys. Co." 5000
              "1WorldSync Holdings Inc" 6000
              "1nce GmbH" 200
              "1oT OU" 300
              "3D Systems LLC" 5000
              "1seo.Com" 500
              "1st Group Ltd" 1234
              "1stpoint Communications LLC" 5678
              "1world Online Inc" 9811
              "3m Corp." 4000
              "24by7sec Inc" 11000
              "2618249 Ontario Corp" 6000
              "2B Wireless Inc" 1234
              "2C Media" 1000
              "2CRSI SA" 2000
              "3CInteractive Corp" 1000
              "3Com Corp" 4000
              end
              
              tempfile usingfile1 usingfile2
              save `usingfile1'
              contract firm_name
              gen firmname2 = firm_name
              keep firm*
              save `usingfile2'
              
              * Example generated by -dataex-. For more info, type help dataex
              clear
              input str46 firm_name
              "3D Systems Corp."
              "3D Systems Corp."
              "3D Systems Corp."
              "3D Systems Corp."
              "3D Systems Corp."
              "3D Systems Corp."
              "2C media LLC."
              "3M Corporation"
              "3M Corporation"
              "3M Corporation"
              "3M Corporation"
              "3M Corporation"
              "3M Corporation"
              end
              
              tempfile master
              save `master'
              contract firm_name
              keep firm_name
              cross using `usingfile2'
              *REMOVE ASTERISK TO INSTALL
              *ssc install matchit, replace
              *ssc install freqindex, replace
              matchit firm_name firmname2, g(score)
              *BROWSE THE DATA TO DETERMINE A GOOD CUTOFF
              keep if score>0.45
              tempfile matches
              rename (firm_name firmname2) (firmname2 firm_name)
              bys firm_name (score): keep if _n==_N
              save `matches'
              
              *GO BACK AND MERGE WITH USING DATASET
              use `usingfile1', clear
              merge 1:m firm_name using `matches', nogen
              gen oldnames= firm_name if !missing(firmname2)
              
              *REPLACE FIRM_NAMES WITH MATCHES
              replace firm_name= firmname2 if !missing(firmname2)
              drop firmname2 score
              ds oldnames, not
              bys `r(varlist)': keep if _n==1
              save `usingfile1', replace
              
              *NOW MERGE WITH MASTER FILE
              use `master', clear
              merge m:1 firm_name using `usingfile1', keep(master match) nogen
              sort firm_name
              l, sepby(firm_name)
              Res.:

              Code:
              . l, sepby(firm_name)
              
                   +----------------------------------------------+
                   |        firm_name   capita~n         oldnames |
                   |----------------------------------------------|
                1. |    2C media LLC.       1000         2C Media |
                   |----------------------------------------------|
                2. | 3D Systems Corp.       5000   3D Systems LLC |
                3. | 3D Systems Corp.       5000   3D Systems LLC |
                4. | 3D Systems Corp.       5000   3D Systems LLC |
                5. | 3D Systems Corp.       5000   3D Systems LLC |
                6. | 3D Systems Corp.       5000   3D Systems LLC |
                7. | 3D Systems Corp.       5000   3D Systems LLC |
                   |----------------------------------------------|
                8. |   3M Corporation       4000        3Com Corp |
                9. |   3M Corporation       4000        3Com Corp |
               10. |   3M Corporation       4000        3Com Corp |
               11. |   3M Corporation       4000        3Com Corp |
               12. |   3M Corporation       4000        3Com Corp |
               13. |   3M Corporation       4000        3Com Corp |
                   +----------------------------------------------+

              Comment


              • #8
                Wow, just wow! Thank you so much!

                Comment


                • #9
                  Hi Andrew, I have one last question about the code. When and why do you use e.g. 5000 and when 1234 behind every variable?

                  Comment


                  • #10
                    In your original dataset, you have the variable "n" in the using dataset that mostly takes on the values 1 or 2

                    * Example generated by -dataex-. For more info, type help dataex
                    clear
                    input str30 firm_name float n
                    "01 Sys" 1
                    "1 Ltd" 1
                    "1&1 Ionos SE" 1
                    "1-800-Flowers.com Inc" 2
                    Instead of this variable, my simulated dataset has a variable called "capitalization". So there is no particular logic to the numbers beyond that they should differ across firms. The idea is that you want to merge the master dataset which has firm names with the using dataset which has imperfect firm names to obtain information from other variables in the using dataset, in my simulated example, the market capitalization of the firm. I started off by stating


                    Issues can arise if other variables in the using dataset are not consistent across imperfect name matches.

                    What I meant is that if a firm has different name variations in the using dataset, then the values in all other variables should be consistent (the same). An example is


                    * Example generated by -dataex-. For more info, type help dataex
                    clear
                    input str30 firm_name float capitalization
                    "01 Sys" 1000
                    "1 Ltd" 2000
                    "1&1 Ionos SE" 3000
                    "128 Technology Inc" 4000
                    "3D Sys. Co." 5000
                    "1WorldSync Holdings Inc" 6000
                    "1nce GmbH" 200
                    "1oT OU" 300
                    "3D Systems LLC" 5000
                    ..
                    ..

                    Because "3D Sys. Co." and "3D Systems LLC" are assumed to be the same firm, we expect that their values of capitalization should be the same (which is the case in this instance). If they are different, we either are not correct in our assumption that they are the same firm or we have inconsistencies in our data.

                    Comment


                    • #11
                      Thank you for your fast reply. To summarize, I can choose every random number behind the firm name (capitalization) as long as they differ across firms. However in your post on 6ht of may
                      "01 Sys" 1000 and "3CInteractive Corp" 1000 both have "1000" behind the firm name. Why is that?

                      Comment


                      • #12
                        That is just coincidental, e.g., if we reported weights of firms in the S&P 500 correct to 2 dec. places, UnitedHealth and Meta Platforms will have the same value of 1.35.
                        Click image for larger version

Name:	Capture.PNG
Views:	1
Size:	96.9 KB
ID:	1663800




                        To summarize, I can choose every random number behind the firm name (capitalization) as long as they differ across firms.
                        Yes, but note that this variable is just for illustrative purposes. You should not be generating any variable in your own dataset. What other variables do you have in the using dataset apart from firm name?
                        Last edited by Andrew Musau; 10 May 2022, 05:59.

                        Comment


                        • #13
                          in the master data set there is a lot of data on the firm, industry and CEO in the data that needs to be merged there is only data on the company name and how many alliance they had. Would it be possible to set up a call?

                          Comment


                          • #14
                            I would recommend posting a sample of the actual datasets here. If you want consulting services, it will have to be outside work times (weekdays 9:00-17:00 CET) and for a fee. You can email me for details.

                            Comment


                            • #15
                              THen I will try to formulate my question so inclusive as possible!
                              To be clear: my question is I need to merge all_alliance_2005_2020 with dataset_complete. Because I need to see how many alliances the firms in the dataset_complete have.
                              There is only one identifyer and that is firm name, the firm names do not match in both datasets.

                              master dataset. Besides these variables there are around 30 more (but that seemed a bit too much too inculde in the snippet)
                              Code:
                              * Example generated by -dataex-. For more info, type help dataex
                              clear
                              input str7 id_firm str46 firm_name str40 industry str16 sector str4 year str29 AC str7 ceo_sex
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2005" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2006" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2007" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2008" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2009" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2010" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2011" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2012" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2013" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2014" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2015" "Abraham N. Reichental" "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2016" "Vyomesh I. Joshi"      "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2017" "Vyomesh I. Joshi"      "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2018" "Vyomesh I. Joshi"      "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2019" "Vyomesh I. Joshi"      "1"
                              "1"    "3D Systems Corp."                 "Application Software"                  "Technology"       "2020" "Jeffrey A. Graves"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2005" "Michael N. Coppola"    "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2006" "Michael N. Coppola"    "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2007" "John C. Brouillard"    "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2008" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2009" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2010" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2011" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2012" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2013" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2014" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2015" "Darren R. Jackson"     "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2016" "Tom Greco"             "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2017" "Tom Greco"             "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2018" "Tom Greco"             "1"
                              "10"   "Advance Auto Parts Inc."          "Auto Parts Stores"                     "Consumer Goods"   "2019" "Tom Greco"             "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2005" "Robert J. Khoury"      "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2006" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2007" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2008" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2009" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2010" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2011" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2012" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2013" "Amin J. Khoury"        "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2014" "Werner Lieberherr"     "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2015" "Werner Lieberherr"     "1"
                              "100"  "BE Aerospace Inc."                "Aerospace/Defense Products & Services" "Industrial Goods" "2016" "Werner Lieberherr"     "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2013" "Shawn A. Jenkins"      "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2014" "Shawn A. Jenkins"      "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2015" "Shawn A. Jenkins"      "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2016" "Shawn A. Jenkins"      "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2017" "Shawn A. Jenkins"      "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2018" "Raymond A. August"     "1"
                              "1000" "Benefitfocus Inc"                 "Application Software"                  "Technology"       "2019" "Raymond A. August"     "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2005" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2006" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2007" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2008" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2009" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2010" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2011" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2012" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2013" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2014" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2015" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2016" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2017" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2018" "Michael P. Daly"       "1"
                              "1001" "Berkshire Hills Bancorp, Inc."    "Savings & Loans"                       "Financial"        "2019" "Richard M Moratta"     "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2012" "Jonathan D. Rich"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2013" "Jonathan D. Rich"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2014" "Jonathan D. Rich"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2015" "Jonathan D. Rich"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2016" "Jonathan D. Rich"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2017" "Thomas E. Salmon"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2018" "Thomas E. Salmon"      "1"
                              "1002" "Berry Global Group Inc"           "Packaging & Containers"                "Consumer Goods"   "2019" "Thomas E. Salmon"      "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2005" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2006" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2007" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2008" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2009" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2010" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2011" "Howard W. Lutinck"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2012" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2013" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2014" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2015" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2016" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2017" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2018" "Howard W. Lutnick"     "1"
                              "1003" "BGC Partners, Inc."               "Investment Brokerage - National"       "Financial"        "2019" "Howard W. Lutnick"     "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2005" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2006" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2007" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2008" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2009" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2010" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2011" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2012" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2013" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2014" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2015" "Steven G. Miller"      "1"
                              "1004" "Big 5 Sporting Goods Corporation" "Sporting Goods Stores"                 "Services"         "2016" "Steven G. Miller"      "1"
                              end
                              ------------------ copy up to and including the previous line ------------------

                              Listed 100 out of 21232 observations


                              Below the snippet for the allliance dataset


                              input str30 firm_name float alliance_count
                              "01 Sys" 1
                              "1 Ltd" 1
                              "1&1 Ionos SE" 1
                              "1-800-Flowers.com Inc" 2
                              "10-4 Systems Inc" 1
                              "1000watt LLC" 1
                              "100tv.com" 1
                              "10art-ni Corp" 1
                              "10zig Technology Inc" 2
                              "123ID Inc" 2
                              "128 Technology Inc" 1
                              "170 Systems Inc" 1
                              "1776" 1
                              "180bytwo LLC" 1
                              "1901 Group LLC" 1
                              "1933 Industries Inc" 1
                              "1EDISource Inc" 1
                              "1LINK (Guarantee) Ltd" 1
                              "1TouchSoftware Solutions Inc" 1
                              "1Verge Info Tech (Beijing) Co" 1
                              "1WorldSync Holdings Inc" 1
                              "1nce GmbH" 1
                              "1oT OU" 1
                              "1seo.Com" 1
                              "1st Group Ltd" 1
                              "1stpoint Communications LLC" 1
                              "1world Online Inc" 1
                              "20-20 Technologies Inc" 1
                              "2020 Advisors LLC" 1
                              "2021.ai ApS" 2
                              "21LADY Co Ltd" 1
                              "21Vianet Group Inc" 3
                              "21st Century Fox Inc" 1
                              "21st Century Technologies Ltd" 2
                              "24/7 Customer" 1
                              "24/7 Customer Inc" 1
                              "24/7 Real Media Inc" 2
                              "24by7sec Inc" 1
                              "2618249 Ontario Corp" 6
                              "2B Wireless Inc" 1
                              "2C Media" 1
                              "2CRSI SA" 2
                              "2Hz Inc" 1
                              "2bPrecise LLC" 1
                              "2bcreative entertainment" 1
                              "2ergo Group PLC" 1
                              "360buy Jingdong Mall" 1
                              "360factors Inc" 1
                              "3CInteractive Corp" 1
                              "3Com Corp" 1
                              "3Com Korea" 2
                              "3D Eye Solutions Inc" 1
                              "3D Printing Industry" 1
                              "3D Results" 1
                              "3D Robotic Inc" 1
                              "3D Systems Corp" 6
                              "3DR Laboratories LLC" 1
                              "3E Co Environmental Ecological" 1
                              "3GTV" 1
                              "3M Co" 3
                              "3P Networks Inc" 1
                              "3RD Ring" 1
                              "3V Transaction Services Ltd" 1
                              "3VR Security Inc" 1
                              "3e Technologies Intl Inc" 1
                              "3eTI" 1
                              "3i Infotech Ltd" 2
                              "3n" 1
                              "41st Parameter Inc" 1
                              "42crunch" 1
                              "482.Solutions" 1
                              "4C Insights Inc" 1
                              "4Home Inc" 1
                              "4INFO Inc" 1
                              "4Mobility SA" 1
                              "4Voice LLC" 1
                              "4th Screen Advertising Ltd" 1
                              "51Job Inc" 2
                              "5LINX Enterprises Inc" 1
                              "6 Over 6 Vision Ltd" 1
                              "631 Success Llc" 1
                              "6788289 Canada Inc" 1
                              "6Wind SA" 2
                              "6fusion USA Inc" 1
                              "7-Eleven Inc" 1
                              "701Search Pte Ltd" 2
                              "777Online" 1
                              "77Agenc Ltd" 1
                              "797738 Ontario Ltd" 1
                              "7Seas Technologies Ltd" 1
                              "7h Hldg" 1
                              "7starlake Co Ltd" 1
                              "80 Acres Farm" 1
                              "888 Holdings PLC" 2
                              "888voip" 1
                              "8digital" 1
                              "8i Holdings Ltd" 1
                              "8x8 Inc" 6
                              "99 Wuxian Ltd" 1
                              "999 Call Center Corp" 1
                              end
                              [/CODE]
                              ------------------ copy up to and including the previous line ------------------

                              Listed 100 out of 18582 observations





                              Comment

                              Working...
                              X