Dear Statalist,
I am using the ESWB database. Here is an example of my data and my variables of interest:
I am trying to estimate the next equation using logistic regression. The code is like this:
logit fin11 n_outcome lcar1 k7 b8 exporter ownership location sector i.size i.k9
In one paper that I am using as a reference [Ranasighe & Restuccia (2018), Table 2, page 253], they include region, country and time controls using dummy variables. That is,
logit fin11 n_outcome lcar1 k7 b8 exporter ownership location sector i.size i.k9 i.country i.region i.year
All year dummy variables are omitted and 5 country dummies over 60 too due to a collinearity problem. I found that those countries that stata ommits, are one per region. However, I cannot understand why there is a multicollinearity problem, and why in that paper they ignore that problem.
Thanks in advanced.
I am using the ESWB database. Here is an example of my data and my variables of interest:
Code:
* Example generated by -dataex-. For more info, type help dataex clear input str26 country_01 double year byte region double idstd float(fin11 lcar1) byte(k7 b8) float(exporter ownership location) long sector double size byte k9 "Argentina2017" 2017 4 622836 1 4.1431346 1 1 0 0 1 1 3 1 "Argentina2017" 2017 4 622402 0 2.995732 1 1 0 0 1 1 2 2 "Armenia2020" 2020 1 708684 0 2.833213 1 . 0 0 1 2 2 1 "Armenia2020" 2020 1 709091 . 2.772589 0 0 0 0 1 2 1 . "Armenia2020" 2020 1 709039 . 2.1972246 0 0 0 0 0 1 1 . "Bangladesh2013" 2013 6 532106 . 3.135494 0 1 0 0 1 2 3 . "Bangladesh2013" 2013 6 531680 . 2.564949 0 0 0 0 1 1 2 1 "Bangladesh2013" 2013 6 531475 1 3.367296 1 0 0 0 1 2 1 1 "Bangladesh2013" 2013 6 531456 1 3.73767 1 0 0 0 1 1 2 1 "Bangladesh2013" 2013 6 532327 1 3.295837 0 1 0 0 0 1 3 2 "Bangladesh2013" 2013 6 532108 . 3.7612 1 0 0 0 1 2 3 . "Belarus2018" 2018 1 650533 1 2.397895 0 0 0 0 0 1 1 2 "Belgium2020" 2020 1 712861 . 3.6635616 0 0 1 1 1 2 1 . "Bulgaria2019" 2019 1 679663 1 3.3322046 1 1 0 0 0 1 1 1 "Chile2010" 2010 4 495884 1 3.4011974 1 0 0 0 1 1 2 1 "Chile2010" 2010 4 496523 1 3.367296 1 1 0 0 0 1 2 2 "China2012" 2012 3 523867 . 2.564949 0 1 0 0 1 2 3 . "China2012" 2012 3 523997 . 2.6390574 0 0 0 0 1 2 1 . "China2012" 2012 3 522465 1 2.772589 0 1 0 0 1 1 2 2 "China2012" 2012 3 522703 . 2.397895 1 0 0 0 1 1 2 . "China2012" 2012 3 522528 . 2.944439 0 1 0 0 1 1 3 . "China2012" 2012 3 523751 . 2.1972246 0 0 0 0 1 2 1 . "China2012" 2012 3 523976 . 2.772589 0 0 1 0 1 2 1 . "China2012" 2012 3 522675 . 2.995732 1 1 1 1 1 1 3 2 "Colombia2017" 2017 4 625879 0 2.564949 0 1 0 1 1 2 1 1 "Colombia2017" 2017 4 626322 0 3.465736 1 1 0 0 1 1 1 1 "Costarica2010" 2010 4 501907 1 3.713572 1 0 0 0 1 2 1 2 "Costarica2010" 2010 4 502095 1 3.0445225 0 0 0 0 1 2 2 1 "Czech Republic2019" 2019 1 678733 . 3.3322046 1 1 0 0 0 1 2 . "Czech Republic2019" 2019 1 678750 0 3.367296 1 1 0 0 0 1 2 1 "Czech Republic2019" 2019 1 678819 1 3.295837 1 1 0 0 0 1 3 1 "Czech Republic2019" 2019 1 678790 . 3.367296 0 1 0 0 0 1 2 . "Denmark2020" 2020 1 721422 1 4.5849676 1 1 1 1 0 1 3 1 "Denmark2020" 2020 1 720880 . 2.944439 1 1 1 0 0 1 2 . "Egypt2020" 2020 5 704812 . 1.94591 0 0 0 0 0 2 1 . "Egypt2020" 2020 5 706085 . 2.0794415 0 0 0 0 1 2 1 . "Egypt2020" 2020 5 704282 . 3.135494 0 1 0 0 1 1 3 . "Egypt2020" 2020 5 706221 . 3.433987 0 0 0 0 1 2 2 . "Egypt2020" 2020 5 706325 . 1.7917595 0 0 0 0 1 2 1 . "Egypt2020" 2020 5 703814 . 3.295837 1 0 1 0 1 1 3 . "Egypt2020" 2020 5 704644 . 2.484907 1 0 0 0 1 1 2 . "ElSalvador2016" 2016 4 605851 . 3.433987 0 0 0 0 1 2 1 . "Ethiopia2015" 2015 2 590852 0 .6931472 0 0 0 0 0 1 1 3 "Ethiopia2015" 2015 2 590495 1 2.1972246 1 0 0 0 1 2 2 1 "Finland2020" 2020 1 722038 1 4.7004805 1 1 0 0 0 2 2 1 "Finland2020" 2020 1 722084 . 3.89182 1 1 1 0 0 1 3 . "Finland2020" 2020 1 722262 1 2.0794415 0 0 1 0 0 1 1 1 "Georgia2019" 2019 1 669347 . 1.3862944 0 . 1 0 0 2 1 . "Georgia2019" 2019 1 669739 . 2.772589 0 . 1 0 1 2 2 . "Georgia2019" 2019 1 669267 . 2.833213 0 0 0 0 1 2 1 . "Georgia2019" 2019 1 669631 . 1.609438 0 0 0 0 1 2 2 . "Ghana2013" 2013 2 557991 . 2.3025851 1 0 0 1 0 2 1 . "Greece2018" 2018 1 649544 1 3.0445225 1 1 0 0 0 1 3 1 "Greece2018" 2018 1 649550 1 3.433987 1 . 1 0 0 2 3 1 "Greece2018" 2018 1 649618 1 2.0794415 1 1 0 0 0 1 2 1 "Greece2018" 2018 1 649339 . 2.1972246 0 1 0 0 0 1 1 . "Hungary2019" 2019 1 680257 . 2.484907 0 0 0 0 0 2 1 . "Hungary2019" 2019 1 680544 0 2.6390574 1 0 1 0 0 2 1 1 "Hungary2019" 2019 1 680471 . 2.772589 0 . 0 0 0 1 1 . "India2014" 2014 6 566685 1 1.7917595 0 0 0 0 0 1 2 2 "India2014" 2014 6 571165 . 1.609438 1 0 0 0 1 1 2 . "India2014" 2014 6 565548 1 1.7917595 1 1 0 0 1 1 3 2 "India2014" 2014 6 563360 . 3.3322046 0 0 0 0 1 2 2 . "India2014" 2014 6 571206 1 2.833213 1 1 0 1 1 1 3 2 "India2014" 2014 6 570433 . 2.70805 1 1 0 0 0 1 3 . "India2014" 2014 6 565082 1 2.1972246 0 1 0 0 1 1 1 2 "India2014" 2014 6 570847 1 3.5263605 1 1 1 0 0 1 3 1 "India2014" 2014 6 570171 1 1.3862944 1 0 0 0 1 1 1 2 "India2014" 2014 6 567720 . 3.295837 1 0 0 0 1 1 2 . "India2014" 2014 6 567048 . 3.2580965 0 1 0 0 1 1 2 . "India2014" 2014 6 570465 . 2.564949 1 1 0 0 0 1 2 . "India2014" 2014 6 568138 . 2.3025851 1 0 0 0 0 1 1 . "India2014" 2014 6 570509 . 3.0910425 1 1 0 0 0 1 2 . "India2014" 2014 6 566869 1 2.70805 1 0 0 0 0 1 1 2 "India2014" 2014 6 571434 1 2.833213 1 1 0 0 1 1 2 2 "India2014" 2014 6 569114 . 2.833213 1 0 0 0 0 1 2 . "India2014" 2014 6 570345 . 2.70805 1 1 0 0 0 1 3 . "India2014" 2014 6 567482 . 2.833213 0 . 0 0 0 1 2 . "India2014" 2014 6 565961 1 3.912023 1 1 0 0 0 1 2 2 "India2014" 2014 6 566068 . 3.2580965 1 1 0 0 0 1 2 . "India2014" 2014 6 565400 . 3.583519 0 1 0 0 0 1 2 . "India2014" 2014 6 571318 1 2.6390574 1 0 0 0 0 1 2 2 "India2014" 2014 6 563270 . 1.609438 1 1 0 0 0 2 3 . "India2014" 2014 6 568961 . 1.609438 1 0 0 0 0 1 2 . "Indonesia2015" 2015 3 591671 . 2.772589 0 0 0 0 1 2 1 . "Indonesia2015" 2015 3 591876 . 2.564949 0 0 0 0 1 2 1 . "Indonesia2015" 2015 3 591284 1 3.8286414 0 0 0 0 1 1 3 2 "Indonesia2015" 2015 3 592396 . 2.70805 0 0 0 0 1 1 3 . "Indonesia2015" 2015 3 592454 . 2.944439 0 1 0 0 1 1 2 . "Indonesia2015" 2015 3 591237 . 2.772589 0 0 0 0 0 1 3 . "Indonesia2015" 2015 3 591314 1 2.6390574 0 0 0 0 1 1 1 2 "Iraq2011" 2011 5 512422 . 1.609438 0 0 0 0 1 1 1 . "Iraq2011" 2011 5 512689 . 1.609438 0 0 0 0 . 1 1 . "Italy2019" 2019 1 659203 . 3.912023 1 1 0 0 1 1 3 . "Italy2019" 2019 1 659189 . 4.304065 1 1 1 0 0 1 3 . "Jordan2019" 2019 5 662153 . 3.218876 0 0 0 0 . 2 1 . "Kazakhstan2019" 2019 1 663347 1 1.609438 0 0 0 0 1 1 2 1 "Kazakhstan2019" 2019 1 664042 . 1.7917595 0 0 0 0 1 1 1 . "Kazakhstan2019" 2019 1 663650 . . 0 0 0 . 0 1 2 . "Kazakhstan2019" 2019 1 664259 . 1.94591 0 0 0 0 0 2 2 . end label values region region label def region 1 "ECA", modify label def region 2 "AFR", modify label def region 3 "EAP", modify label def region 4 "LAC", modify label def region 5 "MNA", modify label def region 6 "SAR", modify label values k7 k7 label def k7 0 "No", modify label def k7 1 "Yes", modify label values b8 b8 label def b8 0 "No", modify label def b8 1 "Yes", modify label values ownership ownership label def ownership 0 "Domestic", modify label def ownership 1 "Foreign", modify label values sector sector label def sector 1 "Manufacturing", modify label def sector 2 "Services", modify label values size size label def size 1 "Small(<20)", modify label def size 2 "Medium(20-99)", modify label def size 3 "Large(100 And Over)", modify label values k9 K9 label def K9 1 "Private commercial banks", modify label def K9 2 "State-owned banks or government agency", modify label def K9 3 "Non-bank financial institutions", modify
logit fin11 n_outcome lcar1 k7 b8 exporter ownership location sector i.size i.k9
In one paper that I am using as a reference [Ranasighe & Restuccia (2018), Table 2, page 253], they include region, country and time controls using dummy variables. That is,
logit fin11 n_outcome lcar1 k7 b8 exporter ownership location sector i.size i.k9 i.country i.region i.year
All year dummy variables are omitted and 5 country dummies over 60 too due to a collinearity problem. I found that those countries that stata ommits, are one per region. However, I cannot understand why there is a multicollinearity problem, and why in that paper they ignore that problem.
Thanks in advanced.

Comment