Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to check correlation between fixed effects and regressors/interaction terms

    Dear community,

    I want to estimate a ppml model, where I regress FDI per country pair, year and sector on the traditional gravity variables (log of origin GDP, log of destination GDP, log of bilateral distance) as well as on the interaction terms between those traditional gravity variables and the 25 sector dummies. As I get the warning that my variance matrix is nonsymmetric or highly singular, I want to check the correlation between my regressors and the fixed effects I include (year, origin-country*sector, destination-country*sector FE) as well as between the interaction terms and the FE. I'm just not sure how to do this correctly.

    How would I have to code the fixed effects? At the moment, the variable for the origin-country*sector FE ("country_origin_sector") takes on different encoded strings, e.g. "AGO52" if the observation is with Angola as origin country and sector 52. I could alternatively generate dummies (one for each origin-country-sector combination, so 668 in total, which is =1 if the observation is with AGO as origin country and in sector 52). Then, I would calculate the correlation between each of those 668 dummies with one of the traditional gravity regressors, e.g. the lngdp of origin country. Is this the way to go or should I stick to the encoded string version of the FE?

    Also for the interaction terms (coded with one dummy for each sector*regressor combination), I would calculate the correlation for each of the interaction term dummies with each of the FE dummies. I'm however not sure if this is the correct way or if I should keep the FE coded as encoded strings ("AGO52", "BRA22", "DEU22" etc.).

    I tried both ways but get different results so I'm not sure which one is the correct way.

    I appreciate any help on this.

    Best
    Noemi

  • #2
    Noemi:
    the usual way to plug categorical predictors in the right-hand side of yiour regression equation is -fvvarlist- notation.
    That said, you seem to have (too) many fixed effects and interactions between them. Are you sure that you'll be able to explain, ceteris paribus, their contribution to variation in the regressand?
    Kind regards,
    Carlo
    (Stata 19.0)

    Comment


    • #3
      Dear Carlo Lazzaro ,

      thank you very much for your response. Can you tell me how I can determine whether I have too many regressors and fixed effects? The interactions are not between fixed effects, but between all of the traditional gravity regressors and the sector dummies.

      Best
      Noemi

      Comment


      • #4
        Noemi:
        if your interactions follow the literature, this is obviously OK.
        My concern was about, say, -origin-country*sector- that seems to me an interaction between two time-invariant predictors.
        Kind regards,
        Carlo
        (Stata 19.0)

        Comment


        • #5
          Dear Carlo Lazzaro

          thank you! By origin-country*sector I refer to fixed effects for each unique combination of origin country and sector, say e.g. the group of observations of FDI in the manufacturing sector coming from Angola as the origin country of investment. I thought it would make sense to control for unobservable, time-invariant characteristics per origin country and sector, hence characteristics of the Angolan firms in the manufacturing industry which could affect their outward FDI activities. So, it is not really an interaction, but rather the combination of both variables (origin country and sector). The asterisk I used was confusing.

          Comment


          • #6
            Noemi:
            thanks for clarifying.
            Kind regards,
            Carlo
            (Stata 19.0)

            Comment

            Working...
            X