Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Finding the drivers of within-variation in a panel dataset

    Hi Statalist,

    In panel data models such as
    Code:
    y = Bx_it + error
    we usually decide which model to adopt based on the source of variation in x (between changes across individuals or changes within individuals over time).

    My question is: in the case where the effect is driven by within-variation over time (and so we run a fixed effects model), is there a way to find out what drives this within-variation?

    For example, say y is household bargaining power, and x is a measure of the extent of informality in women's employment. We now observe that reduced informality of employment leads to higher bargaining power within the household. How do we now determine that within-variation over time in informality is driven by factors like higher education or changes in the kind of job that women do?

    Thanks for your time!

  • #2
    Kwasi:
    the first step I would take is comparing -fe- vs -re- specification.
    Assuming that you're dealing with a N>T panel dataset with a continuous regressand, go -xtreg,fe- and -xtreg,re- and compare them via -hausman- or the community-contributed module -xtoverid-.
    If this reply is not helpful, please provide further details. Thanks.
    Kind regards,
    Carlo
    (Stata 19.0)

    Comment


    • #3
      Thanks for your reply, Carlo. I probably did not frame the question as well as I should.
      My goal is not to determine whether to use a fixed effects or random effects model. Rather, I want to know what factors are driving changes in the regressor over time.
      Will running another fixed effects model with the regressor as the regressand be what I am looking for? Or there is way to extract that information from running the original regression?

      Comment


      • #4
        Kwasi: I think you're on the right track with using the regressor as the dependent variable and try to explain it with other variables. The problem is when those other variables maybe should be in the model.

        How about computing the within variance for each family for x. Then, you can related those variances to time-constant factors, such as education (often doesn't change over time). This would be a cross-sectional regression. I just you would use the egen command to compute the family-specific standard deviation in your x variable. Something like this:

        Code:
        egen sd_x = sd(x)
        reg sd_x z1 ... zk if year == 1
        BTW, I think we generally choose fixed effects unless it's too imprecise to work with. The key is whether the x are correlated with heterogeneity. When that's the case, we opt for FE -- even if x doesn't have as much time-variation as we'd like.

        Comment


        • #5
          Thank you so much for your help, Jeff! This looks like just what I need. I really appreciate it.

          Comment

          Working...
          X