Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Which weighting to use for analyses at different levels of aggregation

    I run a study (RCT) with subjects in 100 groups of heterogeneous sizes ranging from 2 to 20 individuals. I have data at the group level and at the individual level.

    My main estimation parameter of interest is a treatment effect on a group-level outcome. So my main regressions are run at that level and I want to keep it that way. However, I have some supplementary results I use for interpretation, which are based on individual-level treatment effect regressions.

    The issue is that, when I run individual-level regressions, subjects from larger groups mechanically make up a larger fraction of the sample and "dominate" the estimation. It thus seems appropriate to use weighting to make sure that each group weights the same total, so that results from the group-level regressions and the individual-level regressions are comparable.

    My idea is to run for the group-level data:
    Code:
    reg y_group treatment
    and for the individual level data
    Code:
    bys group_id: inverse_groupsize = 1/_N
    reg y_indiv treatment [pweight=inverse_groupsize], cluster(group_id)
    1) Are pweights the right way to do this in Stata?
    2) Is there anything else I need to worry about?

  • #2
    Addendum: treatment is assigned at the group-level.

    Also: I cross-posted that question at stackexchange: https://stats.stackexchange.com/ques...-of-aggregatio

    Comment


    • #3
      The Stack Exchange answer for J groups with \(n_j\) observations in group j was \(w_j = 1/(J n_j )\) . But the J is unnecessary because multiplying weights by a common factor will leave estimates and standard errors unchanged. Your original proposal was correct.
      Last edited by Steve Samuels; 12 Aug 2018, 12:27.
      Steve Samuels
      Statistical Consulting
      [email protected]

      Stata 14.2

      Comment


      • #4
        Thanks, yes. I saw that. I was overall a bit disappointed by the quality of the stack exchange answers. I know now that what I'm implementing what I wanted to do in the right way in Stata, but I expected more commentary on the conceptual issue.

        Comment

        Working...
        X