Frequency Weights

Alex Lampert

Join Date: Jan 2017

Posts: 27
#1

Frequency Weights

23 Apr 2018, 12:01

Hello,

Has anyone ever used frequency weights for a differences-in-differences regression? I'm obtaining unexpected standard errors for all of the control variables in my models: the standard errors in the weighted model are always larger than the unweighted model, despite the two models using the exact same data (i.e. the only difference between the two models is the frequency weight) I would expect the weighted model standard errors to be lower than the unweighted model due to "multiplied" observations.

I've done summarize on the outcome variable using weights vs. no weights, and it seems no matter how I slice and dice the data, the weighted sums always have lower standard errors (as I expect), but this seems to contradict my DiD model. I was thinking that perhaps the observations being weighted were the observations that deviated the most from the mean.

Does anyone know if I'm missing a key piece of theory that would explain why my weighted SEs are larger than unweighted in the DiD framework?

Thanks!
Alex
Tags: None
Maarten Buis

Join Date: Mar 2014

Posts: 3466
#2

24 Apr 2018, 01:18

This is possible. The simplest way this could happen if your weights contain lots of 0s, so the weighted sample size goes down rather than up. Alternatively, the standard errors for coefficients is a function of a couple of things, not just the sample size. For example, the standard deviation of the residual could go up in a weighted sample.

---------------------------------
Maarten L. Buis
University of Konstanz
Department of history and sociology
box 40
78457 Konstanz
Germany
http://www.maartenbuis.nl
---------------------------------
Comment
Alex Lampert

Join Date: Jan 2017

Posts: 27
#3

24 Apr 2018, 07:07

Thanks Maarten. My weights range between 1 and 5 (all integers), so they essentially act as multipliers or duplicators. The purpose of the weights was to avoid unnecessarily expanding my data and instead storing the duplicates in a frequency variable.

Are you surprised that the weighted means produce lower SEs than unweighted (as I expected, since I only added duplicated information), but the weighted coefficients have higher SEs?
Comment

Announcement

Comment

Comment