Hi all,
I am new to the forum. Recently I am reading some paper, and I am a little bit confused by the sentence that wrote in the paper. Hope that I could get some help, and thanks in advance!
In the paper wrote by Deshpande & Li. (2019), they regard "closing zips" as control groups, and these control groups appears multiple times in the dataset. In page 224, they wrote:
".... Note that our strategy of using future closings as controls for current closings will result in the same zip code appearing multiple times in the data. Clustering at the closing level accounts for the repeated appearance of zip codes since zip codes are fully nested within closings."
Also, in the paper wrote by Fetter & Lockwood(2018), they conduct analysis for counties that border the state, and since some counties border two or more different states, they will appear in the data as many times as there are states that it borders. In page 2187, they also wrote:
".... Since our policy of interest varies at the state level, we cluster standard errors at the state level. This level of clustering also accounts for the duplication of observations in counties lying on multiple state boundaries."
Both of these two papers mentioned that if observations appear multiple times in the dataset, then if we cluster our analysis at this level, then we can account for the duplication problem. Could I bother to ask why it is the case? Thanks!
Reference:
Deshpande, M., & Li, Y. (2019). Who is screened out? Application costs and the targeting of disability programs. American Economic Journal: Economic Policy, 11(4), 213-48.
Fetter, D. K., & Lockwood, L. M. (2018). Government old-age support and labor supply: Evidence from the old age assistance program. American Economic Review, 108(8), 2174-2211.

Comment