Hi all,
I'm analyzing a dataset based on a health-related cross-sectional survey from 15 administrative regions conduced in 2019. The dataset contains variables at two levels of observation: individual and region. I'm trying to analyze how individual's choice of healthcare providers is shaped by both individual economic conditions and regional economic characteristics. In one of the linear probability models I estimated, I inadvertently included both region dummy variables (to account for regional heterogeneity) and regional level covariates. Based on my statistical knowledge, including regional dummies and regional covariates in such a context (there is no within-region variation in the regional covariates) would result in the coefficients of the latter not being estimated. But I was surprised to find that the model produced estimations both for the dummies and regional covariates, except that a number of dummies are omitted in addition to the baseline (I later verified that the number of omitted dummies always equates the number of regional covariates specified). I'm having a hard time understand why this is the case (see the Stata output below). Any help or thought on this is much appreciated. I'm relative new to survey data analysis but is quite familiar with panel data methods. Am I missing something big here?

I'm analyzing a dataset based on a health-related cross-sectional survey from 15 administrative regions conduced in 2019. The dataset contains variables at two levels of observation: individual and region. I'm trying to analyze how individual's choice of healthcare providers is shaped by both individual economic conditions and regional economic characteristics. In one of the linear probability models I estimated, I inadvertently included both region dummy variables (to account for regional heterogeneity) and regional level covariates. Based on my statistical knowledge, including regional dummies and regional covariates in such a context (there is no within-region variation in the regional covariates) would result in the coefficients of the latter not being estimated. But I was surprised to find that the model produced estimations both for the dummies and regional covariates, except that a number of dummies are omitted in addition to the baseline (I later verified that the number of omitted dummies always equates the number of regional covariates specified). I'm having a hard time understand why this is the case (see the Stata output below). Any help or thought on this is much appreciated. I'm relative new to survey data analysis but is quite familiar with panel data methods. Am I missing something big here?
Comment