Dear Statalist,
In my research, I am addressing an issue which is called "sibling rivalry".
Sibling rivalry occurs when siblings compete for parental investments.
To account for unobserved family fixed effects common to all siblings, I think that it may be necessary to use a household level fixed effect.
However, my dataset is from a survey conducted in 2017, so the dataset is cross-sectional.
An example of my estimation strategy is the following (since my dataset is from a survey, I use svy: command):
This is the first stage regression where log_birthweight is an endogenous variable and rain_shock is an IV for log_birthweight.
It is ideal for me to add household level fixed effects in this expression, but if I execute the following code,
the error
occurs. Since I understand why this happened (because the number of hhid is very large), I thought about taking a different approach which will be explained below.
I recalled that the professor explained in the econometrics class that cross-sectional data can be handled like panel data.
The professor used the following example.
i: individual, j: school, Y: test score, X: teacher's experience, a: fixed effect, and u: error term.
I thought that this approach could be used in my own setting as well. That is, my idea is using
Is this approach useful in my setting?
Also, if this is true, how should I estimate the 2SLS regression using
command?
I would greatly appreciate any suggestions you might have.
Best,
Kentaro
In my research, I am addressing an issue which is called "sibling rivalry".
Sibling rivalry occurs when siblings compete for parental investments.
To account for unobserved family fixed effects common to all siblings, I think that it may be necessary to use a household level fixed effect.
However, my dataset is from a survey conducted in 2017, so the dataset is cross-sectional.
An example of my estimation strategy is the following (since my dataset is from a survey, I use svy: command):
Code:
svy: reg log_birthweight rain_shock i.race i.gender i.birth_district i.birth_month
It is ideal for me to add household level fixed effects in this expression, but if I execute the following code,
Code:
svy: reg log_birthweight rain_shock i.race i.gender i.birth_district i.birth_month i.hhid
Code:
r(103); too many variables specified
I recalled that the professor explained in the econometrics class that cross-sectional data can be handled like panel data.
The professor used the following example.
HTML Code:
Y_ij = \beta_0 + \beta_1 * X_ij + a_j + u_ij
I thought that this approach could be used in my own setting as well. That is, my idea is using
Code:
xtset sibling_id child_id
Also, if this is true, how should I estimate the 2SLS regression using
Code:
svy:
I would greatly appreciate any suggestions you might have.
Best,
Kentaro
Comment