Hello dear forum members,
I am estimating a panel model with fixed effects using yearly data from 2009 to 2013 (N=1848). Whereas 80% of the data encounter less than 5% of missing values, the remaining 20% encounter 75% of the missing values. These "problematic" 20% of the data are review-based rating scores, which (due to their nature) are not given every year to every entity in the data set. For example, an entity may be reviewed (and given a rating) say once or twice in 5 years (and not necessarily in consecutive years), thus resulting in missing values.
-xtnbreg, fe- estimation results in a total N=751 with 2,049 total observations, i.e., min observations per group 2, max = 5, and avg = 2.7
Does this relatively low average number of observations per group (i.e., 2.7 out of 5) bias the estimates substantially? Or poses any other problems?
Thank you in advance,
Anton
I am estimating a panel model with fixed effects using yearly data from 2009 to 2013 (N=1848). Whereas 80% of the data encounter less than 5% of missing values, the remaining 20% encounter 75% of the missing values. These "problematic" 20% of the data are review-based rating scores, which (due to their nature) are not given every year to every entity in the data set. For example, an entity may be reviewed (and given a rating) say once or twice in 5 years (and not necessarily in consecutive years), thus resulting in missing values.
-xtnbreg, fe- estimation results in a total N=751 with 2,049 total observations, i.e., min observations per group 2, max = 5, and avg = 2.7
Does this relatively low average number of observations per group (i.e., 2.7 out of 5) bias the estimates substantially? Or poses any other problems?
Thank you in advance,
Anton
Comment