Hi,
I am intending to analyze company payout policy determinants and have a set of panel data. My data set is unbalanced due to companies entering and dropping out of the stock exchange I am analyzing. My independent variables consist of financial information for the companies in my data set. The data collected for the companies that are included in the exchange any given year is complete. I would ideally not drop the companies that do not have data for all years due to bias. I have understood from a previous thread that Stata is able to handle unbalanced panel data but I am not sure what this in practice means, i.e are missing observations dropped. I saw some suggestions as to using multiple imputation when individual values are missing but I am not sure if this can or should be done when entire years are missing for certain companies.
I would be very thankful for suggestion in dealing with the unbalanced data. As this is my first time posting I hope I got the instructions for posting correct and would be happy to provide any additional information if necessary.
(Stata version 14.2 SE)
I am intending to analyze company payout policy determinants and have a set of panel data. My data set is unbalanced due to companies entering and dropping out of the stock exchange I am analyzing. My independent variables consist of financial information for the companies in my data set. The data collected for the companies that are included in the exchange any given year is complete. I would ideally not drop the companies that do not have data for all years due to bias. I have understood from a previous thread that Stata is able to handle unbalanced panel data but I am not sure what this in practice means, i.e are missing observations dropped. I saw some suggestions as to using multiple imputation when individual values are missing but I am not sure if this can or should be done when entire years are missing for certain companies.
Code:
. xtset Conumber Year panel variable: Conumber (unbalanced) time variable: Year, 2005 to 2016, but with gaps delta: 1 unit . xtdescribe Conumber: 1, 2, ..., 128 n = 128 Year: 2005, 2006, ..., 2016 T = 12 Delta(Year) = 1 unit Span(Year) = 12 periods (Conumber*Year uniquely identifies each observation) Distribution of T_i: min 5% 25% 50% 75% 95% max 1 2 9 12 12 12 12 Freq. Percent Cum. | Pattern ---------------------------+-------------- 82 64.06 64.06 | 111111111111 4 3.13 67.19 | 11.......... 4 3.13 70.31 | 111111...... 3 2.34 72.66 | ...........1 3 2.34 75.00 | ...111111111 3 2.34 77.34 | ..1111111111 3 2.34 79.69 | 111111111... 2 1.56 81.25 | ..........11 2 1.56 82.81 | .........111 22 17.19 100.00 | (other patterns) ---------------------------+-------------- 128 100.00 | XXXXXXXXXXXX
(Stata version 14.2 SE)
Comment