Hi,
I have a dependent variable absent days that is a continuous variable and is overdispersed as shown in the summary statistics below.

Would it be appropriate to convert this into a count variable by rounding up the half days and then run a negative binomial regression? Otherwise, I have read that a gamma regression with a log-link may also be used for overdispersed continuous dependent variables, would this be better?
Any other suggestions would be very helpful!
Thank you!
Best,
Surya
I have a dependent variable absent days that is a continuous variable and is overdispersed as shown in the summary statistics below.
Code:
tab absdays_vacationX_2 combined | for | 1990-2011 | Freq. Percent Cum. ------------+----------------------------------- 0 | 740 40.28 40.28 1 | 3 0.16 40.45 2 | 15 0.82 41.26 3 | 16 0.87 42.13 4 | 16 0.87 43.00 5 | 194 10.56 53.57 6 | 2 0.11 53.67 7 | 5 0.27 53.95 8 | 9 0.49 54.44 10 | 195 10.62 65.05 14 | 2 0.11 65.16 15 | 105 5.72 70.88 16 | 2 0.11 70.99 17 | 1 0.05 71.04 20 | 47 2.56 73.60 21.5 | 14 0.76 74.36 25 | 27 1.47 75.83 26.5 | 1 0.05 75.88 28 | 1 0.05 75.94 30 | 61 3.32 79.26 31.5 | 1 0.05 79.31 32 | 1 0.05 79.37 35 | 32 1.74 81.11 40 | 35 1.91 83.02 41.5 | 1 0.05 83.07 43 | 18 0.98 84.05 43.5 | 3 0.16 84.21 45 | 26 1.42 85.63 50 | 25 1.36 86.99 53.5 | 1 0.05 87.04 55 | 14 0.76 87.81 60 | 21 1.14 88.95 63.5 | 2 0.11 89.06 64.5 | 34 1.85 90.91 65 | 25 1.36 92.27 70 | 17 0.93 93.20 75 | 15 0.82 94.01 80 | 11 0.60 94.61 85 | 13 0.71 95.32 86 | 11 0.60 95.92 86.5 | 4 0.22 96.14 90 | 5 0.27 96.41 90.5 | 1 0.05 96.46 95 | 3 0.16 96.62 100 | 7 0.38 97.01 105 | 3 0.16 97.17 107.5 | 8 0.44 97.60 108.5 | 1 0.05 97.66 110 | 7 0.38 98.04 115 | 3 0.16 98.20 125 | 1 0.05 98.26 129 | 7 0.38 98.64 130 | 11 0.60 99.24 145 | 1 0.05 99.29 150 | 1 0.05 99.35 150.5 | 2 0.11 99.46 172 | 1 0.05 99.51 185 | 1 0.05 99.56 193.5 | 2 0.11 99.67 195 | 2 0.11 99.78 210 | 1 0.05 99.84 215 | 1 0.05 99.89 230 | 1 0.05 99.95 255 | 1 0.05 100.00 ------------+----------------------------------- Total | 1,837 100.00 . . sum absdays_vacationX_2 Variable | Obs Mean Std. Dev. Min Max -------------+--------------------------------------------------------- absdays_v~_2 | 1,837 19.17719 31.16322 0 255
Would it be appropriate to convert this into a count variable by rounding up the half days and then run a negative binomial regression? Otherwise, I have read that a gamma regression with a log-link may also be used for overdispersed continuous dependent variables, would this be better?
Any other suggestions would be very helpful!
Thank you!
Best,
Surya
Comment