Hi, I just took a short course on multi-level modeling and decided to use some data I had on hand to practice but I am not sure how I can be confident in my results given it is my first time running this kind of model.
My data is repeated measures and very simple as of right now (already in long format). I am looking to see if the implementation of a certain policy affected the number of pending cases in a given county, nested within districts, during a given time period.
year is displayed as 22 and 23 for their relative Fiscal Year, and 0 mean they did not have a policy and 1 is they have a policy. I am treating the data as a "treatment" so no county had a policy in year 22, (all 0's) and then whoever has submitted one recently has a 1. So it looks like this.
group N Mean SD Variance
22 No 100 7698.33 11743.41 1.38e+08
23 No 35 5660.171 6675.436 4.46e+07
23 Yes 65 6935.708 12047.6 1.45e+08
This is my code and output:
my understanding is that in FY 2023, a county has 1,702 less pending cases than in 2022, for every one unit increase in population, pending cases increase by .06 units, and that while not statistically significant, having a continuance policy actually increased the likelihood of having more pending cases?
Sorry if the interpretation is not accurate or well stated.
Thanks!
My data is repeated measures and very simple as of right now (already in long format). I am looking to see if the implementation of a certain policy affected the number of pending cases in a given county, nested within districts, during a given time period.
Code:
* Example generated by -dataex-. For more info, type help dataex clear input str12 county str3 dc_district long pop byte year long(cr_pend dc_cont_pol) "Currituck" "01" 29305 22 1541 0 "Wake" "10" 1156274 22 40508 0 "Cumberland" "12" 334660 22 16323 0 "Columbus" "13" 49307 22 5913 0 "Columbus" "13" 48355 23 4981 1 "Durham" "14" 329973 22 8560 0 "Union" "20B" 247058 23 11769 1 "Watauga" "24" 53639 22 2020 0 "Transylvania" "29B" 32785 22 1920 0 "Graham" "30" 7967 22 905 0 end label values dc_cont_pol yesno label def yesno 0 "No", modify label def yesno 1 "Yes", modify
group N Mean SD Variance
22 No 100 7698.33 11743.41 1.38e+08
23 No 35 5660.171 6675.436 4.46e+07
23 Yes 65 6935.708 12047.6 1.45e+08
This is my code and output:
Code:
mixed cr_pend pop i.dc_cont_pol i.year dc_district:, reml dfmethod(repeated) covariance(unstructured) note: single-variable random-effects specification in dc_district equation; covariance structure set to identity. Performing EM optimization ... Performing gradient-based optimization: Iteration 0: log restricted-likelihood = -1854.8598 Iteration 1: log restricted-likelihood = -1854.8598 Computing standard errors ... Computing degrees of freedom ... Mixed-effects REML regression Number of obs = 200 Group variable: dc_district Number of groups = 41 Obs per group: min = 2 avg = 4.9 max = 14 DF method: Repeated DF: min = 0.00 avg = 147.00 max = 196.00 F(3, 196.00) = 136.24 Log restricted-likelihood = -1854.8598 Prob > F = 0.0000 cr_pend Coefficient Std. err. t P>t [95% conf. interval] pop .0569237 .0029007 19.62 0.000 .051203 .0626444 dc_cont_pol Yes 673.4643 551.141 1.22 0.223 -413.4637 1760.392 23.year -1701.722 447.0522 -3.81 0.000 -2583.372 -820.0714 _cons 2102.219 1183.138 1.78 . . . Random-effects parameters Estimate Std. err. [95% conf. interval] dc_district: Identity var(_cons) 4.46e+07 1.04e+07 2.83e+07 7.05e+07 var(Residual) 3578446 405949 2865047 4469481 LR test vs. linear model: chibar2(01) = 204.95 Prob >= chibar2 = 0.0000 . estat icc Residual intraclass correlation Level ICC Std. err. [95% conf. interval] dc_district .9257837 .0180008 .8818603 .9542243
Sorry if the interpretation is not accurate or well stated.
Thanks!

Comment