Hi all,
I've got a dataset measuring campaign activities (N=1850). Candidates are members of parties (party)(n=8) and nested in electoral districts (district) (n=29) and thus also in parties within districts (=party lists) (list) (n=8*29=232; not all lists in sample --> n= 195). I got hypotheses on the effects of variables on all these levels. For a single level regression my model goes as follows (fake dataex and regression):
(Group identifiers are Variable: partydummies (party affiliation), districtid (electoral districts))
Dependent Variable pers: use of personalized campaign activities (individual level). Variable x (list level), exps10 (list level) and DM (district level) are on the context level. Rest are individual level variables.
However, my concern is that due to the nested structure the errors might not be iid and thus inference from the standard errors limited
What would be the appropriate levels for a multilevel model?
For the null model in stata notation i was guessing (since list equals party-in-a-district): (1)
However, when I am also interested in the effects of party affiliation itself, how would I incorporate this into a multilevel model?
Perhaps like this?:
The ICC of the null model (1) is rather low. 0.07 to be precise.
Also, multi-level modelling would be quite disproportional to the scope of the analysis.
Can I get any solid inferences from just the pooled OLS model? Perhaps with cluster-robust standard errors? On which level should I cluster?
Kind regards
I've got a dataset measuring campaign activities (N=1850). Candidates are members of parties (party)(n=8) and nested in electoral districts (district) (n=29) and thus also in parties within districts (=party lists) (list) (n=8*29=232; not all lists in sample --> n= 195). I got hypotheses on the effects of variables on all these levels. For a single level regression my model goes as follows (fake dataex and regression):
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input float(pers gen ido chance2 nominc) byte DM float exps10 long x float(exps102 poliexp nominwhocat partydummies districtid) 3 1 2 0 3 9 3 55 9 . 1 4 21 4 0 2 1 3 10 3 49 9 0 0 6 20 1 0 1 2 3 29 10 45 100 1 0 7 29 0 0 0 1 0 22 10 45 100 1 0 7 23 4 1 1 0 3 10 10 45 100 0 1 2 1 1 1 2 1 2 9 10 45 100 3 0 6 21 6 0 1 0 2 12 10 45 100 0 . 7 12 end
Code:
reg pers x DM exps10 exps102 i.chance2 i.nominc i.nominwhocat ido gen i.poliexp i.partydummies
Dependent Variable pers: use of personalized campaign activities (individual level). Variable x (list level), exps10 (list level) and DM (district level) are on the context level. Rest are individual level variables.
However, my concern is that due to the nested structure the errors might not be iid and thus inference from the standard errors limited
What would be the appropriate levels for a multilevel model?
For the null model in stata notation i was guessing (since list equals party-in-a-district): (1)
Code:
mixed pers(candidate level), reml || district: || party:
Perhaps like this?:
Code:
mixed pers i.party, reml || district: || party:
Also, multi-level modelling would be quite disproportional to the scope of the analysis.
Can I get any solid inferences from just the pooled OLS model? Perhaps with cluster-robust standard errors? On which level should I cluster?
Kind regards
Comment