Dear users
I have the following type of analysis that needs doing. I'm inexperienced with logistic regression so any pointers would be great.
Outcome: binary (variable name: status; values employed and unemployed).
Predictor: binary (variable name: treatment; values treatment and control)
Data: collected from entrants to youth employability programmes. Entrants are randomly assigned to clusters (training sites, variable name siteid), where one or other type of employability programme is being run (treatment/control). It is a panel dataset with 2 waves: baseline and endline. In long format this time dimension is captured in the wave variable, which takes values of either wave 1 or wave 2. Individuals are uniquely identified with variable id.
I am trying to discern whether entrants to control programmes have a significantly lower probability of becoming employed at wave 2, compared to entrants to treatment programmes.
The test I thought I should be doing was a conditional logistic regression (using clogit) - looking only at individuals who have transitioned between employment states and comparing the likelihood of this transition between treatment and control groups.
However this yields the error: outcome does not vary in any group. I suppose this is because the treatment variable is time-invariant and observation-invariant.
Should I instead be using panel regression with random effects? eg:
Thanks,
Zoheb
I have the following type of analysis that needs doing. I'm inexperienced with logistic regression so any pointers would be great.
Outcome: binary (variable name: status; values employed and unemployed).
Predictor: binary (variable name: treatment; values treatment and control)
Data: collected from entrants to youth employability programmes. Entrants are randomly assigned to clusters (training sites, variable name siteid), where one or other type of employability programme is being run (treatment/control). It is a panel dataset with 2 waves: baseline and endline. In long format this time dimension is captured in the wave variable, which takes values of either wave 1 or wave 2. Individuals are uniquely identified with variable id.
I am trying to discern whether entrants to control programmes have a significantly lower probability of becoming employed at wave 2, compared to entrants to treatment programmes.
The test I thought I should be doing was a conditional logistic regression (using clogit) - looking only at individuals who have transitioned between employment states and comparing the likelihood of this transition between treatment and control groups.
However this yields the error: outcome does not vary in any group. I suppose this is because the treatment variable is time-invariant and observation-invariant.
Should I instead be using panel regression with random effects? eg:
Code:
xtset id wave xtlogit status treatment, re vce cluster siteid
Thanks,
Zoheb
Comment