Dear users
I am looking at a longitudinal dataset with 2 waves of data (baseline and endpoint). The data is collected from entrants to employability programmes. It is a randomised controlled trial: entrants are randomly allocated either to programme type A or B. Programmes A and B are rolled out at sites, or clusters, of which there are 44. Random assignment is therefore to a cluster which is running either programme A or B.
I am trying to assess change in outcomes between wave 1 and 2. For continuous outcomes the simplest option would be to run paired t-tests. However this ignores the clustered nature of the data and leads to standard errors and p-values that are too small (on some variables the ICC is 'significantly' higher than zero). There are two types of analyses I am trying to conduct:
1) Difference between outcome at wave 1 and 2
2) Difference between waves 1 and 2 by a binary group variable - eg whether the respondent is in a treatment or control group.
The best I can come up with is:
For (1):
. clttest outcome, cluster (cluster] by (wave) [in long format]
OR
.ttest w1_outcome==w2_outcome. [in wide format]
So I'm either ignoring the dependence of the observations, or the clustering. User-written clttest seems to not support paired data.
For (2):
Generate a difference variable (ie w2_outcome-w1_outcome), and run clttest, eg:
.clttest delta_outcome, cluster (cluster) by (treatment)
In this case I think both the clustering and the pairing of observations have been dealt with but I'm not 100% sure.
Any guidance at all would be greatly appreciated.
Thanks,
Zoheb
I am looking at a longitudinal dataset with 2 waves of data (baseline and endpoint). The data is collected from entrants to employability programmes. It is a randomised controlled trial: entrants are randomly allocated either to programme type A or B. Programmes A and B are rolled out at sites, or clusters, of which there are 44. Random assignment is therefore to a cluster which is running either programme A or B.
I am trying to assess change in outcomes between wave 1 and 2. For continuous outcomes the simplest option would be to run paired t-tests. However this ignores the clustered nature of the data and leads to standard errors and p-values that are too small (on some variables the ICC is 'significantly' higher than zero). There are two types of analyses I am trying to conduct:
1) Difference between outcome at wave 1 and 2
2) Difference between waves 1 and 2 by a binary group variable - eg whether the respondent is in a treatment or control group.
The best I can come up with is:
For (1):
. clttest outcome, cluster (cluster] by (wave) [in long format]
OR
.ttest w1_outcome==w2_outcome. [in wide format]
So I'm either ignoring the dependence of the observations, or the clustering. User-written clttest seems to not support paired data.
For (2):
Generate a difference variable (ie w2_outcome-w1_outcome), and run clttest, eg:
.clttest delta_outcome, cluster (cluster) by (treatment)
In this case I think both the clustering and the pairing of observations have been dealt with but I'm not 100% sure.
Any guidance at all would be greatly appreciated.
Thanks,
Zoheb
Comment