Hi all,
I am writing a postdoc grant application right now and I have written a paragraph on a sample size calculation example, as follows:
"Assuming I wanted to detect a hazard ratio of 0.91 for overall survival in [X] patients associated with [X] use, at 80% power, with a 5% level of significance, and assuming a 10-year survival rate of 10% and prevalence of [X] use among [X] of ~25%, I would need a sample size of 5,230 patients. As I have identified 33,500 [X] from in my cohort (and assuming that ~30% of [X] are diagnosed with [X]), the sample size will be sufficient to detect HRs closer to the null (1) if the protective effect of [X] is not as strong among [X]. In fact, given my sample size of ~10,050 [X], the size of my cohort will be sufficiently powered (at 80% power) to detect a reduction in risk as small as 7% (i.e., a HR of 0.93)".
The [X]'s simply represent my cohort and exposure of interest-I didn't think they were relevant to include.
The commands I used in STATA are as follows:
In this command, I took the natural log of the HR (0.91) and derived the standard deviation from the square root of 0.25*(1-0.25) (i.e., the prevalence of the exposure multipled by 1 minus the prevalence).
In this command, I took the sample size as 30% of 33,500 (10,050).
Do these commands/calculations look correct? Any help would be appreicated, because I'm not very used to using power cox in STATA.
Warm regards, Oliver
I am writing a postdoc grant application right now and I have written a paragraph on a sample size calculation example, as follows:
"Assuming I wanted to detect a hazard ratio of 0.91 for overall survival in [X] patients associated with [X] use, at 80% power, with a 5% level of significance, and assuming a 10-year survival rate of 10% and prevalence of [X] use among [X] of ~25%, I would need a sample size of 5,230 patients. As I have identified 33,500 [X] from in my cohort (and assuming that ~30% of [X] are diagnosed with [X]), the sample size will be sufficient to detect HRs closer to the null (1) if the protective effect of [X] is not as strong among [X]. In fact, given my sample size of ~10,050 [X], the size of my cohort will be sufficiently powered (at 80% power) to detect a reduction in risk as small as 7% (i.e., a HR of 0.93)".
The [X]'s simply represent my cohort and exposure of interest-I didn't think they were relevant to include.
The commands I used in STATA are as follows:
Code:
power cox -0.094310679, p(0.8) eventpr(0.90) sd(0.433012701)
In this command, I took the natural log of the HR (0.91) and derived the standard deviation from the square root of 0.25*(1-0.25) (i.e., the prevalence of the exposure multipled by 1 minus the prevalence).
Code:
power cox, n(10050) p(0.8) eventpr(0.90) effect(hr) sd(0.433012701)
In this command, I took the sample size as 30% of 33,500 (10,050).
Do these commands/calculations look correct? Any help would be appreicated, because I'm not very used to using power cox in STATA.
Warm regards, Oliver
Comment