Dear Stata users,
I have a huge follow up data of more than 4 million visits (for cancer screening) made by about 1 million women. The maximum number of visits a woman can have is 10. There are two symptom variables in the data set, which can occur at any visit during 1-10 visits. A woman can have more than one symptoms also. Below is an example data, for w1 woman, she had symptoms in her 6th visit, now, I want to find a similar women without a visit with symptoms in the same visit (i.e., 6th visit) matched by 4 background variables (not shown below). If a woman had more than one symptoms during 1-10 visits, I just consider the first visit with symptoms. How do I find the similar non-symptomatic visit for a given visit with symptom matched by 4 variables? Is it possible to match two exposure variables (visit with symptoms) to unexposed (visit without symptoms)?
My exposure group is visit with symptoms and comparison group is visits without symptoms. The matching ratio is 1:1 and I have no difficulty finding non-symptomatic visits using 4 matching variables. My follow-up time starts from the visit date with symptoms and ends at the exact date of death (due to cancer or other cause) or at last visit date/loss to follow up.
Thank you.
kind regards,
Deependra
I have a huge follow up data of more than 4 million visits (for cancer screening) made by about 1 million women. The maximum number of visits a woman can have is 10. There are two symptom variables in the data set, which can occur at any visit during 1-10 visits. A woman can have more than one symptoms also. Below is an example data, for w1 woman, she had symptoms in her 6th visit, now, I want to find a similar women without a visit with symptoms in the same visit (i.e., 6th visit) matched by 4 background variables (not shown below). If a woman had more than one symptoms during 1-10 visits, I just consider the first visit with symptoms. How do I find the similar non-symptomatic visit for a given visit with symptom matched by 4 variables? Is it possible to match two exposure variables (visit with symptoms) to unexposed (visit without symptoms)?
My exposure group is visit with symptoms and comparison group is visits without symptoms. The matching ratio is 1:1 and I have no difficulty finding non-symptomatic visits using 4 matching variables. My follow-up time starts from the visit date with symptoms and ends at the exact date of death (due to cancer or other cause) or at last visit date/loss to follow up.
Women | number of visits | year of visit | symptom 1 | symptom 2 | death |
w1 | 1 | 1992 | 0 | 0 | 0 |
2 | 1994 | 0 | 0 | 0 | |
3 | 1996 | 0 | 0 | 0 | |
4 | 1998 | 0 | 0 | 0 | |
5 | 2000 | 0 | 0 | 0 | |
6 | 2002 | 1 | 0 | 0 | |
7 | 2004 | 0 | 0 | 0 | |
8 | 2006 | 0 | 0 | 0 | |
9 | 2008 | 0 | 0 | 1 | |
10 | 2010 | . | . | . | |
w2 | 1 | 1996 | 0 | 0 | 0 |
2 | 1998 | 1 | 0 | 0 | |
3 | 2000 | 0 | 0 | 0 | |
4 | 2002 | 0 | 0 | 0 | |
5 | 2004 | 0 | 0 | 0 | |
6 | 2006 | 0 | 0 | 0 | |
7 | 2008 | 0 | 1 | 0 | |
8 | 2010 | 0 | 0 | 0 | |
9 | 2012 | 0 | 0 | 0 | |
10 | 2014 | 0 | 0 | 0 |
kind regards,
Deependra
Comment