Hello everyone,
I plan to design a model that can predict individuals with high or low psychosis-proneness by looking at the brain connectivity measures in 114 regions, age, gender, IQ, SES, polygenic risk score for psychosis (PRS), and environmental risk score for psychosis (ERS). In this way, I plan to investigate the brain regions that distinguish high and low psychosis-proneness by using machine learning. However, my sample includes healthy twin and sibling data; hence, my data is not independent. I could group the sample into high and low psychosis-proneness based on weighted positive symptom severity.
Therefore, I would like to ask whether there are any codes or approaches in STATA that you can share. I am open to collaborations, as well.
My data looks like the image I shared (I did not add the PRS since I do not have the information yet).
Thank you for your help!
I plan to design a model that can predict individuals with high or low psychosis-proneness by looking at the brain connectivity measures in 114 regions, age, gender, IQ, SES, polygenic risk score for psychosis (PRS), and environmental risk score for psychosis (ERS). In this way, I plan to investigate the brain regions that distinguish high and low psychosis-proneness by using machine learning. However, my sample includes healthy twin and sibling data; hence, my data is not independent. I could group the sample into high and low psychosis-proneness based on weighted positive symptom severity.
Therefore, I would like to ask whether there are any codes or approaches in STATA that you can share. I am open to collaborations, as well.
My data looks like the image I shared (I did not add the PRS since I do not have the information yet).
Thank you for your help!
| Famid | id | Zygosity | Pp_situation | IQ | Age | Gender | SES | ERS | Eglob | Density |
| 1 | 1101 | M | low | 129 | 17 | female | high | -2 | 78.26 | 0.13 |
| 1 | 2101 | M | low | 114 | 17 | female | high | -1 | 101.16 | 0.14 |
| 2 | 1102 | D | high | 118 | 19 | male | low | -2 | 51.54 | 0.12 |
| 2 | 2102 | D | high | 123 | 19 | male | low | -2 | 68.79 | 0.13 |
| 3 | 1103 | S | high | 92 | 18 | female | high | 4 | 42.95 | 0.11 |
| 3 | 2103 | S | high | 97 | 20 | male | high | 1 | 18.52 | 0.07 |
