Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Machine learning with twin data

    Hello everyone,
    I plan to design a model that can predict individuals with high or low psychosis-proneness by looking at the brain connectivity measures in 114 regions, age, gender, IQ, SES, polygenic risk score for psychosis (PRS), and environmental risk score for psychosis (ERS). In this way, I plan to investigate the brain regions that distinguish high and low psychosis-proneness by using machine learning. However, my sample includes healthy twin and sibling data; hence, my data is not independent. I could group the sample into high and low psychosis-proneness based on weighted positive symptom severity.
    Therefore, I would like to ask whether there are any codes or approaches in STATA that you can share. I am open to collaborations, as well.
    My data looks like the image I shared (I did not add the PRS since I do not have the information yet).

    Thank you for your help!
    Famid id Zygosity Pp_situation IQ Age Gender SES ERS Eglob Density
    1 1101 M low 129 17 female high -2 78.26 0.13
    1 2101 M low 114 17 female high -1 101.16 0.14
    2 1102 D high 118 19 male low -2 51.54 0.12
    2 2102 D high 123 19 male low -2 68.79 0.13
    3 1103 S high 92 18 female high 4 42.95 0.11
    3 2103 S high 97 20 male high 1 18.52 0.07

Working...
X