I am using -ldagibbs- to find topics from open-end question responses. Conceptually and empirically, responses may include three domains: area of work X + encountered problem Y + irrelevant blah Z. STATA command -ldagibbs- is able to identify statistical clustering as topics that contain elements of X, Y and Z (standalone or combinations). I wonder if there is a way to run LDA with predefined domains so the output would identify, for each response, the area of work x and encountered problem y and ignore all elements of Z.
Alternatively I could run "supervised" ML using -svmachines- against predefined X categories and predefined Y categories, but that would require constructing training data set which is quite labor-intensive.
I searched -ldagibbs- in this forum and didn't see many discussions.
Thanks!
Alternatively I could run "supervised" ML using -svmachines- against predefined X categories and predefined Y categories, but that would require constructing training data set which is quite labor-intensive.
I searched -ldagibbs- in this forum and didn't see many discussions.
Thanks!
Comment