Hi, I am hoping to run a supervised text classification in Stata. Specifically, I have a dataset that contains over 10,000 abstracts of papers published in the past five years. I would like to classify these abstracts into discipline categories such as social sciences and natural sciences. I have pre-classified over 100 abstracts into these categories and would like to use these labeled abstracts as training data to classify the rest of the sample. Can you please provide sample code to help me? Please note that I am only proficient in Stata. I also could not make it work with c_ml_stata_cv. Thank you in advance for all your help!