Hi all,
I have an Excel data set for patents that I imported into STATA. In the ID variable of a patent (each patent corresponds to a row) I have multiple values (firm IDs) as the patents are granted to x number of applicants. The data is str144. firmIDs as string variables. If there are more than one firms involved, the IDs seem to be combined = no delimeter. I need to split each firm (and then reshape to create a dataset in which one row represent patent-firm. So if there is onyl one patent with 3 firms, I will have a dataset with 3 records. This is not the problem. The problem is the delimeter so that I can parse ands split the firmID data.
In the Excel each firm in one patent record shows as a new line, formatted Alt + Ent. In Excel the delimeter is Ctrl+J if I want to convert text to column. What is the character that corresponds to this in STATA. Spitting in Excel will take longer time.
When I ran charlist for that ID variable, the result is not helpful (as I saw in other posts) : *-0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ
These are usual characters for one firmID in the patent data
I highly appreciate your help, best
Melike
I have an Excel data set for patents that I imported into STATA. In the ID variable of a patent (each patent corresponds to a row) I have multiple values (firm IDs) as the patents are granted to x number of applicants. The data is str144. firmIDs as string variables. If there are more than one firms involved, the IDs seem to be combined = no delimeter. I need to split each firm (and then reshape to create a dataset in which one row represent patent-firm. So if there is onyl one patent with 3 firms, I will have a dataset with 3 records. This is not the problem. The problem is the delimeter so that I can parse ands split the firmID data.
In the Excel each firm in one patent record shows as a new line, formatted Alt + Ent. In Excel the delimeter is Ctrl+J if I want to convert text to column. What is the character that corresponds to this in STATA. Spitting in Excel will take longer time.
When I ran charlist for that ID variable, the result is not helpful (as I saw in other posts) : *-0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ
These are usual characters for one firmID in the patent data
I highly appreciate your help, best
Melike
Comment