Hi all,
I am working on a big dataset that includes more than a million patients with unique fake IDs. The IDs take the form of "map_code_111000" and are stored as strings. I want to convert them to a numeric format. What would be the best approach to tackle this? I was thinking of either removing the "map_code_" part or extracting the number "111000". Is this an acceptable approach? If yes, what function or syntax should I use? If not, what approach do you recommend?
I am working on a big dataset that includes more than a million patients with unique fake IDs. The IDs take the form of "map_code_111000" and are stored as strings. I want to convert them to a numeric format. What would be the best approach to tackle this? I was thinking of either removing the "map_code_" part or extracting the number "111000". Is this an acceptable approach? If yes, what function or syntax should I use? If not, what approach do you recommend?
Comment