Hello,
I am working with variables that consist of strings of binary digits, for example “11011”, “10000”, and “01010”.
I need to compute various distance or similarity measures between two or more of these strings. So far, I have split each string into 5 columns (variables), converted them to numeric digits, exported them to Excel, and calculated cosine similarities there. I also looked at strdist, but it only calculates the Levenshtein distance between two strings at a time.
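To make the calculation concrete, here is a minimal Python sketch of what I am after, using the three example strings above. The function names `hamming` and `cosine` are just illustrative, and Hamming distance is my own addition as one possible measure for equal-length binary strings; it is not the same thing as the Levenshtein distance that strdist reports.

```python
from itertools import combinations
from math import sqrt


def hamming(a, b):
    """Count the positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have the same length")
    return sum(x != y for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity, treating each string as a vector of 0/1 digits."""
    u = [int(ch) for ch in a]
    v = [int(ch) for ch in b]
    dot = sum(x * y for x, y in zip(u, v))
    # For 0/1 vectors the squared norm equals the number of ones.
    norm = sqrt(sum(u)) * sqrt(sum(v))
    # An all-zero string has no direction, so the similarity is undefined.
    return dot / norm if norm else float("nan")


strings = ["11011", "10000", "01010"]

# Every unordered pair of strings mapped to (Hamming distance, cosine similarity).
pairwise = {
    (a, b): (hamming(a, b), cosine(a, b))
    for a, b in combinations(strings, 2)
}

for (a, b), (dist, sim) in pairwise.items():
    print(a, b, dist, round(sim, 4))
```

This handles all pairs in one pass, which is the part I have not been able to do without going through Excel.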
Is there an easier way to compute distances or cosine similarities among multiple strings of binary digits? Thanks!
Best,
Henry