Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Comparing data among two string columns

    Dear Statalist members,
    I want to identify the political connections of the firms. For this I have companies officials names and on the other side I have names of the politicians. I want to trace the politicians who are also the company's directors/officials etc. I want to compare two columns having string data, and the names in both columns may have little difference as well. a politician name may appear little different "Muhammad Saleem Khan" in political data column but it may be "Saleem Khan" in firms data column. I have 6000 rows in corporate data column and 440000 rows in political data column. to cut it short every name of the first variable need to be searched in the complete data of the 2nd variable.

  • #2
    Others here have had success with the user-written matchit command from Julio Raffo, as discussed in the following threads.

    http://www.statalist.org/forums/foru...s-observations

    http://www.statalist.org/forums/foru...-e-fuzzy-match

    https://www.statalist.org/forums/for...ng-using-lists

    https://www.statalist.org/forums/for...it-and-reclink

    Comment

    Working...
    X