Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • similar observations within one variable

    Hi,

    I would like to spot the observations that are very much alike within one string variable.

    Let's say for instance that I have a variable with 4 observations, such as:
    var1
    observation1: "cat"
    obs 2: "caty"
    obs 3: "the cat is beautiful"
    obs 4: "cat"
    I would like to have some distance measure that tells me that observation 1 and 4 are equal, observations 1 and 2 are quite similar, but observations 1 and 3 are very different. Is it possible?

    Thanks

  • #2
    -search string distance- reveals several user-written programs that calculate string distances, none of which I have used. Using the built-in command -cross-, you could create a data set with string values for all pairs of observation represented in two different string variables, and then use one of those commands to compare the strings.

    Comment

    Working...
    X