  • reclink

    Dear community,

    I have merged two datasets based on a unique companyid with the reclink command.


    reclink company_lender_id using "C:\Users\huett\OneDrive\Dokumente\SS_19\Kaserer\snydicated_loans\lenders - unique.dta", idmaster(idmaster) idusing(idlender) gen(score2) _merge(mergedata2) minscore(.99)

    905 perfect matches found


    Added: idlender= identifier from C:\Users\huett\OneDrive\Dokumente\SS_19\Kaserer\snydicated_loans\lenders - unique.dta score2 = matching score
    Observations: Master N = 906 C:\Users\huett\OneDrive\Dokumente\SS_19\Kaserer\snydicated_loans\lenders - unique.dta N= 19920
    Unique Master Cases: matched = 896 (exact = 905), unmatched = 10




    After running the command several times, I noticed that there are 10 observations in total which Stata refuses to match, although the variable used for matching (namely the unique identifier) is exactly the same in both datasets. I was wondering whether anyone has encountered a similar problem with reclink. What strikes me is that the same 10 observations remain unmatched every time, whereas the algorithm works perfectly for the remaining ~1000 observations. I have checked that there are no special characters such as äüö etc.

    What am I missing?

    Thanks a lot!

    Marie

  • #2
    I'm curious why you're using reclink. If the identifier is the same in both data sets, couldn't you just use merge?
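
    A minimal sketch of such a merge, which also lists any master rows that fail to match. It assumes that company_lender_id is a string variable present in both files and that it is unique in the using file; "master.dta" is a hypothetical name for the master data, and the full path from post #1 is shortened to the file name. Trimming blanks is only one guess at why visually identical identifiers might differ, not a confirmed diagnosis.

    ```stata
    * Clean the key in the using file and save a temporary copy
    use "lenders - unique.dta", clear
    replace company_lender_id = strtrim(company_lender_id)   // drop leading/trailing blanks
    isid company_lender_id                                   // stops with an error if the key is not unique
    tempfile lenders
    save `lenders'

    * Clean the key in the master data the same way ("master.dta" is hypothetical)
    use "master.dta", clear
    replace company_lender_id = strtrim(company_lender_id)

    * Many master rows may share one lender, so m:1; keep master rows, matched or not
    merge m:1 company_lender_id using `lenders', keep(master match)

    * _merge == 1 marks master rows with no counterpart in the using file
    tabulate _merge
    list company_lender_id if _merge == 1
    ```

    If rows still show up with _merge == 1, comparing the listed values against the using file character by character (for example with length()) would be a natural next step.
    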


    • #3
      Hello William,

      thanks for your reply, and sorry for getting back to you so late. Eventually reclink worked perfectly, but I cannot pinpoint why ;D To answer your question: I have used merge successfully before, but only on much smaller datasets. It seems as if merge cannot handle files with more than ~150'000 rows, or at least that is what happened to me: every time I tried to simply merge the sets, I received an error along the lines of "op. sys. refuses to provide memory". I should note that I used an m:1 merge.

      Perhaps I will open a new thread on this, because I'll need to merge two more files very soon based on a facilityID.

      Marie
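
      The error quoted above refers to memory rather than to a row count, so one thing that may help with the upcoming facilityID merge is to keep the data in memory as small as possible. The sketch below is only an illustration under assumed names: "facilities.dta", "lenders.dta", facilityid, loan_amount and lender_name are all hypothetical and do not come from the thread.

      ```stata
      * Load only the variables that are actually needed from the master file
      use facilityid loan_amount using "facilities.dta", clear

      * Store each variable in the smallest type that holds its values
      compress

      * Bring over only the named variable from the using file instead of all of them
      merge m:1 facilityid using "lenders.dta", keepusing(lender_name)

      * Check how many rows matched
      tabulate _merge
      ```

      Whether this is enough depends on how wide the two files are and how much memory the machine has available; it reduces what Stata has to hold, but it does not change any limit set by the operating system.
      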
