Monday, October 19, 2015

Fuzzy Match Using tRecordMatching Component with Lookup



Let say we have two tFixedFlowInput that have some similar street address, now we want to find out those.

Data source 1-
 


Data Source 2-

And we use tRecordMatching Component with fuzzy matching function Jaro-winkler for finding duplicates-

 


Note: we can also use blocking definition option.

My job design is as below-




And the final output is-


Please do comment if you have any questions.

2 comments:

  1. There is another component tFuzzymatch for the same task.

    ReplyDelete
    Replies
    1. Not really, no.
      For a single column, yes, but for this example, it is not possible with a tfuzzymatch.
      When you are dealing with real matching stuff, you have to use a trecordmatching.

      Delete