Friday, February 13, 2015

Match Analysis in Talend Data Quality

Hi Friends,

We can explore duplicate records effectively by using match analysis.

open Talend and select Profiling perspective as showed in below image.



To create new analysis go to Data profiling > Analysis > New Analysis


See below picture, we ICO1(EmployeeID) and Country Code. One ICO1 have multipale data. We want to find the duplicate country code with respect to ICO1.

 Duplicate country code groups are separated by different colors.

So in above image we can see country POLAND have 4 records for ICO1 6392.
GRP_SIZE column (first in each group) telling the duplicate group count.



​Above chart will also help you to understand matching coutnry code and group count.

Below are the steps to run this analysis:-

first click on Select Matching Key button then click on column on which you want do match
analysis.



​Scroll down and click on Chart button near Matching Key tab.
You will see the output. Enjoy!!




No comments:

Post a Comment