Deduplication

The goal of deduplication is to eliminate duplicate records by merging them together into a single, accurate, and consolidated golden record. This process maintains full traceability, allowing you to identify contributing records for the resulting golden record and providing the possibility to revert changes if necessary.

You can reduce the number of duplicates in the system proactively even before creating a deduplication project. For this purpose, CluedIn provides the possibility of merging by codes: those data parts that have identical entity origin codes or entity codes are merged during processing. For more information, see Codes.

The following diagram shows the basic steps for merging duplicates in CluedIn.

dedup-main.gif

This section covers the following areas:


Table of contents