Deduplication

Deduplication eliminates duplicate records by merging them into a single, accurate, consolidated golden record. The process maintains full traceability: you can identify which records contributed to a golden record and revert changes if necessary.

You can reduce the number of duplicates in the system proactively, even before you create a deduplication project. To do this, CluedIn supports merging by identifiers: data parts that share an identical primary identifier or additional identifier are merged during processing. For more information, see Identifiers.
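To make the idea of merging by identifiers concrete, the following is a minimal sketch in Python. It is not CluedIn's implementation or API. The record shape (`source`, `identifiers`), the identifier strings, and the function name `merge_by_identifiers` are all assumptions made for illustration. The sketch groups records that share at least one identifier, directly or through a chain of shared identifiers, and keeps the list of contributing sources so that each merged result stays traceable.

```python
from collections import defaultdict


def merge_by_identifiers(records):
    """Group records that share at least one identifier.

    Hypothetical data shape: each record is a dict with a "source" string
    and an "identifiers" list. This is an illustration, not CluedIn's API.
    """
    # Union-find over record indexes: records sharing an identifier
    # end up with the same root.
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            # Keep the smaller index as the root so output order is stable.
            parent[max(ra, rb)] = min(ra, rb)

    first_seen = {}  # identifier -> index of the first record carrying it
    for idx, rec in enumerate(records):
        for ident in rec["identifiers"]:
            if ident in first_seen:
                union(idx, first_seen[ident])
            else:
                first_seen[ident] = idx

    groups = defaultdict(list)
    for idx in range(len(records)):
        groups[find(idx)].append(idx)

    merged = []
    for root in sorted(groups):
        members = groups[root]
        merged.append({
            "identifiers": sorted(
                {i for m in members for i in records[m]["identifiers"]}
            ),
            # Contributing sources are kept for traceability.
            "sources": [records[m]["source"] for m in members],
        })
    return merged


# Made-up sample data: the first three records are linked through
# a shared email identifier and a shared ERP identifier.
records = [
    {"source": "crm", "identifiers": ["email:ann@example.com"]},
    {"source": "erp", "identifiers": ["email:ann@example.com", "erp:1001"]},
    {"source": "web", "identifiers": ["erp:1001"]},
    {"source": "crm", "identifiers": ["email:bob@example.com"]},
]
merged = merge_by_identifiers(records)
```

In this sample, the `web` record never shares an identifier with the first `crm` record directly, yet both land in the same group because the `erp` record links them. Whether CluedIn resolves such chains in the same way is not stated here; treat the transitive behavior as an assumption of the sketch.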

The following diagram shows the basic steps for merging duplicates in CluedIn.

[Figure: dedup-main.gif, basic steps for merging duplicates in CluedIn]
