What is data matching?

Data matching is the process that enables you to find records representing the same entity in a dataset.

General definition

Data matching enables you to:

Record linkage consists of identifying records that refer to the same entity in a dataset.

Two types of data record linkage exist:

Deterministic record linkage, which is based on identifiers that match; and
Probabilistic record linkage, which is based on the probability that identifiers match.

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!