What is data matching?
General definition
- Find duplicates, potential duplicates and non-duplicates in a data source
- Analyze data and return weighted probabilities of matching
- Merge identical or similar entries into a single entry; and
- Reduce disparity across different data sources.
Record linkage
Record linkage consists of identifying records that refer to the same entity in a dataset.
- Deterministic record linkage, which is based on identifiers that match; and
- Probabilistic record linkage, which is based on the probability that identifiers match.