Fig. 1: Graphical illustration of deterministic and probabilistic data linkage methods.
From: A scoping review of routinely collected linked data in research on gambling harm

The former methods use specifically defined rules to classify data sources, such as an individual’s identity number or date of birth, while the latter methods assign probabilistic weights to records conditional on a range of identifiers to represent the likelihood that they are drawn from the same individual.