Table 3 Notations used in this paper.
From: An efficient learning based approach for automatic record deduplication with benchmark datasets
Notation | Description |
|---|---|
\({A}_{j}\) | Denotes jth attribute |
\({}_{VL}\) | Denotes word embedding of \({w}_{l}\) |
A | Denotes an attribute |
a, b | Indicate two scalar values |
D | Denotes dataset |
df(w) | Indicates the number of tuples in which token w occurs in at least one attribute |
idf(w) | Denotes inverse document frequency of token w |
N | Represents pairs of tuples |
S | Denotes a vector |
T | Denotes a tuple |
x, y | Denote two k-dimensional vectors |
Y | Denotes a label vector |
α | Denotes smoothing hyperparameter |
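
To make the df(w) and idf(w) entries concrete, the following is a minimal Python sketch rather than the paper's implementation. It assumes lowercased whitespace tokenization and the common unsmoothed form idf(w) = log(|D| / df(w)); the table does not state which tokenizer or idf variant is used, and the function names `document_frequency` and `inverse_document_frequency` are hypothetical.

```python
import math
from collections import Counter


def document_frequency(tuples):
    """df(w): number of tuples in which token w occurs in at least one attribute."""
    df = Counter()
    for t in tuples:
        # Build the set of tokens seen anywhere in this tuple, so a token
        # repeated across attributes is still counted once per tuple.
        # Assumption: lowercased whitespace tokenization.
        tokens = {tok for value in t for tok in str(value).lower().split()}
        df.update(tokens)
    return df


def inverse_document_frequency(df, n_tuples):
    """idf(w) = log(n_tuples / df(w)). Assumption: standard unsmoothed variant."""
    return {w: math.log(n_tuples / count) for w, count in df.items()}


# Toy dataset D: each tuple T is a sequence of attribute values.
D = [
    ("John Smith", "New York"),
    ("Jon Smith", "New York"),
    ("Mary Jones", "Boston"),
]

df = document_frequency(D)
idf = inverse_document_frequency(df, len(D))
print(df["smith"], round(idf["smith"], 3))
print(df["boston"], round(idf["boston"], 3))
```

In this toy dataset, a token such as "boston" that occurs in a single tuple receives a higher idf than "smith", which occurs in two of the three tuples, so rarer tokens carry more weight.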