Table 1 Additional human-compiled blocklisting rules enforced in the ThemeCompound model.

From: A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor

| Field | Blocklisting rules |
|---|---|
| Names | Separator “🙃🙃🙃🙃”. |
| Names | DOI strings of the current document. |
| Names | Chemical entity mentions (CEMs) followed by “-based”, “part”, “segment” and the like. |
| Names | CEMs ending in “o” (e.g. amino), “yl” (e.g. phenyl) and similar endings that mark substituent prefixes. |
| Labels | Strings of dates, electronic states or units. |
| Labels | Numbers followed by % or wt %. |
| Labels | Strings longer than 4 characters. |
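
The rules in Table 1 can be read as simple string filters. The following is a minimal sketch of that reading, not the ThemeCompound implementation or any ChemDataExtractor API. The function names, the `following`, `doi` and `separator` parameters, and the example patterns for dates, electronic states and units are all assumptions introduced for illustration, since the table does not enumerate them.

```python
import re

# All names and patterns below are hypothetical illustrations of Table 1.
# Words that, when they directly follow a CEM, mark it as a fragment.
FOLLOWING = re.compile(r"(?:-based|part|segment)\b")
# Endings typical of substituent prefixes (checked case-sensitively here,
# an assumption made so that upper-case acronyms are not caught).
PREFIX_ENDINGS = ("o", "yl")

# Assumed example patterns; the table names only the categories.
YEAR = re.compile(r"(?:19|20)\d{2}")            # dates, e.g. 2019
ELECTRONIC_STATE = re.compile(r"[ST]\d")        # e.g. S0, S1, T1
UNITS = {"nm", "eV", "ns", "ms", "K"}           # a few common units
PERCENT = re.compile(r"\d+(?:\.\d+)?\s*(?:wt\s*)?%")


def name_is_blocklisted(name, following="", doi="", separator=None):
    """Return True if a candidate compound name matches a 'Names' rule."""
    if separator and separator in name:
        return True
    if doi and doi.lower() in name.lower():
        return True
    # Text right after the mention, e.g. "-based emitters" or " segment".
    if FOLLOWING.match(following.lstrip().lower()):
        return True
    return name.endswith(PREFIX_ENDINGS)


def label_is_blocklisted(label):
    """Return True if a candidate compound label matches a 'Labels' rule."""
    text = label.strip()
    if len(text) > 4:
        return True
    if PERCENT.fullmatch(text):
        return True
    return bool(
        YEAR.fullmatch(text)
        or ELECTRONIC_STATE.fullmatch(text)
        or text in UNITS
    )
```

Under these assumptions, `name_is_blocklisted("carbazole", following="-based emitters")` and `label_is_blocklisted("S1")` are both rejected, while a short label such as `"1a"` passes. Passing the following text explicitly keeps the context-dependent rule ("followed by ...") separate from the rules that inspect the name alone.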