Table 3 The impact of molecular representation learning pre-training with different self-supervised task combinations

MLM	MDP	MFGP	Warm Start	Drug Cold Start	Target Cold Start
✓	×	×	0.697	0.247	0.543
×	✓	×	0.586	0.203	0.509
×	×	✓	0.783	0.305	0.594
✓	✓	×	0.82	0.418	0.673
✓	×	✓	0.902	0.527	0.802
×	✓	✓	0.851	0.489	0.769
✓	✓	✓	0.931	0.568	0.865

Note: MLM, MDP, and MFGP are three self-supervised tasks for molecular pre-training, representing Masked Language Modeling, Molecular Descriptor Prediction and Molecular Functional Group Prediction, respectively. The ablation experiment is performed on the Yamanishi 08’s dataset under three experimental settings, and the evaluation metric is AUPR. Bold numbers represent the best performance, higher values are better.

Quick links

Search