Table 1 Summary of all datasets and tasks

Dataset	Reference	Train size	Validation size	Test size	Task type	Projector used	Evaluation metric
General PPI prediction
Gold-standard PPI	²²	163019	59260	52048	Binary classification	MLP	AUPRC
Human-PPI	⁶⁷	26319	234	180	Binary classification	MLP	Accuracy
Yeast-PPI	¹²	4945	95	394	Binary classification	MLP	Accuracy
PDB-Bind	²⁸	4945	95	394	Regression	MLP	Pearson correlation
SKEMPI	²³	4777	–	1929	Regression	MLP	Pearson correlation
MutationalPPI	²⁹	3406	–	–	Binary classification	MLP	AUPRC
Antibody tasks
FLAB (Binding 422)	³⁵	422	–	–	Regression	Ridge regression	R²
FLAB (Binding 2048)	³⁶	2048	–	–	Regression	Ridge regression	R²
FLAB (Binding 4275)	³⁷	4275	–	–	Regression	Ridge regression	R²
FLAB (Expression 4275)	³⁷	4275	–	–	Regression	Ridge regression	R²
SARS-CoV2 binding	³⁸	86929	–	–	Regression	Ridge regression	Spearman rank correlation
TCR-Epitope-MHC tasks
TDC-Tchard	⁴⁴	522239	–	71666	Binary classification	MLP	AUROC
TCR-Epitope-HLA	¹⁷	28144	71036	2806	Binary classification	MLP	AUROC
TCR-epitope interface prediction	⁴⁶	122	–	–	Interface prediction	CNN	AUPRC
