Fig. 1: Graphinity architecture and synthetic dataset preparation. | Nature Computational Science

Fig. 1: Graphinity architecture and synthetic dataset preparation.

From: Investigating the volume and diversity of data needed for generalizable antibody–antigen ΔΔG prediction

Fig. 1

a, The EGNN deep learning models are trained on graphs of three-dimensional protein structure coordinates. The graphs are built from atoms in the neighborhood of the mutated site, conceptually illustrated with circles (nodes, atoms) and connecting lines (edges, interactions). The antibody is shown in purple, the antigen in blue and inter-binding partner edges in orange. Our model architecture consists of three E(n) EGC layers23, followed by a linear layer. For ΔΔG prediction, the embeddings generated from the E(n) EGC layers for the WT and mutant complex are subtracted from one another before passing through the linear layer. b, The synthetic ΔΔG datasets were generated from structurally resolved complexes from SAbDab26,27. We mutated interface residues and predicted ΔΔG values using FoldX11 and Rosetta Flex ddG12. c, An example of ΔΔG data for a complex. PDB: 1XGP56; antibody in purple, antigen in blue; affinity values from SKEMPI 2.031. Mut, mutant.

Back to article page