Fig. 2: Predicting protein tertiary structure using ProFOLD. | Nature Communications

Fig. 2: Predicting protein tertiary structure using ProFOLD.

From: CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction

Fig. 2: Predicting protein tertiary structure using ProFOLD.The alternative text for this image may have been generated using AI.

Here, we use the CASP13 target protein T0992-D1 as an example to describe the main steps of ProFOLD. Only the first 13 residues are shown here for the sake of clear description. First, we search this protein against sequence databases to identify its homologous proteins (2,807 proteins in total). Next, we use the acquired homologous protein to construct an MSA for this protein. Then we apply CopulaNet to infer residue co-evolution directly from the MSA. CopulaNet uses an MSA encoder to model the mutation information for each residue of the target protein, and then uses a co-evolution aggregator to measure the residue co-mutations. According to the acquired residue co-evolution information, the distance estimator estimates inter-residue distances. Finally, we transform the estimated distance distributions into a potential function, and then search for the structure conformation with the minimal potential. ProFOLD reports the structural conformation with sufficiently low potential as the final prediction result (TMscore: 0.84).

Back to article page