Figure 6 | Scientific Reports

Figure 6

From: Patterns of diverse gene functions in genomic neighborhoods predict gene function and phenotype

Figure 6

Overview of the neighborhood function profile (NFP) methodology to predict gene function. Location-based approaches are trained on pairwise COG/NOG distances of corresponding genes contained within genome of different prokaryotic and eukaryotic organisms. The obtained distances are used to create a similarity table to train the k-NN model and the association network to train the Gaussian Field Label Propagation approach. Functional neighbourhoods are used to create a normalized frequency matrix which is used to train the Random Forest of Predictive Clustering trees model. “COG” in the Figure is used to denote both COG and NOG. Target Hi denotes the sub-hierarchy of GO terms associated with COGi (sub-hierarchy contains information about the GO functions assigned to a COG and the parent-child relations between these GO functions).

Back to article page