Figure 2

A 40% representative sequence similarity network (SSN) at a threshold of e−57. (A) Nodes within each cluster contain a representative protein sequence of a collection of sequences that share 40% or more sequence identity. Clusters have been annotated and color-coded based on curated protein sequences from the AKR Database21. Grey nodes represent AKRs that have no functional annotation. Metabolic pathways associated with the function of AKR enzymes have been color-coded to cross-reference with each cluster and numerically labeled to designate each AKR family number. The nodes of AKRs possessing biochemical data on coenzyme specificity and crystal structures have also been accordingly labeled as outlined in the legend. (B) SSN generated at a threshold of e−67 depicting complete segregation of the AKR18 family from AKR6, AKR12 and AKR14 members. (C) Genome Neighborhood Diagram of DepBRleg. The GND was generated using the EFI-GNT server which depicts the coding sequence region of depBRleg along with putative functions of upstream and downstream genes.