Figure 21: Protein evolution.
From: Initial sequencing and comparative analysis of the mouse genome

a, Cumulative histogram of KA/KS values for locally duplicated, paralogous mouse-specific gene clusters (black boxes) in comparison with mouse–human orthologues (red boxes). The mouse-specific paralogues are more likely to be under positive diversifying selection. b, Box plot of KA/KS values for different locally duplicated, paralogous mouse-specific gene clusters. Full descriptions are found in Table 15. The chromosome on which the clusters are found is indicated in brackets after the abbreviated cluster name. The KA/KS values for each sequence pair in the cluster was calculated from sequences aligned using ClustalW (see Supplementary Information). The red horizontal line represents the median and the box indicates the middle 67% of the data between the 16th and 83rd percentiles. All of the paralogous clusters have median KA/KS values that are higher than the mouse–human orthologue median KA/KS (0.115), and 22 out of 25 have values greater than the 83rd percentile orthologue KA/KS (0.275). The Cyp2d category includes KA/KS values calculated separately over two sequence-similar regions in the alignment.