Fig. 2: Phylogenetic analysis of Cas12k homologs.

The 106 complete Cas12k proteins identified were aligned using M-Coffee (refs. 74,75) and analyzed with BEAST (ref. 76). Twelve degenerate Cas12k proteins were deliberately excluded to avoid misinterpretation. Branches of the resulting tree are labeled with their posterior probability where it is ≥0.7. For easier recognition, each protein is labeled with its host organism (further details in Supplementary Data 1). The associated regulator type (AR) and the taxonomic order (O) of the host organism are color-coded. The Anabaena 7120 Cas12k (All3613; NCBI: BAB75312.1) is marked with a green dashed box. Four nodes whose branches all represent cas12k genes with similar associated regulator genes and host organisms of similar taxonomic order were collapsed and labeled with the number of cas12k genes they contain. The expanded phylogenetic tree is shown in Supplementary Fig. 2, and the multiple sequence alignment that served as source data for the phylogenetic analysis is available in Supplementary Data 4.