Extended Data Fig. 1: Formation of the CRISPR-Cas Atlas. | Nature

Extended Data Fig. 1: Formation of the CRISPR-Cas Atlas.

From: Design of highly functional genome editors by modelling CRISPR–Cas sequences

Extended Data Fig. 1: Formation of the CRISPR-Cas Atlas.The alternative text for this image may have been generated using AI.

a) Pipeline for discovery and annotation of 1.25 M CRISPR-Cas operons from 26.2 Tbp of genome and metagenome assemblies. b) Summary of different entities across the CRISPR-Cas atlas. c) Distribution of operon lengths across CRISPR-Cas types. d) 238,913 Cas9 proteins were identified from Type II CRISPR-Cas operons and clustered using MMseqs2. e) Comparison of the number of unique Cas9 proteins compared to previously published datasets7,53,54,55,56,57. UniProtKB was queried in March 2024 using search term: gene = Cas9. f) Length of 64,734 unique Cas9 proteins from the CRISPR-Cas Atlas. g) Summary statistics across 64,734 CRISPR-Cas operons. h) Phylogenetic tree of 8,441 Cas9s clustered at 70% identity. Phylogenetic tree built using FastTree258 and visualized using iTOL59.

Back to article page