Fig. 2: Example of a GuideSet container for gRNAs targeting KRAS using SpCas9.

a The blue box stores the genomic coordinates in GRCh38 to represent the target protospacer sequences using a GRanges object. By convention, we use the first nucleotide of the PAM sequence (in the 5′ to 3′ direction) as the representative genomic coordinate of protospacer sequences. The pink box stores sequence information of the protospacers and PAMs. The yellow box represents global metadata used for creating the GuideSet, including a formal CrisprNuclease object, the reference genome of the target protospacers, and the gene model used for annotation. The gray boxes are examples of optional gRNA-level metadata columns that store information about enzyme restriction sites, spacer sequence features such as GC content, and on- and off-target scores. The green boxes represent optional per-gRNA annotations for SNP overlap, on- and off-target alignments, and gene context; each annotation stores a detailed table (two dimensions) for each gRNA (the third dimension). b Selected annotations for gRNA 1, corresponding to the row highlighted in the green boxes of (a). c The first genomic track represents the four annotated protein-coding isoforms of the human gene KRAS in GRCh38 coordinates. The second track shows the four gRNAs shown in the blue box of (a).