Main

Igh and Igk V(D)J recombination is orchestrated by the RAG1–RAG2 heterotetramer bound to a recombination centre (RC) formed around intronic enhancers and J segments of each locus1,12. Vs, Ds and Js are flanked by bona fide recombination signal sequences (RSSs), consisting of conserved CACAGTG heptamer, a 12-base pair (bp) or 23-bp spacer, and an AT-rich nonamer that target recombination activating gene (RAG) endonuclease activity1,12. RAG cleaves robustly only paired gene segments flanked by bona fide RSSs with, respectively, 12-bp and 23-bp spacers (12RSSs and 23RSSs), which must be properly aligned in parallel in the active sites of the two RAG1 proteins in the RAG complex13,14. Cohesin-mediated chromatin loop extrusion contributes to pairing widely separated Igh and Igk RSSs for V(D)J recombination3,14,15,16,17,18,19,20. For Igh, impeded downstream loop extrusion at the RC leads to continued extrusion of upstream chromatin through the impeded cohesin ring, allowing JH-23RSS-bound RAG to linearly scan the upstream 2.5 megabases (Mb) of chromatin for D-12RSSs and, ultimately, VH-23RSSs for assembly of complete VH(D)JH exons14,15,16,17,18,19,20. This ‘single loop-based’ linear scanning mechanism only recognizes complementary RSSs in convergent orientation15,18,19,20 (Supplementary Video 1 and Extended Data Fig. 1a). The mouse Igk has 4 Jκs with upstream-oriented 23RSSs and more than 100 Vκs with 12RSSs, which lie in three clusters of mostly downstream (deletional) oriented distal Vκs, mostly upstream (inversional) oriented middle Vκs and both downstream and upstream-oriented proximal Vκs2,3 (Fig. 1a). Vκs across the 3.1 Mb locus are robustly used by the Vκ-to-Jκ1 joining process that generates initial (primary) VκJκ1 repertoires4. Vκs in inversional orientation could not be joined by the single loop-based linear RAG-scanning mechanism used by Igh. Indeed, for primary Vκ to Jκ1 joining, cohesin-mediated loop extrusion allows a Jκ1-RC-bound RAG to linearly scan just 8 kilobases (kb) upstream to the Sis CTCF-binding elements (CBEs) of Cer/Sis diffusion platform3. Upstream Vκs are moved through a second cohesin ring impeded for downstream extrusion at the Cer CBEs just upstream of Sis3. This ‘two-loop-based’ mechanism allows Vκs to be presented to a RAG-bound Jκ1 RC by short-range diffusion, allowing proper alignment of both deletional and inversional-oriented RSSs for primary Vκ-to-Jκ1 joining3 (Supplementary Videos 2 and 3 and Extended Data Fig. 1b). To achieve diffusion-based primary joining, Igk Vs and Js evolved powerful RSSs3, compared with those of Igh that rely on linked extrusion impediments to enhance their activity3,14,16,18. Whereas secondary Vκ rearrangements to Jκs2, 4 and 5 generate a large proportion (roughly 40–60% based on single cell analysis in mouse B cells) of the Igκ repertoire4,8,9,10, the mechanism of this physiologically critical process has remained speculative3.

Fig. 1: Cer/Sis deletion converts primary rearrangements from short-range diffusion into linear scanning joining mechanism in vivo.
Fig. 1: Cer/Sis deletion converts primary rearrangements from short-range diffusion into linear scanning joining mechanism in vivo.
Full size image

a, Illustration of mouse Igk (not to scale). Relative location of distal (blue), middle (grey) and proximal Vκs (orange), CBE-based Cer and Sis, four functional Jκs, Igk enhancer (iEκ, 3′ Eκ, Ed) and 3′ RS are indicated. Deletional and inversional Vκs are flanked by 12RSSs (red triangles) and Jκs by 23RSSs (yellow triangles). b, Percentage use of individual Vκs in BM pre-B cells baiting from Jκ1. c, Percentage of pooled RAG off-target junctions in the Vκ region (left panel) and region between Cer and Jκ (Cer-to-Jκ, right panel) as measured with a Jκ1 bait. d, Percentage of inversional (red) and deletional (blue) cryptic RSS junctions within distal, middle and proximal Vκ locus regions, and Cer-to-Jκ region. e, Illustration of mouse Igk with Cer/Sis deletion (indicated by the red X). fh, Use percentage of individual Vκs (f), pooled RAG off-target junctions (g) and their quantification (h) baiting from Jκ1 in Cer/Sis-deleted mice as in bd. ik, Use percentage of individual Vκs (i), pooled RAG off-target junctions (j) and their quantification (k) baiting from Jκ5 in Cer/Sis-deleted mice as in bd. Relative Vκs-Jκ5 joining is significantly shifted to further upstream within a limited region compared with Vκs-Jκ1 joining in Cer/Sis-deleted mouse (two-sided Welch’s t-test, P = 0.0002). Bona fide and cryptic RSS junctions mapping to positive strand of mouse reference genome are in red and minus strand in blue. Data and error bars in b,d,f,h,i,k are presented as mean ± s.e.m. from three biological repeats. WT, wild type.

Source data

Secondary Vκ-to-Jκ joining occurs by RAG scanning

Primary bone marrow (BM) pre-B cells harbouring Cer/Sis deletions generate repertoires skewed towards proximal Vκs21,22. In this regard, Cer/Sis deletion may convert the primary Jκ1-based RC into using a mechanism that reflects that of secondary rearrangements. To test this notion, we used high-throughput genome-wide translocation sequencing (HTGTS)-V(D)J-seq23 to assess the VκJκ repertoire generated from bona fide Jκs in pre-B cells from wild-type mice (Fig. 1a) versus pre-B cells harbouring an ‘ectopic’ primary Igk RC resulting from Cer/Sis deletion (Fig. 1e). HTGTS-V(D)J-seq also detects linear RAG chromatin scanning activity by measuring low-level joins between bona fide Jκ RSSs and prey cryptic RSSs15,20. This assay reliably assesses scanning directionality, because direct RAG scanning overwhelmingly uses cryptic RSSs in convergent orientation with the bona fide RSS15,20. In normal pre-B cells, primary Jκ1 joins use Vκs across the locus in deletional and inversional orientation, whereas RAG scanning is largely terminated at Sis (Fig. 1a–d). In Cer/Sis-deleted pre-B cells, Jκ1 dominantly uses the very most proximal deletional-oriented Vκs, with the highest use of Vκ3-2 and a lesser extent Vκ3-7, which both have strong RSSs and associated scanning impediments (Fig. 1f and Supplementary Table 1). In these cells, RAG scanning extends through the proximal Vκs, although inversional Vκs in this region are barely used (Fig. 1f–h). To test whether Jκ1 remains dominant for primary rearrangements in the absence of Cer/Sis, we analysed Jκ5 rearrangements. Scanning from Jκ5 also primarily used proximal Vκs, but with relative use significantly shifted upstream to Vκ3-7, Vκ3-10, Vκ3-12 and Vκ3-17, along with, at lower levels, several inversional-oriented Vκs (Fig. 1i). Correspondingly, initiation of RAG scanning from Jκ5 also shifted upstream in the proximal Vκs and into the middle Vκs (Fig. 1j,k). The upstream scanning pattern shift for Jκ5 indicates that Jκ5 scanning initiates from proximal Vκ/Jκ1-RCs and proceeds linearly upstream for a limited distance (diagrammed in Extended Data Fig. 2a–d). Thus, Cer/Sis-deleted pre-B cells still use Jκ1 for primary rearrangements, but the mechanism is converted into one-loop-based linear scanning with limited upstream Vκ use, and lower-level use of inversional-oriented Vκs, all potentially reflecting aspects of the normal secondary rearrangement mechanism (Extended Data Fig. 1c).

To further investigate the mechanistic basis of the dominant usage of Vκ3-2 and Vκ3-7 on Cer/Sis deletion, we performed HTGTS-V(D)J-seq and 3C-HTGTS16 on wild-type and Cer/Sis-deleted Abelson murine leukaemia virus-transformed pro-B cell lines (‘v-Abl cells’). HTGTS-V(D)J-seq results in RAG-sufficient wild-type and Cer/Sis-deleted v-Abl lines were very similar to those in pre-B cells (Extended Data Fig. 3a,b,e). On Cer/Sis deletion, Vκ3-2 and Vκ3-7 were dominantly used, and also prominently interacted with the RC, as revealed by 3C-HTGTS in RAG-deficient wild-type and Cer/Sis-deleted v-Abl lines (Extended Data Fig. 3c,f), probably due to enhanced proximal Vκ transcription17,24. In addition, assay for transposase-accessible chromatin using sequencing (ATAC-seq) in these RAG-deficient lines revealed chromatin accessibility of these two highly used proximal Vκs substantially increased, whereas upstream Vκs, which were not used, maintained their varying levels of accessibility (Extended Data Fig. 3d,g). Together, these results indicate both transcription-mediated increased Vκ-RSS RC interactions17 and Vκ-RSS chromatin accessibility25,26,27,28,29 may synergistically contribute to the high usage of proximal Vκs during linear RAG scanning on Cer/Sis deletion.

To assay for RAG-scanning activity in the context of both primary and secondary Vκ-to-Jκ recombination in the normal Igk locus, we performed HTGTS-V(D)J-seq on DNA from BM pre-B cells isolated from wild-type mice, baiting in separate assays from each of the four functional Jκs (Fig. 2a). Vκ usage patterns appear roughly similar for the different Jκ baits (Fig. 2b–d, compare with Fig. 1b). RAG scanning from the primary Jκ1-RC is largely terminated at Sis (Fig. 1c,d). Most scanning from, Jκ2, 4 and 5 was distributed across the upstream Vκ locus with local orientations mostly corresponding to deletional and inversional Vκ orientations (Fig. 2e–j). However, we did observe a low level of impeded scanning activity from Jκ2 to Sis (Fig. 2e,h), suggesting that it may also weakly contribute to primary rearrangements. Given the well-established convergent restriction of cryptic RSS usage during scanning15,20, these findings indicate that RAG-scanning activity from Jκ2-based, 4-based or 5-based secondary RCs mostly occurs after deletion/displacement of Cer/Sis and creation of VκJκ1-based RCs across the Vκ locus through primary Vκ-Jκ1 joining (Extended Data Fig. 4, see legend for details). Specifically, large blocks of RAG-scanning activity mapping in ‘inversional’ orientation in the reference genome of the middle inversional Vκ domain (Fig. 2e–j) reflect linear RAG scanning from a secondary RC across a portion of the Vκ locus that was inverted by a primary inversional Vκ to Jκ1 recombination event in the middle Vκ domain (Extended Data Fig. 4a and Supplementary Videos 3 and 4). Similarly, predominantly deletional cryptic RSS junctions in the entire distal deletional Vκ domain (Fig. 2e–j) must represent secondary RC-initiated RAG scanning in a largely deleted locus generated by a primary deletional Vκ to Jκ1 recombination event in the distal Vκ domain (Extended Data Fig. 4b). However, RAG-scanning mapping in both orientations in the reference genome of proximal Vκ domain (Fig. 2e–j) can be explained by linear RAG scanning from secondary RCs generated by either primary inversional or deletional Vκ-to-Jκ1 recombination events in the proximal Vκ domain (Extended Data Fig. 4c,d). These findings indicate that limited linear scanning from the numerous secondary VκJκ1-based RCs established during primary joining collectively covers the entire locus. (Extended Data Fig. 4 and legend). Together, our findings provide clear in vivo evidence that Igk secondary joining is dominantly mediated by linear RAG scanning.

Fig. 2: Different mechanisms of Igk primary and secondary recombination in vivo.
Fig. 2: Different mechanisms of Igk primary and secondary recombination in vivo.
Full size image

a, Illustration of mouse Igk, as in Fig. 1a. bd, Percentage use of individual Vκs in BM pre-B cells baiting from Jκ2 (b), Jκ4 (c) and Jκ5 (d). eg, Percentage of pooled RAG off-target junctions in the Vκ region (left panel) and region between Cer and Jκ (Cer-to-Jκ, right panel) as measured with a Jκ2 (e), Jκ4 (f) and Jκ5 (g) in BM pre-B cells. hj, Percentage of inversional (red) and deletional (blue) cryptic RSS junctions within distal, middle and proximal Vκ locus regions, and Cer-to-Jκ region as measured with a Jκ2 (h), Jκ4 (i) and Jκ5 (j) in BM pre-B cells. Junction numbers are shown in each panel in this and other figures for comparison of absolute levels. Bona fide and cryptic RSS junctions mapping to positive strand of mouse reference genome are in red and minus strand in blue. Data and error bars in bd and hj are presented as mean ± s.e.m. from three biological repeats.

Source data

Receptor editing uses a restricted Vκ repertoire

To investigate secondary Vκ to Jκ joining in vivo, we generated mouse models for this purpose. We previously described an approach to generate iPS cells from splenic B cells30. By screening such iPS cells for physiological VκJκ rearrangements, we identified a ‘parental’ iPS cell line with a productive Vκ6-23/Jκ5 rearrangement on one Igk allele and a non-productive Vκ4-57/Jκ2 rearrangement on the other, with the latter leaving germline Jκ4 and Jκ5 downstream as potential secondary recombination targets (Fig. 3a). The Vκ4-57/Jκ2 non-productive allele actually involved an inversional primary Vκ19-93 to Jκ1 join, followed by a deletional secondary Vκ4-57 to Jκ2 join, leaving numerous originally inversional-oriented middle Vκs in deletional orientation directly upstream of Vκ4-57/Jκ2 and downstream Jκ4 and Jκ5 (Fig. 3a and Extended Data Fig. 5a; see legend for details). We used CRISPR–Cas9-mediated gene editing to mutate the rearranged Vκ6-23/Jκ5 allele into a non-productive rearrangement in a separate (‘edited’) iPS cell line (Extended Data Fig. 5b). Parental and edited iPS cell lines were used for RAG2-deficient blastocyst complementation31 to produce chimeric mice in which B and T lineage cells derive from parental or edited iPS cells (Extended Data Fig. 5b).

Fig. 3: Secondary Vκ-to-Jκ joining uses a restricted Vκ repertoire through linear RAG scanning.
Fig. 3: Secondary Vκ-to-Jκ joining uses a restricted Vκ repertoire through linear RAG scanning.
Full size image

a, Illustration of the Igk locus in B cells of parental iPS cell-derived chimeras (parental chimera; not to scale). One Igk allele contains only a functional Vκ6-23/Jκ5 rearrangement (pre-rearranged functional allele). The other Igk allele contains a non-functional Vκ4-57/Jκ2 rearrangement (pre-rearranged non-functional allele) and downstream germline Jκ4 and Jκ5 as potential secondary recombination targets. b,c, Absolute use level (b) and percentage (c) of individual Vκs, baiting from Jκ4 in DNA from parental chimera BM immature B cells. d, Illustration of the Igk locus in B cells of edited iPS cell-derived chimeras (edited chimera, not to scale). The functional Vκ6-23/Jκ5 rearrangement was mutated by CRISPR–Cas9 editing to be non-functional (pre-rearranged non-functional allele; see Methods for details). e,f, Absolute use level (e) and percentage (f) of individual Vκs baiting from Jκ4 in DNA from edited chimera BM immature B cells. Productive Vκs/Jκ4 joins are in blue and non-productive joins in red. The percentage of productive rearrangements in edited chimeras compared with that in parental chimeras was significantly increased (two-sided Welch’s t-test, P = 0.0011). Data and error bars are presented as mean ± s.e.m. from three biological repeats.

Source data

To assay for potential secondary rearrangements on the Vκ4-57/Jκ2 allele, we isolated BM immature B cells from parental and edited chimeras and performed HTGTS-V(D)J-seq baiting from Jκ4 (Extended Data Fig. 5b). Parental chimeras have a smaller pre-B cell compartment and a larger immature B cell compartment relative to that of edited chimeras, indicating that the pre-existing productive Igk rearrangement signals rapid transition from pre-B cells to the next developmental stage9,10,32 (Extended Data Fig. 5c,d and Supplementary Fig. 1). Even so, we detected very low-level secondary rearrangements in the Vκ4-57/Jκ2 allele in parental chimeras (Fig. 3b). By contrast, chimeras derived from the edited iPS cell line had an 18-fold higher secondary rearrangements level of the Vκ4-57/Jκ2 allele (Fig. 3d,e). Secondary rearrangements in both chimeras used (originally inversional-oriented) deletional-oriented Vκs that were proximal to the secondary RC (Fig. 3c,f), consistent with their use by linear RAG scanning. Few cryptic RSS junctions were recovered from either parental or edited chimeras, probably due to selection for productive rearrangements in vivo, especially with only one allele for rearrangement. Consistent with this notion, compared with parental chimeras with a pre-existing functional allele, edited chimeras were highly enriched for productive Vκs/Jκ4 rearrangements (Fig. 3c,f). These data indicate that Igk secondary rearrangements in vivo occur by means of linear RAG scanning and use a restricted repertoire of immediately upstream Vκs.

Strong RSSs dominate secondary deletional joining

The restricted Vκ repertoire used for secondary Vκ-to-Jκ joining in vivo may reflect either confinement of scanning by extrusion impediments or saturation of potential Vκ-to-Jκ rearrangements by robust use of immediately upstream Vκs during linear RAG scanning. G1-arrested v-Abl cells undergo Vκ-to-Jκ joining with Vκ usage patterns very similar to those of BM pre-B cells3,20, providing a genetically tractable system for in depth studies of the secondary Igk V(D)J recombination mechanism. To generate Igk pre-rearranged v-Abl lines poised for secondary rearrangements, we transiently expressed RAG2 in a RAG2-deficient, single Igk allele-containing v-Abl line3 in which we mutated the downstream 3′ RS to minimize potential extra-chromosomal rearrangement events (Extended Data Fig. 6a, see legend for details). The downstream 3′ RS has been reported to efficiently join to Vκs and iRS (intronic recombining sequence between Jκ5 and Cκ), which substantially contributes to Igk secondary rearrangements and is particularly critical for κ/λ light chain isotype exclusion4,10,33,34. From this modified v-Abl line, we isolated a pre-rearranged v-Abl line that had joined a middle inversional-oriented Vκ5-37 to Jκ1, which inverted the intervening Cer/Sis-containing sequences and placed nine originally inversional Vκs normally directly downstream of Vκ5-37 in deletional orientation immediately upstream of the secondary Vκ5-37/Jκ1-RC (Fig. 4a and Extended Data Fig. 6a). HTGTS-V(D)J-seq from a Jκ2-bait in this line revealed robust use of three of the five now-deletional orientation Vκs within the roughly 100 kb region upstream of the Vκ5-37/Jκ1-RC (Fig. 4b(ii)). RAG-scanning activity was detected predominantly across this same 100 kb region upstream of the secondary Vκ5-37/Jκ1-RC (Extended Data Fig. 6b,c). These findings confirm linear RAG scanning as the secondary Vκ-to-Jκ joining mechanism in this v-Abl line, again with a pattern restricted to immediately upstream Vκs (Fig. 3).

Fig. 4: Vκ transcription and strong Vκ RSSs support robust secondary deletional Vκ-to-Jκ joining.
Fig. 4: Vκ transcription and strong Vκ RSSs support robust secondary deletional Vκ-to-Jκ joining.
Full size image

a, (i) Illustration of pre-rearranged Vκ5-37/Jκ1 v-Abl line with a primary inversional Vκ5-37/Jκ1 rearrangement that inverted the interval containing Cer/Sis and Vκs, placing a stretch of originally inversional-oriented middle Vκs in deletional orientation directly upstream Jκ2,4,5. (ii) Blow-up of indicated region. (iii) The Vκ18-36 promoter was deleted by CRISPR–Cas9 editing. (iv) The promoter-deleted Vκ18-36-12RSS was replaced with a strong Vκ11-125 12RSS. b, (i–iv) Absolute level of Vκs-to-Jκ2 rearrangements in parental pre-rearranged Vκ5-37/Jκ1 line (ii), Vκ18-36 promoter deletion line (iii) and Vκ18-36 promoter deletion line with Vκ11-125 12RSS replacement (iv). Inversional Vκ-to-Jκ2 joins indicated in red and deletional joins in blue. Absolute use of Vκ18-36 is significantly decreased on its promoter deletion (two-sided Welch’s t-test, P = 2.87 × 10−6). c, (ii–iv) 3C-HTGTS profiles of indicated Vκ locus, baiting from iEκ, in DNA from RAG-deficient pre-rearranged Vκ5-37/Jκ1 line (ii), the Vκ18-36 promoter deletion line (iii) and Vκ18-36 promoter deletion line with Vκ11-125 12RSS replacement (iv). T represents transcription-based interaction peak, and C* represents the CBE-based interaction peak more than 500 bp away from the Vκ7-33 RSS16. Relative Vκ18-36-to-RC interaction is significantly decreased on its promoter deletion (two-sided Welch’s t-test, P = 0.0093). d, (ii, iii) GRO-seq nascent transcript profiles in the indicated Vκ locus from the RAG-deficient pre-rearranged Vκ5-37/Jκ1 line (ii) or the Vκ18-36 promoter deletion line (iii). Positive strand transcripts are in red and negative strand transcripts in blue. In bd, the five most proximal Vκs are highlighted (grey shading). Data and error bars in bd are presented as mean + s.e.m. from three biological repeats. chr., chromosome.

Source data

To elucidate the mechanisms that target high use of immediately upstream Vκs during secondary RC-initiated RAG scanning, we used the high-resolution 3C-HTGTS chromatin interaction assay16 to examine interactions of secondary Vκ5-37/Jκ1-RC with sequences across the upstream Igk locus. This study revealed a reproducible interaction peak of the secondary Vκ5-37/Jκ1-RC with the inverted Cer/Sis element 750 kb upstream at the distal end of the inverted Vκ domain; but, by far, the highest peaks of interaction were with two of the five most proximal Vκs (Vκ18-36 and Vκ8-34) (Fig. 4c(ii)). These two Vκs, along with the fifth most proximal Vκ (Vκ6-32), which had a lower, but still substantial, interaction (Fig. 4c(ii)), were by far the most highly used Vκs in this line (Fig. 4b(ii)). Notably, all three of these highly used Vκs were substantially transcribed (Fig. 4d(ii)). The two interspersed Vκs that were barely used were not transcribed (Fig. 4d(ii)) and did not have a closely associated RC interaction peak (Fig. 4c; see legend for details). Together these observations suggested that Vκ transcription may act as a physiological linear scanning impediment for Igk secondary recombination. To directly test this notion, we deleted the promoter of the highly used Vκ18-36 in the Vκ5-37/Jκ1 pre-rearranged v-Abl line, which markedly reduced Vκ18-36 rearrangement (Fig. 4b(iii)), as well as its transcription and interaction with the secondary RC (Fig. 4c,d(iii)). Notably, the Vκ18-36 promoter deletion also substantially reduced use of a closely linked cryptic RSS hotspot (Extended Data Fig. 6b,c). Vκ18-36 has a relatively weak Igk RSS based on recombination information content score35,36,37 (Supplementary Table 1) and, correspondingly, is barely used in primary diffusional joining3; yet, it was used robustly with the aid of the transcription-based impediment during linear RAG scanning in the secondary rearrangement process (Fig. 4b–d). These findings indicate that use of Vκs with weaker RSSs may be directly enhanced by associated transcriptional impediments that increase their association with the secondary RC and mediate RSS chromatin accessibility during the linear RAG-scanning-based Igk secondary recombination.

We further speculated that strong Vκ-RSSs may not require scanning impediments to function during loop extrusion-mediated RAG scanning. To test this notion, we replaced Vκ18-36 RSS with a strong RSS from Vκ11-125 (Supplementary Table 1) in the Vκ18-36 promoter-deleted line (Fig. 4a). Indeed, this strong Vκ RSS alone, in the absence of interaction with secondary RC, supported robust secondary deletional rearrangement through linear RAG scanning (Fig. 4b,c(iv)), indicating strong Igk RSSs promote RC-bound RAG-mediated secondary rearrangements during linear scanning. Of note, secondary Vκ-to-Jκ rearrangements during direct RAG scanning are probably saturated, as observed in other experiments described above, as secondary RC-based interactions, done in the absence of RAG, extend much farther than RAG scanning in the Vκ5-37/Jκ1 pre-rearranged v-Abl line (Fig. 4b,c and Extended Data Fig. 6b). We conclude that transcription-based scanning impediments and strong Vκ-RSSs cooperate to promote robust use of immediately upstream Vκs and, thereby, restrict available Vκ repertoire during linear RAG-scanning-based secondary Igk recombination. Although we cannot rule out a modest role for Vκ locus CBEs, we note that Vκ locus CBEs are less dense and less potent than VH locus CBEs3, particularly those of proximal VHs that have a direct role in linear scanning16 (Extended Data Fig. 1 legend).

Occurrence of secondary inversional Vκ-to-Jκ joining

Although most physiological primary deletional or inversional Vκ to Jκ1 joining events leave a stretch of Vκs in deletional orientation upstream of the secondary RC (Extended Data Fig. 4a,b), a subset of primary rearrangements place inversional Vκs immediately upstream of the secondary RC (Extended Data Fig. 4c,d). To test whether inversional secondary Vκ-to-Jκ joining occurs, as suggested from the Cer/Sis deletion studies (Fig. 1), we isolated a v-Abl line in which Jκ1 was rearranged to a proximal, deletional-oriented Vκ3-12, placing a stretch of inversional Vκs interspersed with 4 deletional Vκs upstream of the secondary RC (Fig. 5a and Extended Data Fig. 7a). We further deleted Jκ3, 4 and 5 and downstream 3′ RS from this line to abrogate confounding extra-chromosomal rearrangements (Extended Data Fig. 7a). In this pre-rearranged Vκ3-12/Jκ1 line, the deletional-oriented Vκ6-17 is the most robustly rearranged; but, at least three inversional Vκs, including Vκ8-24 about 300 kb upstream, are significantly used (Fig. 5b(ii)). HTGTS-V(D)J-seq analyses further revealed that linear RAG scanning also proceeds about 300 kb into the upstream Vκ region (Fig. 5c(ii)). In a second experiment, targeted inversion of immediately upstream deletional Vκs in a Vκ1-117/Jκ1 pre-rearranged v-Abl line also led to their relatively robust inversional secondary Vκ-to-Jκ joining during linear RAG scanning (Extended Data Fig. 7b–d). Together, these findings clearly demonstrate that, unlike Igh, in which inversional VH-to-DJH joining rarely occurs3,19,20, some Vκs undergo substantial inversional Vκ-to-Jκ rearrangements during linear RAG-scanning-mediated secondary Vκ to Jκ rearrangements.

Fig. 5: Igk secondary recombination supports inversional Vκ-to-Jκ rearrangements.
Fig. 5: Igk secondary recombination supports inversional Vκ-to-Jκ rearrangements.
Full size image

a, (i) Illustration of mouse Igk. (ii) Illustration of Igk pre-rearranged Vκ3-12/Jκ1 v-Abl cell line with a primary deletional Vκ3-12/Jκ1 rearrangement that deleted the region containing Cer/Sis and a portion of proximal Vκ domain, placing inversional and deletional Vκs upstream of the secondary Vκ3-12/Jκ1 RC. Jκ3-5 were deleted to eliminate extra-chromosomal joining events. (iii) The inversional Vκ6-13 12RSS was replaced with a strong Vκ11-125 12RSS to generate the Vκ11-125 RSS replacement line. (iv) The Jκ2 and its 12RSS were inverted in the pre-rearranged Vκ3-12/Jκ1 line to generate the Jκ2 inversion line. b, (i) Deletional (blue) and inversional (red) Vκs in indicated region are annotated. (ii–iv) Absolute use of individual Vκs, baiting from Jκ2 in DNA from parental pre-rearranged Vκ3-12/Jκ1 line (ii), Vκ11-125 RSS replacement line (iii) and Jκ2 inversion line (iv). Inversional Vκ-to-Jκ2 joins indicated in red and deletional joins in blue. c, (ii, iv) Absolute level of pooled RAG off-target junctions in indicated Vκ region (left panel) and RC region (right panel) in pre-rearranged Vκ3-12/Jκ1 line (ii) or Jκ2 inversion line (iv). d, (ii, iv) 3C-HTGTS profiles in indicated Vκ region (left panel) and RC region (right panel), baiting from iEκ in DNA from pre-rearranged Vκ3-12/Jκ1 line (ii) or Jκ2 inversion line (iv). Five highly used Vκs in bd are highlighted (grey shading), and one highly used cryptic RSS hotspot in c and d is highlighted (orange shadow). Data and error bars in b,d presented as mean + s.e.m. from three biological repeats.

Source data

To test the role of RSS strength in inversional secondary Vκ-to-Jκ rearrangements, we replaced the weak 12RSS of Vκ6-13 (Supplementary Table 1), an inversional Vκ that is roughly 60 kb upstream of secondary RC in the pre-rearranged Vκ3-12/Jκ1 v-Abl line, with strong Vκ-12RSSs or weak DH-12RSSs (Extended Data Fig. 8a and Supplementary Table 1). The strong Vκ12-44, Vκ11-125 and Vκ19-93 12RSSs all activated robust inversional Vκ6-13-Jκ2 joining during linear RAG scanning (Fig. 5b(iii) and Extended Data Fig. 8b–d); whereas the weak DFL16.1 and DQ52-upstream RSSs did not (Extended Data Fig. 8e,f). To further elucidate mechanisms promoting inverted Vκ use during linear RAG scanning, we inverted the Jκ2 and its associated RSS in the pre-rearranged Vκ3-12/Jκ1 line to generate the pre-rearranged Vκ3-12/Jκ1/Jκ2-inv line (Fig. 5a). As anticipated, the inverted Jκ2-RSS now directs RAG scanning downstream through the iEκ and Ed to the 3′ RS and 3′ CBE (Fig. 5c(iv) and Extended Data Fig. 9a). On Jκ2 inversion, deletional-oriented Vκ6-17 joined to the inverted Jκ2 through inversion at roughly 50% of the original level, whereas inversional Vκ8-19, Vκ8-21 and Vκ8-24, underwent pseudo-normal deletional joining38,39 to the inverted Jκ2 (Fig. 5b(iv) and Extended Data Fig. 9b, see legend for details). In this regard, pseudo-normal deletional Vκ-to-Jκ joining occurs robustly in Igk primary rearrangements in the single Jκ5-inverted v-Abl line by Cer/Sis-based short-range diffusion3 (Extended Data Fig. 9c). Given lack of detectable RAG scanning upstream of the RC in the pre-rearranged Vκ3-12/Jκ1/Jκ2-inv line (Fig. 5c(iv)), these findings indicate that both inversional and pseudo-normal deletional Vκ-to-Jκ rearrangements are probably mediated through loop-extrusion-based short-range diffusion to properly align their paired 12/23 strong RSSs in the secondary RC-bound RAG complex. Consistent with this notion, the secondary RC directly interacts with upstream highly used deletional and inversional Vκs regardless of RC-Jκ orientation (Fig. 5d).

Discussion

Igk uses a short-range diffusion-based primary Vκ-to-Jκ1 recombination mechanism to generate a diverse primary VκJκ1 repertoire that uses deletional and inversional-oriented Vκs across the 3.1-Mb-long Vκ locus3. We now report that secondary Vκ-to-Jκ2, 4 and 5 rearrangements use an Igh-like linear RAG-scanning mechanism (Extended Data Fig. 1c and Supplementary Video 4). Moreover, we find that linear scanning-based secondary rearrangements mostly use a restricted repertoire of immediately upstream blocks of (normally) deletional or inversional-oriented Vκs, with the latter placed in deletional orientation by primary rearrangements (Extended Data Fig. 1c and Supplementary Video 4). Both Vκ transcription-based scanning impediments and, particularly, strong Vκ RSSs enforce such restricted Vκ use by promoting rapid saturation of linear scanning-based secondary rearrangements. Yet, we find that the developing pre-B cell population uses essentially the entire Vκ repertoire for secondary rearrangements, achieving this feat because primary VκJκ1 rearrangements generate secondary RCs across the Vκ locus. Igk secondary rearrangements mediate the widely studied receptor editing process6,7,8,9,10, studies of which were mainly based on mouse models with knocked-in VκJκ rearrangements encoding autoreactive variable regions downstream of Cer/Sis11,40. In such models, a non-physiological ‘primary-like’ RC is created due to presence of Cer/Sis, unknown at the time these models were generated, which would promote sampling of the entire upstream Vκ repertoire through a short-range diffusion, primary joining-like mechanism3. B cell-derived iPS cell lines could, in principle, be used for in depth studies of receptor editing when armed with primary VκJκ exons encoding autoreactive or non-pairing Igκ chains and equipped with related or unrelated upstream Vκs. In this regard, Vκs with high sequence similarity are semi-clustered across the Vκ locus41, which theoretically could promote fine-tuning of the Igκ repertoire by replacing autoreactive Vκs with highly related Vκs harbouring de novo generated CDR3 antigen-binding regions, which might dampen autoreactivity while promoting pairing with the existing IgH chain. In contrast to the Igh, which strictly uses deletional-oriented VHs3,19,20, some inversional Vκs are used during linear scanning-mediated secondary Vκ-to-Jκ rearrangement. This finding implies that the RAG-bound secondary RC has elements that promote diffusion-based synapsis of inversional Vκs with Jκs during linear RAG scanning for joining mediated by their strong RSSs (Extended Data Fig. 1d). Finally, our findings implicate deletion and/or displacement of the Cer/Sis diffusion platform by primary rearrangements as a developmental switch that transforms Igk from using the two-loop-domain diffusion-based mechanism into using a one-loop-domain linear scanning-based mechanism (Extended Data Fig. 1 and Supplementary Videos 24).

Methods

Experimental procedures

We did not use statistical methods to predetermine sample size. We also did not randomize experiments. During experiments and outcome assessments, investigators were not blinded to allocation.

Mice

We purchased wild-type 129SV mice from Taconic Biosciences. We have previously described the generation and characterization of 129SV mice harbouring the Cer/Sis deletion22. Mouse work was performed in compliance with the relevant ethical regulations established by the Institutional Animal Care and Use Committee of Boston Children’s Hospital. The mouse work was performed under protocols approved by the Boston Children’s Hospital Institutional Animal Care and Use Committee. Mice were maintained on a 14-h light/10-h dark schedule in a temperature (22 ± 3 °C) and humidity (35% to roughly 70% ± 5%) controlled environment, with food and water provided ad libitum. Both male and female 4–8-week-old mice were used for experiments. Mouse experiments are not randomized and not blinded to investigators.

Generation of chimeras from splenic B cell-derived iPS cells

Our laboratory previously generated multiple iPS cell lines reprogramed from splenic B cells, which could contribute to high-grade chimerism in mice30. By screening these iPS cell lines using HTGTS-V(D)J-seq assays with baits for each of the functional mouse Jκ coding ends, we determined their immunoglobulin light chain rearrangements. We selected an iPS cell line harbouring, respectively, a productive Vκ6-23/Jκ5 rearrangement on one allele and a non-productive Vκ4-57/Jκ2 rearrangement that left Jκ4 and Jκ5 intact for potential Igk secondary recombination on the other allele. We denoted this line as ‘parental iPS cell’. The sequences of the Vκ6-23/Jκ5 and Vκ4-57/Jκ2 rearrangements were presented (Supplementary Table 2). We further used HTGTS-V(D)J-seq assay with a Jκ1 RSS-end bait to determine the retained primary Vκ19-93/Jκ1 RSS junction in the Vκ4-57/Jκ2 allele of the parental iPS cell (Supplementary Table 2). To enhance Igk secondary rearrangements, we further mutated the productive Vκ6-23/Jκ5 rearrangements by CRISPR–Cas9 editing in the parental iPS cell line using single-guide RNA 1 (sgRNA1) targeting the Vκ6-23/Jκ5 segment (edited iPS cell, Supplementary Table 3). The parental and edited iPS cell lines, which tested negative for mycoplasma, were injected, respectively, into RAG-deficient blastocysts to generate chimeric mice by means of RAG2-deficient blastocyst complementation as described in ref. 31.

Purification of BM precursor B cells and immature B cells

BM pre-B cells from wild-type or Cer/Sis-deleted 129SV mice were isolated as previously described in ref. 20. Briefly, single cell suspensions were derived from BM of 4–8-week-old wild-type or Cer/Sis-deleted 129SV mice. Then, the suspended cells were staining with anti-B220-APC (eBioscience, no. 17-0452-83, 1:1,000 dilution), anti-CD43-PE (BD Biosciences, 553271, 1:400 dilution) and anti-IgM-FITC (eBioscience, no. 11-5790-81, 1:500 dilution). Finally, B220+CD43lowIgM+ population was used to determine BM precursor B cells by means of fluorescence-activated cell sorting. As our iPS cell lines contain mixed genetic background of C57BL/6 and 129SV, we used a modified method for our iPS cell-derived chimeras that distinguishes BM progenitor B cells and precursor B cells in C57BL/6 mice19. Briefly, single cell suspensions were derived from BM of 4–8-week-old parental and edited chimeras. Then, the suspended cells were staining with anti-B220-BV711 (BioLegend, 103255, 1:300 dilution), anti-CD25-PE (BD Pharmingen, 561065, 1:300 dilution), anti-IgG1-FITC (BD Biosciences, 553443, 1:500 dilution), anti-IgM-APC (Invitrogen, 17-5790-82, 1:500 dilution), anti-CD19-BV421 (BD Biosciences, 562701, 1:300 dilution) and anti-c-Kit-PE/Cy7 (eBioscience, 25-1171-81, 1:300 dilution). Finally, B220+CD19+IgMIgG1CD25+ KIT and B220+ CD19+ IgM IgG1+ populations were used to determine BM precursor B cells and immature B cells, respectively, through fluorescence-activated cell sorting. For DNA extraction and HTGTS-V(D)J-seq assays, we could not get sufficient numbers of BM precursor B cells from the parental chimeras, due to the pre-existing functional Vκ6-23/Jκ5 rearrangement pushing cells through this developmental stage. Therefore, we used BM immature B cells from both the parental and edited chimeras for DNA extraction and HTGTS-V(D)J-seq assays using a Jκ4 coding bait.

Generation of Jκ1 pre-rearranged v-Abl cell lines and derivatives

To generate various Jκ1-pre-rearranged v-Abl cell lines, the previously established Rag2−/−; Eμ-Bcl2+ v-Abl cell line with single Igk allele was used as the initial v-Abl cell line3. We further mutated the downstream 3′ RS by CRISPR–Cas9 editing to minimize potential extra-chromosomal rearrangement events. RAG2 open reading frame-containing plasmids were transiently transfected into v-Abl cell lines by nucleofection. Subsequently STI-571 (Selleck, S2475) was added to the culture medium for 1 day to induce G1-arrest and transient RAG2 expression in the v-Abl cell line. STI-571 was removed by changing culture medium and the v-Abl cells were cultured for 2 more days before plating single clones in the 96-well plates at a density of roughly 0.3 multiplicity of infection. After culturing for 7 more days, half of cells from each single clone were used for PCR screening that used specific primers for, respectively, germline Jκ1, Jκ2, Jκ4 and Jκ5. Finally, pre-rearranged Vκ1-117/Jκ1, Vκ5-37/Jκ1 and Vκ3-12/Jκ1 v-Abl cell lines in which, respectively, Jκ1 was joined to the distal deletional Vκ1-117, the middle inversional Vκ5-37 and the proximal deletional Vκ3-12, were selected and used for subsequent experiments. The sequences of retained Vκ1-117/Jκ1, Vκ5-37/Jκ1 and Vκ3-12/Jκ1 coding junctions in these lines were presented (Supplementary Table 2).

To generate various derivatives of Jκ1-pre-rearranged v-Abl cell lines, plasmids that expressing Cas9 and sgRNAs were transfected into the v-Abl cells by nucleofection to introduce targeted genetic modifications as previously described3. Specifically, sgRNA2 and sgRNA3 that, respectively, target upstream and downstream of Vκ18-36 promoter were transfected into the pre-rearranged Vκ5-37/Jκ1 v-Abl cell line to generate its Vκ18-36 promoter-deleted derivative; sgRNA4 and sgRNA5 that, respectively, target upstream of Vκ9-129 and downstream of Vκ1-118 were transfected into the pre-rearranged Vκ1-117/Jκ1 v-Abl cell line to generate a Vκs-inverted derivative; sgRNA6 and sgRNA7 that, respectively, target upstream of Jκ3 and downstream of Jκ5 were transfected into the pre-rearranged Vκ3-12/Jκ1 v-Abl cell line to delete the DNA segment from upstream of Jκ3 through downstream of Jκ5; sgRNA8 and sgRNA9 that, respectively, target upstream and downstream of Jκ2 were transfected into the pre-rearranged Vκ3-12/Jκ1 v-Abl cell line to generate its Jκ2-inverted derivative. All candidate clones with desired genetic modifications were confirmed by Sanger sequencing and presented in Supplementary Table 2. Sequences of all sgRNAs mentioned in this section are listed in Supplementary Table 3. None of the v-Abl cell lines was tested for mycoplasma contamination.

Generation of Cer/Sis-deleted v-Abl cell lines

To generate Cer/Sis-deleted v-Abl cell lines, plasmids that expressing Cas9 and sgRNAs were transfected into the Rag2−/−; Eμ-Bcl2+ v-Abl cell line by nucleofection to introduce targeted genetic modification as previously described in refs. 3,18. Specifically, sgRNA12 and sgRNA13 that, respectively, target upstream of Cer element and downstream of Sis element were transfected into the Rag2−/−;Eμ-Bcl2+ v-Abl cell line to delete Cer/Sis elements. The junctional sequence confirming the Cer/Sis-deletion in this line is presented in Supplementary Table 2. Sequences of all sgRNAs mentioned in this section are listed in Supplementary Table 3.

RSS replacement experiments

RSS replacement experiments were performed using plasmids expressing Cas9 and sgRNA, and short single-stranded DNA oligonucleotides (ssODN) as donor templates for homology-directed DNA repair as previously described3. Briefly, plasmids expressing Cas9 and sgRNA, and ssODN were cotransfected into the Jκ1-pre-rearranged v-Abl cell lines. Specifically, sgRNA10 and ssODN1 were used to replace the Vκ18-36 12RSS with a Vκ11-125 12RSS in the pre-rearranged Vκ5-37/Jκ1 v-Abl cell line; sgRNA11 and ssODN2–6, respectively, were used to replace the Vκ6-13 12RSS with the 12RSSs from Vκ11-125, DFL16.1-upstream, DQ52-upstream, Vκ12-44 and Vκ19-93 in the pre-rearranged Vκ3-12/Jκ1 v-Abl cell line. All candidate clones with desired genetic modifications were confirmed by Sanger sequencing (Supplementary Table 2). Sequences of all sgRNAs and ssODNs mentioned in this section are listed in Supplementary Table 3.

ATAC-seq and data analyses

ATAC-seq libraries were prepared using the Zymo-Seq ATAC Library Kit (D5458) according to the manual protocol. Briefly, 50,000 living cells from G1-arrest wild-type and Cer/Sis-deleted RAG2-deficient v-Abl cells were used for cell lysis and nuclei isolation. The prepared nuclei pellet was incubated with transposition mix containing Tn5 transposase for DNA tagmentation and sequencing adaptor ligation. Finally, ATAC-seq libraries were prepared from adaptor-ligated DNA by PCR amplification and sequenced on Illumina NextSeq2000 using a paired-end 100-bp sequencing kit. ATAC-seq libraries were processed with a previously described pipeline42.

HTGTS-V(D)J-seq and data analyses

HTGTS-V(D)J-seq libraries were prepared as described3. Briefly, 2 μg of DNA from sorted BM pre-B cells from wild-type and Cer/Sis-deleted mice or BM immature B cells of iPS cell-derived chimeras were used for HTGTS-V(D)J-seq library construction using indicated bait primers listed in figures and figure legends. For RAG2-deficient Jκ1-pre-rearranged v-Abl cell lines, RAG2 was reconstituted by means of retroviral infection with the pMSCV-FLAG-RAG2-GFP vector. After adding signal transduction inhibitor to arrest v-Abl cells in the G1 stage for 4 days, v-Abl cells were collected for DNA purification. Then 10 μg of DNA from G1-arrested RAG-complemented v-Abl cells was used for preparing HTGTS-V(D)J-seq libraries using indicated bait primers. The final libraries were sequenced on Illumina NextSeq550 or NextSeq2000 using a paired-end 150-bp sequencing kit. HTGTS-V(D)J-seq libraries were processed with a previously described pipeline3. All HTGTS libraries were normalized to 500,000 total reads for comparison. Sequences of bait primers used are listed in Supplementary Table 3.

We developed a mouse Igk-specific cryptic RSS usage analysis pipeline. First, cryptic RSS junctions were extracted from HTGTS-V(D)J-seq data using a described pipeline that removes annotated and unannotated bona fide RSSs in the Igk locus3. Second, for each junction, we annotated deletion length from the RSS and insertion length excluding P-elements of bona fide non-productive Vκ to Jκ rearrangements (to avoid influences of selection) in mouse BM pre-B cells and v-Abl cell lines. This analysis revealed essentially no bona fide RSS-associated junctions with greater than 5 bp deletions in Jκ coding bait sequences or greater than 7 bp deletions in Vκ coding prey sequences, and only 1–2% with greater than 2 bp insertions between Vκ-to-Jκ junctions. Accordingly, we filtered out cryptic RSS junctions containing deletions greater than 5 bp in bait, greater than 7 bp deletions in prey, or greater than 2 bp junctional insertions. Third, to exclude misaligned reads, we filtered out cryptic RSS junctions containing more than 2 mutations within 20 bp of the cryptic RSS-associated surrogate coding prey sequences. The programs for this mouse Igk-specific cryptic RSS usage analysis along with annotated data on which it is based have been added to the GitHub repository (https://github.com/Yyx2626/HTGTS_related/tree/main/Igk_specific_anno_and_filter). We note that this pipeline should not be used for off-target analysis of cryptic RSS junctions in human pre-B cells because, unlike mouse pre-B cells, human pre-B cells express terminal deoxynucleotidyl transferase and add N-regions22,38.

3C-HTGTS and data analyses

3C-HTGTS libraries using an iEκ bait were prepared from the G1-arrested RAG2-deficient v-Abl cells as described in ref. 3. Final 3C-HTGTS libraries were sequenced on Illumina NextSeq550 or NextSeq2000 using a paired-end 150-bp sequencing kit, and were processed with a previously described pipeline3 (for details, see ‘Code availability’). All 3C-HTGTS libraries were normalized to 65,488 total junctions, which represents the smallest recovered number of junctions among the 3C-HTGTS libraries that were compared. Primer sequences used for generating 3C-HTGTS libraries are listed in Supplementary Table 3.

GRO-seq and data analyses

Global run-on sequencing (GRO-seq) libraries were prepared from the G1-arrested, RAG2-deficient v-Abl cell lines as described in ref. 20, sequenced using Illumina NextSeq550 or NextSeq2000 using paired-end 150-bp sequencing kits and processed with a previously described pipeline in ref. 20.

Quantification, statistics and reproducibility

All HTGTS-V(D)J-seq, 3C-HTGTS, ATAC-seq and GRO-seq experiments have been repeated three times independently with similar results. NextSeq550 control software (v.2.2.0) and NextSeq 1000/2000 control software (v.1.5.0.42699) were used for high-throughput sequencing data collection. Graphs were generated using GraphPad Prism 9, IGV (v.2.22.1) and R version 3.6.1. FlowJo (v.9.3.2) was used for analysing the fluorescence-activated cell sorting data. After normalization, HTGTS-V(D)J-seq, 3C-HTGTS, ATAC-seq and GRO-seq signals of three repeats were merged as average ± s.e.m. of the maximum value. An unpaired, two-sided Welch’s t-test was used to determine the statistical significance of difference between indicated samples, with P values presented in relevant figure legends.

Availability of materials

All plasmids, cell lines and mouse lines used in this study are available from the authors upon request.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.