Main

During thymic selection, thymocytes test their T cell receptor (TCR) on self-peptide major histocompatibility complex (pMHC) ligands presented by thymic antigen-presenting cells (APCs). ‘Weak’ TCR–pMHC interactions promote developmental progression and CD4+/CD8+ T cell lineage commitment (termed positive selection), whereas ‘strong’ signals trigger negative selection1. Although negative selection eliminates autoreactive T cells2, how positive selection shapes a ‘useful’ repertoire remains unclear3.

Traditionally, positive selection was thought to enforce self-MHC restriction. However, T cells recognizing specific pMHC ligands arise at similar frequencies regardless of the selecting MHC allele4, and the repertoire’s apparent self-MHC restriction may indirectly result from negative selection of overly MHC-reactive TCRs5. Increasing evidence also indicates that positive selection imprints T cell functionality, such as tuning of the inhibitory TCR rheostat CD5 (ref. 6). CD5 levels are thought to reflect the strength of the selecting TCR–pMHC interactions and have been linked to responsiveness to foreign antigens3,7,8, contribution to primary versus memory responses9 and differentiation potential into helper T (TH) cell subsets or regulatory T (Treg) cells10,11,12,13,14.

Positive selection relies on a single stromal cell type, cortical thymic epithelial cells (cTECs). This specialized role appears to stem from unique pathways of self-antigen handling and processing, likely generating a partially ‘private’ pMHC ligandome1,15. For MHC class I (MHCI), cTECs express ‘thymoproteasomes’ containing the β5t subunit, which is absent from other APCs. β5t deficiency profoundly affects thymic development of CD8+ T cells15. For MHC class II (MHCII), cTECs use autophagy-associated mechanisms for unconventional endogenous MHCII loading. Disruption of these pathways perturbs CD4+ T cell selection16,17,18. Mice lacking the thymus-specific serine protease PRSS16 exhibit impaired polyclonal CD4+ T cell responses and diminished positive selection of some transgenic TCRs19,20. Of the pathways implicated in shaping the cTEC pMHC class II (pMHCII) ligandome, the most profound reduction in the thymic CD4+ T cell population is caused by ablation of cathepsin L (CTSL)21. Cathepsins are a family of lysosomal proteases. CTSL is strongly expressed in cTECs but is barely present in other MHCII+ APCs, whereas cathepsin S exhibits the opposite expression pattern. Both enzymes serve dual functions in the MHCII pathway by degrading the MHCII-associated invariant chain (Ii) and processing antigens to generate peptides for MHCII loading22. Importantly, the Ii-degrading function of CTSL alone cannot explain its requirement for efficient CD4+ T cell selection23. Limited resolution and cell numbers have so far precluded a comprehensive characterization of the cTEC pMHCII ligandome beyond relatively abundant peptides24. However, analysis of pMHCII ligands in fibroblasts engineered to express CTSL or cathepsin S indicated qualitative and quantitative differences25, suggesting that CTSL-dependent ‘private’ pMHCII ligands on cTECs may be crucial for CD4+ T cell positive selection. However, the impact of CTSL on the diversity and functionality of the CD4+ T cell repertoire remains unclear.

Here, we analyzed the CD4+ T cell repertoire selected in the absence of CTSL and characterized the clonal composition and reactivity of residual CD4+ T cells specific for a prototypical foreign antigen. Using high-resolution repertoire sequencing and re-expression of selected clones in TCR transgenic mice, we found that CTSL shapes CD4+ T cell selection by promoting full repertoire diversity and fine-tuning CD4+ T cell functionality.

Results

CTSL deficiency impairs positive selection of CD4+ T cells

Full genomic deletion of Ctsl in Ctsl−/− mice has pleiotropic effects, most prominently alopecia and epithelial hyperplasia in the skin21, reflecting ‘nonimmune’ functions. To selectively delete Ctsl in TECs, we generated CtslΔTEC mice, which carry a conditional Ctsl allele and a Foxn1-cre transgene (Supplementary Fig. 1a,b). Compared to Ctsl+/+ mice, CtslΔTEC mice showed a reduction in CD4 single-positive (CD4SP) thymocytes, as described in Ctsl−/− mice21 (Fig. 1a,b and Extended Data Fig. 1a). The phenotypic segregation of CD4SP thymocytes into three consecutive maturation stages (CD69+MHCI semimature, CD69+MHCI+ mature 1 and CD69MHCI+ mature 2; SM, M1 and M2, respectively)26 was largely preserved, although there was a reduction in the most mature M2 stage (Extended Data Fig. 1b). The proportion of Foxp3+ Treg cells among CD4SP thymocytes was unchanged; however, there was a trend toward an increased proportion of CD73+CCR7 reimmigrants from the periphery (Extended Data Fig. 1c,d). Peripheral CD4+ T cell populations were diminished and contained more ‘memory-like’ Foxp3CD44+CD62L cells and Foxp3+ Treg cells (Fig. 1c and Extended Data Fig. 1e–g). CD8+ T cells developed in normal proportions, and the thymic architecture was indistinguishable from Ctsl+/+ mice (Fig. 1a,b and Supplementary Fig. 1e). CtslΔTEC mice did not show skin defects (Supplementary Fig. 1c,d).

Fig. 1: CTSL deficiency alters the cTEC pMHCII ligandome and impairs CD4+ T cell positive selection.
Fig. 1: CTSL deficiency alters the cTEC pMHCII ligandome and impairs CD4+ T cell positive selection.The alternative text for this image may have been generated using AI.
Full size image

a, Representative flow cytometry plots of thymocyte subsets in Ctsl+/+ (n = 8) and CtslΔTEC mice (n = 10); frequency ± s.e.m. of CD4SP thymocytes is indicated. b, Percentages ± s.e.m. of thymocyte subsets as in a; NS, not significant; DN, double negative. c, Representative flow plots of lymph node cells from Ctsl+/+ (n = 3) and CtslΔTEC (n = 3) mice; frequency ± s.e.m. of CD4+ T cells is indicated. d, Representative flow cytometry plots of MHCII expression on CD45EpCAM+Ly51+ cTECs or CD45EpCAM+Ly51CD80+ mature mTECs from Ctsl+/+ and CtslΔTEC mice (n = 20 each) and MHCII−/− mice (n = 3 and 2) as background. e, Representative flow plots and mean fluorescence intensity (MFI) ± s.e.m. relative to Ctsl+/+ cells of MHCII:CLIP on cTECs or mature mTECs from Ctsl+/+ (n = 5) and CtslΔTEC mice (n = 5) and MHCII−/− mice (n = 2) as background. f, Representative flow plots and MFI ± s.e.m. relative to Ctsl+/+ cells of MHCII:non-CLIP on cTECs or mature mTECs from Ctsl+/+ (n = 12) and CtslΔTEC mice (n = 13) and MHCII−/− mice as background. g, Representative flow cytometry plots and MFI ± s.e.m. relative to Ctsl+/+ cells of I-Ab:Eα52–68 on cTECs or mature mTECs from Ctsl+/+ (n = 4) and CtslΔTEC (n = 3) mice on the C57BL/6 × BALB/c F1 background and C57BL/6 (Eα) mice as background controls. h, CD4SP thymocyte percentages ± s.e.m. in Ctsl+/+ mice (n = 6 or 4) or CtslΔTEC mice (n = 5 or 4) reconstituted with BM from MHCII+/+ or MHCII−/− mice. i, CD4SP thymocyte percentages ± s.e.m. in Ctsl+/+ mice (n = 8 or 9) or CtslΔTEC mice (n = 4 or 3) on a wild-type or Ciitakd transgenic background. j, Representative flow cytometry plots of TCRβ and CD69 surface expression on DP cells from B2m−/−Ctsl+/+ (n = 9) and B2m−/−CtslΔTEC mice (n = 13); frequency ± s.e.m. of TCRβintCD69+ cells is shown (P < 0.001). k, Representative flow cytometry analysis and MFI ± s.e.m. relative to Ctsl+/+ cells of CD5 expression on CD4SP thymocytes from Ctsl+/+ and CtslΔTEC mice (n = 7 each). P values in b were determined by two-way analysis of variance (ANOVA) and Sidak’s test for multiple comparisons and in ek by Student’s two-tailed t-test.

Source data

Total MHCII on cTECs remained unchanged (Fig. 1d). However, the fraction of pMHCII ligands on CtslΔTEC cTECs that consisted of complexes with the invariant chain-derived peptide CLIP, as detected using the monoclonal antibody (mAb) 15G4, was increased, and, conversely, non-CLIP pMHCII ligands, as indicated by the mAb BP107, were reduced (Fig. 1e,f). Nevertheless, compared to H2-Ab1−/− (hereafter MHCII−/−) cTECs, non-CLIP pMHCII ligands still constituted a major fraction of the pMHCII ligandome in CtslΔTEC cTECs (Fig. 1f). For instance, cTECs from CtslΔTEC C57BL/6 × BALB/c F1 mice presented substantial amounts of the ‘frequent’ non-CLIP ligand I-Ab:Eα52–68, recognized by the mAb Y-Ae, albeit moderately less than Ctsl+/+ control mice (Fig. 1g). These effects were seen in cTECs but not medullary TECs (mTECs; Fig. 1e–g and Extended Data Fig. 1h,i), consistent with the differential expression of CTSL between cTECs and mTECs.

The diminished CD4SP thymocyte population in Ctsl−/− mice has been suggested to result, at least in part, from positive selection of an altered TCR repertoire that is hypersusceptible to negative selection23. To test whether interference with negative selection by hematopoietic APCs ‘rescued’ the CD4SP compartment, we reconstituted Ctsl+/+, CtslΔTEC or Ctsl−/− mice with H2-Ab1+/+ (hereafter MHCII+/+) or MHCII−/− bone marrow (BM). MHCII−/− → Ctsl+/+ chimeras harbored a significantly increased percentage of CD4SP thymocytes compared to MHCII+/+ → Ctsl+/+ controls (Fig. 1h), reflecting diminished negative selection27. By contrast, the CD4SP compartment of MHCII−/− → CtslΔTEC or MHCII−/− → Ctsl−/− chimeras was not, or only marginally, increased compared to the respective MHCII+/+ BM controls (Fig. 1h and Extended Data Fig. 1j). To interfere with negative selection by mTECs, we generated Ctsl+/+, CtslΔTEC and Ctsl−/− mice carrying a transgene (Ciitakd) that mediates tissue-specific knockdown of C2TA, a transcription factor that controls multiple MHCII pathway components, leading to reduced MHCII expression on mTECs28. The Ciitakd transgene resulted in a significantly increased CD4SP compartment in Ctsl+/+ mice28 but did not ‘rescue’ the CD4SP compartment in CtslΔTEC or Ctsl−/− mice (Fig. 1i and Extended Data Fig. 1k).

The proportion of TCRβ+CD69+ cells among CD4+CD8+ (double-positive; DP) thymocytes (representing signal-selection intermediates) was reported to be normal in Ctsl−/− mice23; however, TCRβ+CD69+ DP cells also include CD8+ T cell lineage selection intermediates that engage pMHCI ligands. To exclude these, we generated MHCI-deficient B2m−/−CtslΔTEC mice, which recapitulated the reduced CD4SP compartment associated with CTSL deficiency (Extended Data Fig. 1l). In these mice, where positively selecting interactions could be exclusively attributed to pMHCII ligands, the proportion of ‘signaled’ TCRβ+CD69+ DP cells was reduced to about half that of B2m−/−Ctsl+/+ controls (Fig. 1j). CD5 expression on these DP cells (Extended Data Fig. 1m) and bulk CtslΔTEC CD4SP cells was lower than on their counterparts in Ctsl+/+ mice (Fig. 1k), suggesting that positive selection occurred through relatively weak interactions or nonselection of ‘natural’ CD5hi clones. Together, these findings indicate that the contraction of the CD4+ T cell compartment in CTSL-deficient mice was not secondary to selection of an altered repertoire that was overly susceptible to negative selection but most likely reflected a bona fide numerical constraint in positive selection as a consequence of an altered cTEC pMHCII ligandome.

CTSL deficiency causes ‘clonal holes’ and ‘newcomers’

We next addressed whether the bottleneck in positive selection in CtslΔTEC mice was TCR selective, leading to the disappearance of some clones while allowing others to persist within the repertoire. Across seven transgenic MHCII-restricted TCRs with diverse antigen specificities (OT-II (chicken ovalbumin), Dep (human C-reactive protein), AND and AD10 (pigeon cytochrome c), LLO56 and LLO118 (Listeria monocytogenes listeriolysin O; LLO) and PLP1 (myelin proteolipid protein)), all exhibited a profound blockade in the emergence of CD4SP thymocytes in CtslΔTEC mice, whereas the MHCI-restricted OT-I TCR was efficiently selected (Fig. 2a,b and Extended Data Fig. 2a–e). For each MHCII-restricted TCR transgene, CD5 expression on DP thymocytes, which is upregulated concomitant with positive selection, was substantially reduced in CtslΔTEC mice (Fig. 2c and Extended Data Fig. 2b), suggesting that these TCRs did not, or did not sufficiently, interact with pMHCII ligands to elicit positive selection. When normally selected in Ctsl+/+ mice, OT-II, AND, Dep or AD10 CD4SP thymocytes each displayed distinct CD5 levels, which varied widely between these clones and spanned the entire range of CD5 expression observed in polyclonal CD4SP cells (Fig. 2d). Thus, nonselection in CtslΔTEC mice was not tied to specific CD5 characteristics and, by inference, was not confined to a particular window in the affinity range of positively selecting TCR–pMHC interactions.

Fig. 2: CTSL is essential for positive selection of multiple MHCII-restricted transgenic TCRs.
Fig. 2: CTSL is essential for positive selection of multiple MHCII-restricted transgenic TCRs.The alternative text for this image may have been generated using AI.
Full size image

a, Representative flow cytometry plots of thymocyte subsets and percentages ± s.e.m. of CD4SP cells in Ctsl+/+ and CtslΔTEC mice reconstituted with BM from Rag1−/−OT-II (Ctsl+/+, n = 9; CtslΔTEC, n = 7), Rag1−/−AND (Ctsl+/+, n = 8; CtslΔTEC, n = 6), Rag1−/−Dep (Ctsl+/+, n = 9; CtslΔTEC, n = 7) or Rag1−/−AD10 (Ctsl+/+, n = 9; CtslΔTEC, n = 8) TCR transgenic donors. b, Representative flow cytometry plots of the thymus and percentages ± s.e.m. of CD8SP cells in Ctsl+/+ and CtslΔTEC mice reconstituted with BM from OT-I TgRag1−/− donors (n = 5 each). c, Representative flow cytometry plots and MFI ± s.e.m. of CD5 expression in DP thymocytes from BM chimeras as in a, relative to cells selected in Ctsl+/+ chimeras; WT, wild-type. d, Representative flow cytometry plots and MFI ± s.e.m. of CD5 expression in CD4SP thymocytes from Rag1−/−Ctsl+/+ AD10 (n = 3), Rag1−/−Ctsl+/+ Dep (n = 5), Rag1−/−Ctsl+/+ AND (n = 4) and Rag1−/−Ctsl+/+ OT-II (n = 4) TCR transgenic mice relative to polyclonal CD4SP thymocytes (n = 3). P values in a and b were determined by Student’s two-tailed t-test and in c by Welch’s two-tailed t-test.

Source data

To globally characterize TCR repertoire perturbations caused by CTSL deficiency, we crossed CtslΔTEC mice with mice expressing a transgenic TCRβ chain (hereafter TcrbFixed), enabling high-throughput sequencing of variable TCRα chains paired with the ‘fixed’ β-chain. TcrbFixedCtslΔTEC mice had fewer CD4SP thymocytes and peripheral CD4+ T cells (Fig. 3a and Extended Data Fig. 3a–c), with reduced CD5 expression on CD4SP cells (Fig. 3b) compared to TcrbFixedCtsl+/+ mice. Tcra sequencing was performed on CD4SP cells in the most mature M2 stage26, ensuring that the repertoire was fully shaped by thymic selection. Sampling depth approached saturation for both genotypes (Fig. 3c). Repertoire diversity, as assessed using the Shannon index, was significantly lower in TcrbFixedCtslΔTEC mice than in TcrbFixedCtsl+/+ mice (Fig. 3d). Based on the Morisita–Horn index, repertoires were highly stereotypical between genotype-matched replicates but varied significantly between genotypes (Fig. 3e). Of 9,626 ‘recurrent’ TCRs (defined as TCRs found in three or more of all six samples (three TcrbFixedCtslΔTEC and three TcrbFixedCtsl+/+)), 4,765 were shared between the two genotypes (Fig. 3f). Almost half (3,848 of 8,613) of all recurrent TCRs in the TcrbFixedCtsl+/+ repertoire were entirely absent from the TcrbFixedCtslΔTEC repertoire (Fig. 3f), and these ‘CTSL-dependent’ clones disproportionately contributed to the diversity of the ‘normal’ repertoire (Fig. 3g). Conversely, about 20% (1,013 of 5,778) of clones in the TcrbFixedCtslΔTEC repertoire were not found in the ‘normal’ TcrbFixedCtsl+/+ repertoire (Fig. 3f). These ‘newcomer’ TCRs displayed a bias toward more distal TCRα variable (V) and joining (J) elements and increased nucleotide additions or deletions at the V–J joint (Extended Data Fig. 3d–f), suggesting a selection bias for unusual TCR features. Thus, the loss of TCRs in the absence of CTSL was highly selective, affecting roughly half of the ‘normal’ TCR repertoire, whereas a similarly large array of seemingly CTSL-independent TCRs was retained.

Fig. 3: CTSL deficiency results in nonselection of ~50% of TCR clonotypes and enables emergence of ‘newcomer’ TCRs.
Fig. 3: CTSL deficiency results in nonselection of ~50% of TCR clonotypes and enables emergence of ‘newcomer’ TCRs.The alternative text for this image may have been generated using AI.
Full size image

a, Representative flow cytometry of thymocyte subsets and frequency ± s.e.m. of CD4SP thymocytes in TcrbFixedCtsl+/+ (n = 6) and TcrbFixedCtslΔTEC mice (n = 7). b, Representative flow cytometry analysis and MFI ± s.e.m. of CD5 expression on CD4SP cells from TcrbFixedCtsl+/+ (n = 3) and TcrbFixedCtslΔTEC mice (n = 4), relative to TcrbFixedCtsl+/+ samples. c, Analysis of sequencing depth by simulation of Shannon diversity as a function of the number of Tcra reads in bulk TCR-sequencing datasets generated with CD4+CD8αCD69MHCI+CD25FoxP3 M2 CD4SP cells from TcrbFixedCtsl+/+ (n = 3 with cells pooled from two to three mice) and TcrbFixedCtslΔTEC mice (n = 3 with cells pooled from two to three mice). All mice were on a Tcra+/−Foxp3GFP background to exclude dual TCR expression and enable exclusion of Foxp3+ cells. d, Shannon diversity analysis (mean ± s.e.m.) of bulk TCR-sequencing datasets as in c. e, Repertoire similarity comparison by Morisita–Horn index (mean ± s.e.m.) for all pairwise comparisons between TCRα datasets from TcrbFixedCtsl+/+ and TcrbFixedCtslΔTEC mice as in c (n = 3 for TcrbFixedCtsl+/+ versus TcrbFixedCtsl+/+; n = 9 for TcrbFixedCtsl+/+ versus TcrbFixedCtslΔTEC; n = 3 for TcrbFixedCtslΔTEC versus TcrbFixedCtslΔTEC). f, Scatter plot of the mean frequency of ‘recurrent’ TCRs in the TcrbFixedCtsl+/+ versus TcrbFixedCtslΔTEC repertoire as in c. Recurrent TCRs (n = 9,626) were defined as clonotypes found in three or more of all six samples regardless of genotype. TCRs exclusively found in Ctsl+/+ samples are highlighted in blue (CTSL-dependent TCRs; n = 3,848), and TCRs exclusively found CtslΔTEC samples are highlighted in red (‘newcomer TCRs’; n = 1,013); ND, not detected. g, Shannon diversity analysis (mean ± s.e.m.) of the CTSL-dependent (blue in f) or CTSL-independent (gray in f) subrepertoires within the TcrbFixedCtsl+/+ repertoire. P values in a, b, d and g were determined by Student’s two-tailed t-test and in e by one-way ANOVA with a Tukey’s test for multiple comparisons.

Source data

CTSL deficiency blunts CD4+ T cell responses

Having identified ‘clonal holes’ with various TCR transgenes and in the TcrbFixed repertoire, we asked whether CTSL deficiency caused corresponding ‘antigenic holes’ in a fully polyclonal setting. MHCII tetramer (Tet) staining revealed a marked reduction (or near absence) of cells recognizing the epitopes 2W, human invariant chain residues 277–285 (huCLIP) and PLP11–19 in CtslΔTEC mice (Fig. 4a,b). However, numbers of LLO190–201:I-Ab-specific CD4+ T cells were comparable between CtslΔTEC and Ctsl+/+ mice, both among peripheral CD4+ T cells and thymic CD4SP cells (Fig. 4c and Extended Data Fig. 4). Given the approximately fourfold reduction in total CD4+ T cell counts in CtslΔTEC mice (Extended Data Fig. 1e), the frequency of LLO-specific cells was thus even increased by a corresponding factor. To assess the functionality of these cells, we immunized mice with LLO peptide in adjuvant. At day 7 after challenge, ~1 × 104 LLO-Tet+ T cells were detectable in the draining lymph nodes of Ctsl+/+ mice (Fig. 4d), reflecting a >20-fold expansion following antigen exposure. By contrast, despite higher precursor frequencies, LLO-Tet+ cells were approximately tenfold less abundant in CtslΔTEC mice (Fig. 4d).

Fig. 4: CTSL deficiency imparts ‘antigenic holes’ in the repertoire and blunts CD4+ T cell responsiveness to immunization.
Fig. 4: CTSL deficiency imparts ‘antigenic holes’ in the repertoire and blunts CD4+ T cell responsiveness to immunization.The alternative text for this image may have been generated using AI.
Full size image

a, Total number ± s.e.m. of 2W- or huCLIP-specific CD4+ T cells in the naive repertoire of Ctsl+/+ (n = 3 for 2W; n = 8 for huCLIP) and CtslΔTEC mice (n = 3 for 2W; n = 8 for huCLIP), quantified by flow cytometric analysis after magnetic enrichment from pooled spleen and lymph node (LN) cells using 2W:I-Ab and huCLIP:I-Ab Tets, respectively. b, Total number ± s.e.m. of PLP11–19-specific CD4+ T cells in the naive repertoire of Ctsl+/+Plp1−/− (n = 4) and CtslΔTECPlp1−/− mice (n = 4), quantified as in a using a PLP11–19:I-Ab Tet. c, Representative flow cytometry plots of LLO190–201-specific CD4+ T cells gated on CD11bCD11cB220F4/80CD4+ T cells after magnetic enrichment from pooled spleen and lymph node cells using the LLO Tet and total number ± s.e.m. of LLO-Tet+ cells quantified as in a in the naive repertoire of Ctsl+/+ (n = 16) and CtslΔTEC mice (n = 15). d, Representative flow cytometry plots gated on activated CD11bCD11cB220F4/80CD44+CD4+ T cells without magnetic enrichment and total number ± s.e.m. of LLO190–201-specific CD4+ T cells in pooled spleen and lymph node cells of Ctsl+/+ (n = 19) and CtslΔTEC mice (n = 16) at day 7 after subcutaneous peptide immunization with adjuvant. e, Representative flow cytometry plot of CD45.1 versus CD45.2 on LLO-Tet and LLO-Tet⁺ spleen cells in CD45.1/CD45.2 wild-type recipients (n = 4) of a 1:1 mixture of bulk M2 SP cells from CD45.1 Ctsl+/+ and CD45.2 CtslΔTEC donors at day 7 after i.v. challenge with LLO peptide plus poly(I:C) pregated on activated CD11bCD11cB220F4/80CD44+CD4+ T cells as in d. f, Ratio between CD45.2+CtslΔTEC and CD45.1+Ctsl+/+ donor-derived cells among LLO-Tet and LLO-Tet⁺ in the input population (same 1:1 M2 CD4SP cell suspension administered to n = 4 recipient mice) and at day 7 after immunization (mean ± s.e.m.), as in e. Data in e and f are representative of two experiments. P values in ad were determined by Student’s two-tailed t-test.

Source data

We performed adoptive co-transfer experiments to determine whether the diminished expansion of LLO-Tet+ cells in CtslΔTEC mice reflected a cell-intrinsic defect or an indirect consequence of lymphopenia. Bulk M2 thymocytes from CD45.2 CtslΔTEC and CD45.1 Ctsl+/+ donors were co-transferred at a 1:1 ratio (thereby establishing a ~4:1 ratio of CtslΔTEC to Ctsl+/+ LLO-Tet+ cells in the input population) into CD45.1/CD45.2 Ctsl+/+ recipients that had been intravenously (i.v.) immunized with LLO peptide plus polyinosinic–polycytidylic acid (poly(I:C)) 6 h before. At day 7 after challenge, Ctsl+/+CD45.1+ donor-derived cells accounted for 2.6 ± 0.3% of splenic LLO-Tet+ cells, whereas CtslΔTECCD45.2+ donor-derived cells contributed only marginally (0.006 ± 0.004%; Fig. 4e). This corresponded to a ratio of CtslΔTEC to Ctsl+/+ LLO-Tet+ cells of ~1:400 (Fig. 4f), indicating a marked competitive disadvantage in antigen-driven expansion and/or defective homeostatic maintenance of cells selected in CtslΔTEC donors. Thus, positive selection in CtslΔTEC mice not only created ‘antigenic gaps’ but also resulted in impaired expansion and/or persistence of retained cells following antigen encounter.

Nonselection affects clones across the CD5 range

Polyclonal ‘natural’ CD5hi clones have been reported to respond more robustly to immunization than CD5lo clones8, suggesting a direct link between the modalities of positive selection and responsiveness to foreign antigens3,8. To assess whether the nonselection of CTSL-dependent clones in the CtslΔTEC thymus correlated with their CD5 level, we generated TCR inventories from sorted TcrbFixedCtsl+/+ CD4SP cells at both extremes of the CD5 spectrum (Fig. 5a). The TCR compositions were remarkably stereotypic within the four replicates of CD5lo or CD5hi CD4SP cells, respectively, yet highly distinct between the two groups, as evidenced by Morisita–Horn comparisons (Fig. 5b). This supports the notion that partitioning of a given clone into the CD5lo or CD5hi subset of the CD4+ T cell compartment is not stochastic but specified by TCR identity. We classified TCRs found in three or more of four CD5lo samples and absent from all four CD5hi datasets as ‘natural CD5lo TCRs’ and those exhibiting a reciprocal pattern as ‘natural CD5hi TCRs’. Cross-comparison with the recurrent TCRs in our previously established TcrbFixedCtsl+/+ versus TcrbFixedCtslΔTEC datasets revealed that 70% of the natural CD5lo TCR clones and 43.5% of the natural CD5hi TCR clones were not selected in the absence of CTSL compared to a loss of 44.7% across all TCRs (Fig. 5c). Thus, at the global repertoire level, nonselection in the absence of CTSL was more pronounced among the natural CD5lo subrepertoire, yet affected clones across the entire spectrum of natural CD5 expression.

Fig. 5: CTSL deficiency impedes selection of clones at both extremes of the CD5 spectrum but retains a substantial proportion of nominal ‘good-responder’ TCRs.
Fig. 5: CTSL deficiency impedes selection of clones at both extremes of the CD5 spectrum but retains a substantial proportion of nominal ‘good-responder’ TCRs.The alternative text for this image may have been generated using AI.
Full size image

a, Representative flow cytometry plot of CD5 expression on total presort M2 CD4SP cells, postsort ‘natural’ CD5lo cells and postsort CD5hi cells from TcrbFixedCtsl+/+ mice, with sorting gates marking ~15% of cells at the low and high extremes of the CD5 spectrum. Histograms of postsort CD5lo and CD5hi cells show before bulk Tcra sequencing (n = 4 each). b, Subrepertoire similarity comparison by Morisita–Horn index (mean ± s.e.m.) for all pairwise comparisons between Tcra datasets from CD5lo and CD5hi M2 CD4SP cells as in a (n = 6 for CD5lo versus CD5lo; n = 16 for CD5lo versus CD5hi; n = 6 for CD5hi versus CD5hi). c, Pie charts showing the proportion of CTSL-dependent and CTSL-independent TCRs among 8,613 recurrent TCRs in the TcrbFixedCtsl+/+ CD4SP repertoire (top) and among 1,402 natural CD5lo TCRs (bottom left) and 1,767 natural CD5hi TCRs (bottom right). Top, colored segments in the outer ring indicate the subset of TCRs cross-assigned to the natural CD5lo or natural CD5hi subrepertoires. d, Relative percentages ± s.e.m. of the ten most abundant clonotypes among expanded LLO-Tet+CD4+ T cells sorted from the spleens of TcrbFixedCtsl+/+ mice at day 7 after i.v. challenge with LLO peptide plus poly(I:C), as quantified by bulk Tcra sequencing (n = 4) (left). The pie chart shows the relative contribution of the ten most abundant clonotypes among expanded LLO-Tet+CD4+ T cells in LLO-immunized TcrbFixedCtsl+/+ mice (middle). Colored segments in the outer ring indicate cross-assignment of these clones to CTSL-dependent and CTSL-independent TCRs among recurrent TCRs in the TcrbFixedCtsl+/+ CD4SP repertoire. Percentages indicate the proportion of Tcra reads (top) or clones (bottom, in parentheses). P values in b were determined by one-way ANOVA with a Tukey’s test for multiple comparisons.

Source data

We next assessed whether the diminished LLO-specific response in CtslΔTEC mice reflected a CTSL dependency of ‘good-responder’ TCR clones within the ‘normal’ repertoire. To this end, we i.v. immunized TcrbFixedCtsl+/+ mice with LLO and performed Tcra sequencing on the expanded LLO-Tet+ population at day 7 after challenge, identifying the top ten expanded clonotypes (Fig. 5d). These clonotypes were cross-referenced with our previously established inventories of CTSL-dependent and CTSL-independent TCRs in the naive repertoire of TcrbFixedCtsl+/+ mice, revealing that five TCRs could be assigned to the CTSL-dependent category and five to the CTSL-independent category (Fig. 5d). Thus, a significant proportion of ‘good-responder’ TCRs were retained in the CtslΔTEC repertoire, suggesting that the blunted response of LLO-Tet+ cells in CtslΔTEC mice could not be explained solely by the physical absence of all such clones.

CTSL specifies selection signals in CTSL-independent clones

To assess whether and which LLO-Tet+ clonotypes were retained in the absence of CTSL in a fully TCRαβ polyclonal setting, we performed single-cell TCR sequencing of LLO-Tet+ cells sorted from Ctsl+/+ and CtslΔTEC mice, yielding 316 and 156 paired TCRαβ clonotypes, respectively (Fig. 6a). Most of these were detected only once within each genotype; however, five TCRs were shared between Ctsl+/+ and CtslΔTEC mice (Fig. 6a). To explore the characteristics of such CTSL-independent CD4+ T cell clones, we generated two TCR transgenic mouse lines, hereafter Lm54Tg and Lm6Tg. Lm54Tg or Lm6Tg mice gave rise to CD4SP cells on both the Rag1−/−Ctsl+/+ and Rag1−/−CtslΔTEC backgrounds (Fig. 6b,c). However, CD4SP cells were reduced for both TCRs on the CtslΔTEC background (markedly for Lm6 cells and subtly for Lm54 cells; Fig. 6b,c and Extended Data Fig. 5a,b), with a corresponding reduction in peripheral CD4+ T cell numbers (Fig. 6d,e), indicating a graded impairment in positive selection in the absence of CTSL, even for these apparently CTSL-independent TCRs.

Fig. 6: LLO-specific TCRs retained in the absence of CTSL indicate a role for CTSL in calibrating positive selection signal strength.
Fig. 6: LLO-specific TCRs retained in the absence of CTSL indicate a role for CTSL in calibrating positive selection signal strength.The alternative text for this image may have been generated using AI.
Full size image

a, Pie charts representing LLO-specific clonotypes in the fully polyclonal repertoire of Tcra+/−Ctsl+/+ mice (n = 316 TCRs) and Tcra+/−CtslΔTEC mice (n = 156 TCRs), identified by single-cell Tcra and Tcrb sequencing of sorted LLO-Tet+ cells (n ≥ 42 mice per genotype; clonotypes aggregated across 19 experiments). Segments in shades of gray represent TCRs exclusively found in one genotype; colored segments represent ‘public’ TCRs shared between genotypes. Segment size is proportional to the number of mice in which a given TCR was detected. b, Representative flow cytometry plots and percent ± s.e.m. of thymocyte subsets in Ctsl+/+Lm54Tg (n = 3) and CtslΔTECLm54Tg mice (n = 4; hereafter CtslΔTECLm54Tg and Ctsl+/+Lm54Tg, respectively). c, Representative flow cytometry plots and percent ± s.e.m. of thymocyte subsets in Rag1−/−Ctsl+/+Lm6Tg (n = 7) and Rag1−/−CtslΔTECLm6Tg mice (n = 9; hereafter CtslΔTECLm6Tg and Ctsl+/+Lm6Tg, respectively). d, Representative flow cytometry plots and number ± s.e.m. of lymph node CD4+ T cells in Ctsl+/+Lm54Tg (n = 4) and CtslΔTECLm54Tg mice (n = 5). e, Representative flow cytometry plots and number ± s.e.m. of lymph node CD4+ T cells in Ctsl+/+Lm6Tg (n = 7) and CtslΔTECLm6Tg mice (n = 9). f, Representative flow cytometry plots of MHCI and CD69 surface expression on CD4SP cells from Ctsl+/+Lm54Tg (n = 9) and CtslΔTECLm54Tg mice (n = 10). The percent ± s.e.m. of SM, M1 and M2 cells is indicated. g, Nur77 and surface CD5 expression (MFI ± s.e.m.) at consecutive DP and CD4SP stages of differentiation in Ctsl+/+Lm54Tg (n = 4 or 5) and CtslΔTECLm54Tg (n = 4 or 5) mice, assessed by intracellular staining and flow cytometry (Nur77) or flow cytometry (CD5). h, CD5 surface expression (MFI ± s.e.m.) at consecutive DP and CD4SP cell differentiation stages in Ctsl+/+Lm6Tg (n = 4) and CtslΔTECLm6Tg mice (n = 5), as assessed by flow cytometry. P values in b and c were determined by two-way ANOVA and a Sidak’s test for multiple comparisons. Data in d and e were analyzed by Student’s two-tailed t-test.

Source data

Although CD4SP cells were reduced in Rag1−/−CtslΔTECLm54Tg mice compared to Rag1−/−Ctsl+/+Lm54Tg mice (hereafter CtslΔTECLm54Tg and Ctsl+/+Lm54Tg, respectively), their segregation into the SM, M1 and M2 subsets was virtually identical (Fig. 6f). To determine whether selection of Lm54 thymocytes in the presence or absence of CTSL affected the underlying TCR signals (despite seemingly identical developmental progression downstream of the positive selection checkpoint), we assessed the expression of the nuclear receptor Nur77, whose levels reflect ongoing or recent TCR signaling29. Although Lm54 thymocytes in Ctsl+/+ mice showed typical dynamic Nur77 modulation, with upregulation in the signaled DP cells and return to baseline in M2 CD4SP cells, Nur77 expression was markedly attenuated in cells selected in CtslΔTEC mice (Fig. 6g), consistent with weaker selecting TCR–pMHC interactions in the absence of CTSL. In line with this, both Lm54 and Lm6 thymocytes selected in CtslΔTEC mice showed markedly lower CD5 upregulation than their counterparts selected in Ctsl+/+ mice (Fig. 6g,h), and this pattern was also observed for CD6, whose expression likewise correlates with TCR signal strength30 (Extended Data Fig. 5c). Thus, even for TCR clones that appeared CTSL independent in their repertoire seeding, CTSL was still crucial for calibrating the intensity of the positive selection signal.

Functional tuning by CTSL regulates homeostatic fitness

We next compared gene expression profiles in M2 CD4SP cells from CtslΔTECLm54Tg and Ctsl+/+Lm54Tg mice. Genes more highly expressed in CtslΔTEC cells included the genes encoding the two subunits of co-receptor CD8, as well as ion channels or solute carriers such as TMIE and SLC16A5 (Fig. 7a), which typically peak in DP cells, whereas Ccr8, which is upregulated in CD4SP cells31, was reduced, consistent with ‘less mature’ traits in thymocytes selected in the absence of CTSL. Gene set enrichment analysis (GSEA) further revealed underrepresented transcripts in CtslΔTEC cells linked to translation, mTORC1 signaling and Myc targets (Fig. 7b), and these correlated with reduced cell size and diminished upregulation of CD44, a marker of mTOR signaling32,33 (Extended Data Fig. 6a,b), suggesting reduced metabolic activity. Consistent with this, ex vivo assessment of basal protein biosynthesis showed diminished translation in the absence of CTSL, first emerging in CD69+ signaled DP cells and persisting through the M2 CD4SP cell stage (Fig. 7c).

Fig. 7: CTSL deficiency causes aberrant functional tuning and impaired homeostatic fitness of CD4+ T cells.
Fig. 7: CTSL deficiency causes aberrant functional tuning and impaired homeostatic fitness of CD4+ T cells.The alternative text for this image may have been generated using AI.
Full size image

a, MA plot of gene expression in Ctsl+/+Lm54Tg (n = 4) versus CtslΔTECLm54Tg (n = 5) M2 CD4SP cells; differentially expressed genes (adjusted P value of <0.1) are shown in color. b, GSEA showing gene sets underrepresented (normalized enrichment score (NES) < −1.8) in CtslΔTECLm54Tg cells as in a. c, Representative flow cytometry plots showing translation at thymocyte stages in Ctsl+/+Lm54Tg and CtslΔTECLm54Tg mice (n = 3 each). A translation inhibitor (background) was included. d, CD69 expression (MFI ± s.e.m.) in M2 CD4SP cells from Ctsl+/+Lm54Tg and CtslΔTECLm54Tg mice (n = 4 each) after 18 h of stimulation with plate-bound LLO190–201:I-Ab. e, PD-1 and BTLA expression (MFI ± s.e.m.) at consecutive DP and CD4SP stages of differentiation in Ctsl+/+Lm54Tg (n = 3) and CtslΔTECLm54Tg mice (n = 4), as assessed by flow cytometry. f, GSEA of Ctsl+/+Lm54Tg versus CtslΔTECLm54Tg M2 CD4SP cells (n = 2 each) after 4 h of stimulation as in d or unstimulated as in a; only gene sets with a normalized enrichment score (NES) of >| 2 | in stimulated cells are shown. g, IL-7Rα and BCL-2 expression (MFI ± s.e.m.) at consecutive DP and CD4SP stages of differentiation in Ctsl+/+Lm54Tg and CtslΔTECLm54Tg mice (n = 5 each). Samples were analyzed by flow cytometry. h, Representative flow cytometry plots showing apoptosis in Ctsl+/+Lm54Tg and CtslΔTECLm54Tg M2 CD4SP cells (n = 5 each) after 24 h of culture in normal medium. i, Viability (mean ± s.e.m.) of Ctsl+/+Lm54Tg and CtslΔTECLm54Tg M2 CD4SP cells (n = 3 each) after 24 h of culture with IL-7, as assessed by flow cytometry. j, Donor cell ratio ± s.e.m. in the spleens of 4.5-Gy-irradiated recipients (n = 4) after transfer of a 1:1 mixture of Ctsl+/+Lm54Tg and CtslΔTECLm54Tg M2 CD4SP cells, as assessed by flow cytometry. k, Ratio ± s.e.m. of donor cells in the spleens of MHCII−/− recipients (n = 3) after transfer of a 1:1 mixture of CellTrace Violet (CTV)-labeled Ctsl+/+Lm54Tg and CtslΔTECLm54Tg M2 CD4SP cells. Histogram (right) shows representative CTV profiles on day 13 after transfer. l, Ratio ± s.e.m. of donor cells in the blood of Rag1−/− recipients (n = 4) after transfer of a 1:1 mixture of Ctsl+/+Lm54Tg and CtslΔTECLm54Tg M2 CD4SP cells. m, Percent divided cells ± s.e.m. in an experimental replicate as in l with CellTrace Violet-labeled donor cells (n = 3 Rag1−/− recipients; day 9). Data were analyzed by Student’s two-tailed t-test.

Source data

After 18 h of stimulation with plate-bound I-Ab:LLO190–201 in vitro, M2 CD4SP cells from CtslΔTECLm54Tg mice upregulated the activation marker CD69 with an approximately fivefold lower half-maximum inhibitory concentration than their counterparts from Ctsl+/+Lm54Tg mice (Fig. 7d), indicating enhanced TCR sensitivity. Elevated responsiveness to TCR stimulation was likewise observed in polyclonal CtslΔTEC M2 CD4SP cells (Extended Data Fig. 6c). By contrast, short-term stimulation with phorbol 12-myristate 13-acetate (PMA), which bypasses upstream TCR signaling, elicited comparable levels of ERK phosphorylation in M2 CD4SP cells selected in CtslΔTECLm54Tg and Ctsl+/+Lm54Tg mice (Extended Data Fig. 6d), suggesting that the hyperresponsiveness of CD4SP cells selected in the absence of CTSL was confined to the proximal TCR signaling cascade. Flow cytometric analysis showed that, in addition to CD5 and CD6, two further negative regulators of TCR signaling (PD-1 and BTLA34,35) also exhibited reduced expression downstream of positive selection in CtslΔTECLm54Tg mice compared to Ctsl+/+Lm54Tg mice (Fig. 7e). These findings suggest that the observed TCR-proximal hyperresponsiveness of cells selected in the absence of CTSL may be attributable to reduced expression of multiple TCR signaling attenuators.

Gene expression profiling of M2 CD4SP cells from CtslΔTECLm54Tg or Ctsl+/+Lm54Tg mice after stimulation with plate-bound I-Ab:LLO190–201 revealed a relative enrichment of transcriptional modules related to mRNA export, processing and splicing (processes potentially more directly linked to TCR activation) in cells selected in the absence of CTSL (Fig. 7f). By contrast, these cells exhibited comparably less efficient implementation of transcriptional programs related to key metabolic pathways crucial for sustaining T cell activation, including proliferation, glycolysis, mTORC1 signaling, cholesterol homeostasis and fatty acid metabolism (Fig. 7f). Many of these differences were not exclusively triggered following TCR activation but were already evident before stimulation and became more pronounced following TCR engagement (Fig. 7f), suggesting that these traits had been differentially ‘imprinted’ during positive selection in the presence or absence of CTSL.

Given the aberrant ‘tuning’ of multiple basal metabolic programs when Lm54 CD4+ T cells were selected in the absence of CTSL, we next examined additional hallmarks associated with CD4+ T cell survivability. Flow cytometric analysis showed that, across all maturation stages downstream of positive selection, expression of the interleukin-7 receptor (IL-7R) and BCL-2 (both key orchestrators of CD4+ T cell survival36) was reduced in CtslΔTECLm54Tg mice compared to Ctsl+/+Lm54Tg mice (Fig. 7g).

We therefore next assessed whether M2 CD4SP cells from these two genotypes differed in their homeostatic properties. Following 24-h in vitro culture without added growth factors or TCR stimulation, cells selected in the absence of CTSL exhibited substantially higher apoptosis (Fig. 7h). Administration of high-dose IL-7 rescued this in vitro survival defect (Fig. 7i). To address whether selection in the absence of CTSL impaired homeostasis in vivo, we conducted competitive co-transfer experiments using M2 CD4SP cells from CtslΔTECLm54Tg and Ctsl+/+Lm54Tg mice. Two weeks after co-transfer at a 1:1 ratio into sublethally irradiated wild-type recipients, the donor cell ratio had markedly shifted in disfavor of cells selected in the absence of CTSL (Fig. 7j). Analogous co-transfers of polyclonal M2 CD4SP cells from CtslΔTEC and Ctsl+/+ donors to wild-type recipients recapitulated these observations and revealed a tendency toward diminished homeostatic proliferation of CtslΔTEC cells (Extended Data Fig. 6e,f).

To disentangle the complex interplay between tonic TCR–pMHCII interactions and competition for soluble cues such as IL-7, which together sustain naive CD4+ T cell maintenance and may also drive homeostatic proliferation36, we repeated the co-transfer of M2 CD4SP cells from CtslΔTECLm54Tg and Ctsl+/+Lm54Tg mice using MHCII−/− or Rag1−/− recipients. In MHCII−/− hosts (where homeostatic MHCII contacts are abolished but survival factors such as IL-7 are readily available due to the absence of endogenous CD4+ T cells), cells selected in the absence of CTSL again exhibited a competitive disadvantage, although neither donor population showed evidence of proliferation (Fig. 7k). This indicated a diminished survivability of cells selected in the absence of CTSL that was not explained by altered responsiveness to tonic TCR signaling and/or reduced homeostatic proliferation. In Rag1−/− recipients, which similarly provide ample access to soluble survival factors owing to their lack of an endogenous CD4+ T cell compartment but, in contrast to MHCII−/− recipients, permit homeostatic TCR–pMHCII contacts, the donor cell ratio once more shifted in disfavor of cells selected in the absence of CTSL (Fig. 7l). However, unlike in MHCII−/− recipients (Fig. 7k), a significantly smaller proportion of these cells underwent at least one cell division (Fig. 7m), revealing defective homeostatic proliferation as an additional layer of impaired functional fitness. Thus, CD4+ T cells selected in CtslΔTEC mice in a seemingly CTSL-independent manner (based on their progression to the mature CD4SP stage) retained a lasting functional imprint of their selection history, manifesting as a metabolically less poised state and diminished homeostatic responsiveness to both ‘tonic’ TCR signals and TCR-independent survival cues.

Discussion

Our findings highlight two key roles for CTSL in shaping the CD4+ T cell compartment by establishing full repertoire diversity and optimizing the fitness of clones that enter the repertoire in a seemingly CTSL-independent manner. We refer to the complete loss of certain clones as ‘essential CTSL dependency’ and to impaired functionality of retained clones as ‘functional CTSL dependency’.

Essentially CTSL-dependent clones contributed disproportionately to repertoire diversity, conceivably reflecting reliance of low-abundance clones on few (or even a single) selecting pMHCII ligand(s). High-abundance clones may be more flexible in their range of selecting pMHCII ligands. If CTSL is required for only a subset of these ligands, the frequency of TCR–pMHC contacts may be reduced, allowing such clones to be retained in the repertoire, albeit with defects characteristic of functionally CTSL-dependent clones. CD5 expression in bulk CD4SP cells was diminished in the absence of CTSL. This was not due to preferential loss of ‘natural’ CD5hi clones, suggesting that reduced signal strength affected the full range of selecting interactions. As a consequence, natural CD5lo clones may more frequently fail to reach signaling thresholds for positive selection. Indeed, the proportion of essentially CTSL-dependent clones was higher among natural CD5lo clones. By contrast, higher-affinity natural CD5hi clones may persist through compensatory sensitization of proximal TCR signaling. This may involve reduced expression not only of CD5 but also of other negative regulators such as CD6, PD-1 and BTLA, all evident in clones Lm54 and Lm6.

Although reduced peptide diversity may explain some consequences of CTSL deficiency, this does not preclude the possibility that the key role of CTSL lies in generating ‘qualitatively special’ peptides optimal for selection. Despite the loss of ~50% of TCRs in TcrbFixedCtslΔTEC mice, numerous ‘newcomer TCRs’ emerged, suggesting that the pMHCII ligandome remained diverse yet was enriched in otherwise absent or outcompeted ‘newcomer ligands’. The unusual V–J features of newcomer TCRs imply that they arose through an atypical selection process and may not be functionally equivalent to normally selected CD4+ T cells. As cTECs express at least two other cathepsins, cathepsin B and D24, these may generate diverse, yet ‘suboptimal’, selecting peptides in the absence of CTSL. Circumstantial support for this notion comes from experiments showing that occupancy of <5% of MHCII molecules on cTECs by pMHC ligands generated by the normal proteolytic machinery, including CTSL, was sufficient to sustain CD4+ T cell numbers near wild-type levels37.

Essential and functional CTSL dependency likely both contribute to the blunted anti-LLO CD4+ T cell response in CtslΔTEC mice. Half of the ten ‘best-responder’ clones in TcrbFixedCtsl+/+ mice were physically absent from the repertoire of TcrbFixedCtslΔTEC mice. We deem it likely that both in TcrbFixed mice and in the fully αβ polyclonal repertoire, a sizeable proportion of clones retained in the absence of CTSL are functionally compromised. This is exemplified by the clones Lm54 and Lm6, whose altered features when selected in the absence of CTSL may be the rule rather than the exception. However, their marked homeostatic defects complicate efforts to resolve how functional CTSL dependency affects responsiveness and effector functions following immunogenic challenge with cognate antigen.

The exact mechanism underlying the impaired homeostatic fitness of functionally CTSL-dependent clones remains unclear. These clones appeared to progress ‘on autopilot’ through differentiation stages downstream of positive selection in the absence of CTSL yet exhibited alterations in multiple cellular programs. A downshift in CD5 expression may directly contribute to their inferior homeostatic behavior. In normally selected thymocytes, elevated CD5 expression calibrates the NF-κB pool, promoting viability and responsiveness despite attenuating TCR signals38. NF-κB also establishes proliferation competence26 and augments IL-7 responsiveness, triggering prosurvival transcriptional programs including BCL-2 (ref. 39). IL-7R expression is fine-tuned by TCR signals during thymic development40, and reduced IL-7R and BCL-2 expression in functionally CTSL-dependent clones suggests that selection in the absence of CTSL led to diminished IL-7 responsiveness and compromised survivability.

The ‘altered peptide’ model, proposed over three decades ago, suggested that developing thymocytes engage unique pMHC combinations for positive selection41. This model was later abandoned when it was found that several abundant MHCII-associated peptides were shared between cTECs and other APCs42, and the ‘affinity model’ has become the prevailing concept to explain the dual role of self-pMHC ligands in both positive and negative selection. However, the discovery of distinct proteolytic pathways in cTECs has revived interest in the possibility of ‘private’ pMHC ligands in cTECs having crucial physiological relevance. The ‘peptide-switch’ model proposes that minimizing the overlap between the pMHC ligandomes of cTECs and those of negatively selecting APCs prevents positive and negative selection from canceling each other out15,43. Although we cannot formally exclude contributions from negative selection, our findings demonstrate that the diminished CD4SP compartment in CtslΔTEC mice primarily reflects a genuine bottleneck in positive selection.

How ‘private’ peptides generated by CTSL contribute to optimal CD4+ T cell selection remains speculative. Their unique role may be specified through conserved TCR contact and/or MHC anchor residues or via an allosteric influence on the ‘nonpeptide’ MHC–TCR interface. Methodological advances will be required to comprehensively characterize the cTEC pMHC ligandome. Another intriguing question is how the requirement for ‘private’ peptides in positive selection can be reconciled with the hypothesis that the very same self-peptides mediating positive selection also support naive T cell homeostasis in the periphery and act as coagonists when T cells respond to foreign antigens44,45.

Methods

Mice

Ctslfl/fl46, Ctsl−/−47, Tcr-Dep48, Ciita[kd28, Tcr-Plp1 (ref. 49), Tcr-LLO56 and Tcr-LLO118 (ref. 50), Tcr-AND and Tcr-AD10 (ref. 51), Tcr-OT-II52, Foxn1-cre53, MHCI−/− (B2m−/−)54, MHCII−/− (H2-Ab1−/−)55, Rag1−/−56, Plp1−/−57, Tcra−/−58 and Foxp3GFP reporter mice (DEREG)59 have been described previously. For the generation of Lm54Tg and Lm6Tg mice, pTα and pTβ cassette vectors60 were modified to contain the V(D)J regions of the respective TCRs identified by single-cell TCR sequencing. Transgenic mice were generated by injection of linearized DNA into the pronuclei of C57BL/6 zygotes. TcrbFixed transgenic mice only express the TCRβ chain of the Lm54 clone. All mice used were on a C57BL/6J background. Mice were maintained under specific pathogen-free conditions in individually ventilated cages at an ambient temperature of 22 °C and 55% humidity with standard light cycle conditions. All phenotypic analyses were performed in mice of 8–12 weeks of age, unless otherwise indicated, and animals of both sexes were included, as we did not find any evidence for sex differences in the parameters addressed. Animal studies and procedures were approved by local authorities (Regierung von Oberbayern; Az Vet_02-22-66).

CTSL active site labeling and western blotting

Sorted TECs were incubated for 1 h at 37 °C in conditioned culture medium containing 1 μM BMV109 and processed as previously described61. Briefly, cell pellets obtained from 3 × 104 TECs were lysed in hypotonic buffer in the presence of 4 mM DTT for 15 min on ice and centrifuged for 30 min at 10,000g. Supernatants were denatured by the addition of 3× SDS sample buffer (containing 10% β-mercaptoethanol) and resolved by SDS–PAGE (15%) at 120 V for 60 min, together with the protein marker ROTI Mark Tricolor (Roth). The gel was scanned with a Typhoon FLA9500 imager in the Cy5 channel (GE Healthcare). Proteins were either stained with Coomassie or transferred in an exact replica of the gel onto a nitrocellulose membrane via semidry western blotting. Transfer was performed with transfer buffer containing 8% methanol and at a maximum of 50 V and 160 mA for 90 min. Following protein transfer, the membrane was incubated with goat anti-mouse CTSL polyclonal IgG (AF1515, R&D; 1 μg ml–1), followed by secondary mouse anti-goat horseradish peroxidase-conjugated polyclonal IgG (205-035-108, Jackson ImmunoResearch; 40 ng ml–1) and SuperSignal West Pico Chemiluminescent Substrate (Thermo Scientific). Membranes were imaged with an iBright1500 scanner (Invitrogen). For the housekeeping control, mouse mAb to β-actin (clone AC-15, Sigma; 200 ng ml–1) was used as primary antibody, followed by secondary rabbit anti-mouse horseradish peroxidase-conjugated polyclonal IgG (P0260, Agilent Dako).

Histology

Five-micron sections of paraffin-embedded material were stained with hematoxylin (Mayer) and eosin (Morphisto). Microscopy was performed with a Leica DM2500 brightfield microscope, equipped with a DMC2900 CMOS camera and HC PL FL ×10/0.30-NA PH1 or HC PL FLUOTAR ×5/0.15-NA objective. The resulting image pixel sizes were 581 nm (×10) and 1.162 nm (×5). Microscopy images were acquired with LAS X Office v1.4.6 (Leica Microsystems).

Determination of cell size

Cell size measurements were conducted on a Countess 3 Automated Cell Counter (Thermo Fisher).

Preparation of TECs

Thymi from 3- to 5-week-old animals were cut into pieces, and thymocytes were mechanically released by pipetting up and down. The supernatant containing thymocytes was discarded. The thymus fragments were digested with liberase (0.5 U ml–1; Roche) and DNase I (10 mg ml–1; Roche) at 37 °C in three consecutive rounds of 15 min. Cells were washed and resuspended in 1 ml of high-density Percoll (ρ = 1.115; GE Healthcare) and overlaid with 1 ml of low-density Percoll (ρ = 1.055), followed by a layer of 1 ml of RPMI (Gibco). The gradient was centrifuged at 1,350g for 30 min at 4 °C (without brake). The top interphase, containing the low-density cell fraction, was collected, washed and subjected to CD45 magnetic-activated cell sorting depletion, using CD45 MicroBeads (Miltenyi Biotech). The CD45 fraction was stained with DAPI and surface antibodies. TECs were analyzed or sorted according to the expression of CD45, Ly51, EpCAM, CD80 and MHCII as follows: cTECs (CD45EpCAM+Ly51+), total mTECs (CD45EpCAM+Ly51) and mTEChi (CD45EpCAM+Ly51MHCIIhi, CD80hi).

Flow cytometry

Antibodies were purchased from Biolegend, unless otherwise specified, and used as conjugates with various fluorochromes: anti-CD4 (clone RM4-5), anti-CD8α (53-7.3), anti-CD326/EpCAM (G8.8), anti-Ly51 (6C3), anti-CD80 (16-10A1), anti-CD5 (53-7.3), anti-TCRβ (H57-597), anti-CD69 (H1.2F3), anti-H-2Kb (AF6-88.5), anti-CD45.1 (A20), anti-CD45.2 (104), anti-CD44 (IM7), anti-CD62L (MEL-14), anti-TCRvα2 (B20.1), anti-CD127/IL-7Rα (A7R34), anti-PD-1 (RMP1-30), anti-BTLA (6A6) and anti-I-A/I-E (M5/114.15.2). The following antibodies were used to distinguish MHCII structural epitopes: anti-CLIP:I-Ab (15G4)62, anti-non-CLIP:I-Ab (BP107.2.2, a kind gift from A. Rudensky, Memorial Sloan Kettering Cancer Center)63 and anti-Eα52–68:I-Ab (Y-Ae)64. To determine translation activity, thymocytes were pulsed for 5 min with 5 µg ml–1 puromycin (Merck) immediately before intracellular staining with phycoerythrin (PE)-conjugated mAb to puromycin (2A4)65. Control cells were pretreated for 15 min with 5 µg ml–1 harringtonine (MedChem Express) and, during the puromycin pulse, treated with 100 µg ml–1 cycloheximide (Roth) and 150 µg ml–1 chloramphenicol (Merck). For intracellular staining, cells were fixed and permeabilized using reagents from a Foxp3 staining kit (eBioscience) and stained with PE-conjugated anti-Nur77 (12.14, eBioscience) or PE-conjugated anti-BCL-2 (BCL/10C4). Cells were analyzed using a BD FACSCANTOII or LSRFortessa flow cytometer or sorted using a BD FACSAriaFusion sorter. Flow cytometry data were acquired using FACSDiva v6.2 software (BD Bioscience). Flow cytometry data were analyzed with FlowJo v10.9.0 software.

Tet staining

Tets conjugated to APC or PE were kindly provided by M. Jenkins (University of Minnesota)66. Single-cell suspensions from thymi or pooled lymph nodes and spleens were incubated for 1 h at 25 °C with 10 nM of both APC- and PE-conjugated Tets, as previously described67. Cells were next washed and enriched with anti-APC and anti-PE MicroBeads (Miltenyi Biotech). AccuCheck Counting beads (Thermo Fisher) were used to determine the number of Tet+ cells in the column-bound fraction. Tet staining on samples from LLO-immunized mice was followed by enrichment with anti-CD4 MicroBeads (Miltenyi Biotech), instead of anti-APC/anti-PE MicroBeads. A dump cocktail of antibodies, containing anti-CD11b (M1/70), anti-CD11c (N418), anti-B220 (RA3-6B2) and anti-F4/80 (BM8), was used to exclude non-T cells and autofluorescent cells in flow cytometric analyses.

BM chimeras

Recipient mice were irradiated with two split doses of 4.5 Gy, at least 2 h apart, the day before reconstitution. BM was depleted of differentiated cells using biotinylated mAbs to CD4 (clone RM4-5), CD8α (53-7.3), B220 (RA3-6B2), CD11b (M1/70), CD11c (N418), Gr1 (RB6-8C5) and F4/80 (BM8; Biolegend) together with streptavidin magnetic-activated cell sorting MicroBeads (Miltenyi Biotech). Recipient mice were injected i.v. with 5 × 106–10 × 106 BM cells. Neomycin (Belapharm) was supplemented in the drinking water for the first 4 weeks, and mice were killed 6 weeks after reconstitution.

Large-scale Tcra sequencing

For experiments depicted in Fig. 3 (global M2 repertoire comparison in CtslΔTEC versus Ctsl+/+ TcrbFixed mice), 3 × 105–4 × 105 mature M2 CD4SP (CD4+CD8αCD69MHCI+CD25FoxP3) cells from 5- to 8-week-old CtslΔTECTcra+/−TcrbFixedFoxp3GFP mice or corresponding controls were bulk sorted. Material from two to three mice was pooled to obtain comparable total cell counts in each sample. For experiments depicted in Fig. 5a–c (global ‘natural’ CD5lo versus ‘natural’ CD5hi M2 repertoire comparison in Ctsl+/+TcrbFixed mice), 1 × 105–2 × 105 mature M2 CD4SP cells (CD4+CD8αCD69MHCI+CD25FoxP3), corresponding to the 15% lowest or 15% highest levels of CD5 expression on total CD4SP cells, were bulk sorted from individual 5- to 8-week-old Ctsl+/+Tcra+/−TcrbFixedFoxp3GFP mice. For experiments depicted in Fig. 5d,e (TCR analysis of expanded LLO-Tet+ cells), 1 × 105–2 × 105 LLO-Tet+ cells were bulk sorted from pooled spleen and lymph node cells of individual 6- to 8-week-old Ctsl+/+Tcra+/−TcrbFixedFoxp3GFP mice 7 days after systemic immunization with LLO190–201. Samples were stored at −80 °C in RNAprotect (Qiagen). All further sample preparation steps were performed by Qiagen. RNA was isolated using an RNeasy Mini/Micro kit (Qiagen), and library preparation was performed using a QIAseq Immune Repertoire RNA Library kit (Qiagen), modified to include only Tcra-specific primers for both the reverse transcription and target enrichment steps. Library preparation quality control was performed using an Agilent DNA 7500 Chip and Qubit dsDNA HS. Libraries that passed quality control were finally sequenced on a NovaSeq 6000 (Illumina) sequencing instrument according to the manufacturer’s instructions, with a read length of 2 × 250 bp and an SP flow cell. Raw data were demultiplexed, and FASTQ files for each sample were generated using bcl2fastq software (Illumina).

Single-cell TCR sequencing

To increase the likelihood of finding shared TCRs and to ensure subsequent detection of transgenically re-expressed TCRs by fluorescence-activated cell sorting, we restricted this analysis to cells stainable with the mAbs MR9-4 (anti-TCR-Vβ5; genes Trbv5.n) and B20.1 (anti-TCR-vα2; genes Trav14.n). Dump cocktail-negative MR9-4+B20.1+Tet+ cells were single-cell sorted into 96-well plates and immediately frozen at −80 °C. All subsequent steps were adapted from Dössinger et al.68. Reverse transcription was performed using an iScript Select cDNA Synthesis kit (Bio-Rad) with a mix of primers specific for the TCRα (Trac) and TCRβ (Trbc) constant regions. Reverse transcription was followed by a digestion step with exonuclease I (Thermo Scientific) to remove single-stranded primers. The exonuclease digestion product was further subjected to a dGTP tailing step and split into two 96-well plates to continue with separate PCRs for Trac and Trbc sequencing. The first PCR was performed with an anchor primer complementary to the introduced 3′-guanosine overhang together with primers specific for Trac or Trbc, respectively. The following two rounds of nested PCR were performed with primers binding Trav14 and Trac or Trbv5 and Trbc, respectively. PCR products were Sanger sequenced by Eurofins Genomics by using either a Trav14-specific primer (for TCRα) or a Trbv5-specific primer (for TCRβ). Both Trac and Trbc sequences were annotated using IMGT/V-Quest69 with the C57BL/6-specific library. TCR clonotypes were defined at the level of amino acid sequence (V-region, CDR3 and J-region), and only those consisting of both an in-frame Trac and an in-frame Trbc were retained. As each TCR clonotype found in a given mouse was counted only once, counts in the Source Data related to Fig. 6a refer to the number of individual mice where the respective clonotype was detected.

TCRα repertoire analysis

The CLC Genomics Workbench software (v23.0.3) provided by Qiagen was used to generate clonotype reads. Briefly, sequences with the same unique molecular index were merged. Further quality control steps included merging and trimming of overlapping paired-end sequences. Results were annotated using the C57BL/6J-specific functional Tcra genes extracted from the IMGT mouse TCR database. Analyses were performed using RStudio (version 2024.04.2+764). A clonotype was defined as a unique combination of Trav gene and Traj gene and an in-frame CDR3 amino acid sequence. Clonotype abundance was defined as the number of clonotype-encoding unique cDNA molecules (distinguished by unique molecular index) detected in a sample. TCR diversity was assessed using the approach of Chao et al.70, wherein one unique cDNA molecule was defined as one ‘individual’ and one clonotype as one ‘species’. Rarefaction with Hill numbers was performed using ‘iNEXT’ (versions 2.0.20 and 3.0.1). Shannon diversity is the exponential of Shannon entropy and is calculated in ‘iNEXT’ using q = 1. Shannon diversity (or effective number of TCR clonotypes) is the number of equally abundant clonotypes that would be needed to give the same Shannon diversity as the sample. As sampling completeness approached the maximum (coverage = 1) for each sample and subrepertoire examined, the observed Shannon diversities were used to calculate ‘relative diversity’. As adding 95% confidence intervals constructed from five bootstrap replications made a negligible difference to the appearance of the rarefaction curves and were smaller than the symbols on the ‘relative diversity’ summaries, these details were omitted. To assess TCR overlap between pairs of samples, we used ‘abdiv’ (version 0.2.0) to calculate the Morisita–Horn index. Quantification of nucleotide deletion or addition at Trav–Traj junctions was performed using results from the IMGT Junction Analysis tool. The ‘no. nucleotides deleted or added at the TravTraj junction’ was defined as the sum of the absolute values of the number of nucleotides deleted from the germline Trav and/or Traj gene(s) plus the number of P and/or N nucleotides added between the remaining germline Trav and/or Traj nucleotides. Due to duplication events at the Tcra locus in C57BL/6 mice, 23–30% of unique cDNA molecules per sample aligned with greater than one Trav paralog; these sequences were excluded from the analyses of chromosomal location of Trav gene usage and nucleotide deletion or addition at Trav–Traj junctions.

Immunization

For hock immunization, isoflurane-anesthetized mice were subcutaneously injected with an emulsion of LLO190–201 peptide (NEKYAQAYPNVS, GenScript) in PBS and Freund’s adjuvant (Sigma). Systemic immunization was performed by i.v. injection of 50 μg of LLO190–201 peptide and 50 μg of high-molecular-weight poly(I:C) (InvivoGen) in PBS. When combined with adoptive cell transfers, mice were immunized 6 h before the injection of donor cells.

Adoptive T cell transfer

For experiments depicted in Fig. 4d, mature CD4SP conventional thymocytes from CtslΔTEC Foxp3GFP (CD45.2) and Foxp3GFP littermate control (CD45.1) mice were sorted (CD4+CD8αCD69MHCI+CD25FoxP3) and mixed at a 1:1 ratio. Cells were injected i.v. into CD45.1/CD45.2 recipients (106 total cells per mouse), previously immunized with LLO plus poly(I:C). For transfer experiments depicted in Fig. 7, M2 CD4SP thymocytes from CtslΔTECLm54TgRag1−/− and Ctsl+/+Lm54TgRag1−/− mice were sorted, mixed at a 1:1 ratio and i.v. injected into CD45.1/CD45.2 recipients (1 × 106 total cells per mouse). Where indicated, the cells were labeled with CellTrace Violet (Invitrogen; 5 μM for 5 min at 37 °C) before adoptive transfer. All results were confirmed with inverted combinations of congenic markers.

In vitro stimulation of Lm54 CD4SP thymocytes

For antigen-specific in vitro stimulation, M2 CD4SP thymocytes were stimulated for 18 h at 37 °C in the presence of soluble anti-CD28 (clone 37.51, Bio X Cell; 250 ng ml–1) and titrated amounts of plate-bound LLO190–201:I-Ab monomer. All stimulations were performed with 1.5 × 105 cells per well in 96-well, U-bottom plates and HL-1 serum-free medium (Lonza). The viability dyes Zombie NIR and Apotracker Green (Biolegend) were used to exclude dead and apoptotic cells. Short-term PMA stimulation was performed with 50 ng ml–1 PMA for 5 min at 37 °C.

RNA sequencing and analysis

Cells were collected and stored at −80 °C in RNAprotect (Qiagen). All further sample preparation steps (RNA isolation, rRNA depletion and library preparation) were performed by Eurofins Genomics, using the INVIEW transcriptome discovery package. Libraries that passed quality control were sequenced on a NovaSeq 6000 (Illumina) sequencing instrument, with 2 × 150 bp read length and S4 flow cells. All data processing methods were applied using default parameters unless specified. Expression quantification was performed using kallisto (version 0.48) with Ensembl release version 106 for M. musculus. In R/Bioconductor, expression data were collapsed from the isoform level to the gene level for downstream processing. Differential expression was assessed using DESeq2 (version 1.36). GSEAs were conducted using fgsea (version 1.22).

Statistical analysis

Statistical analyses were performed using GraphPad Prism (v9), except for calculations of Shannon diversity and Morisita–Horn indexes. The specific statistical tests used are indicated in the corresponding figure legends. Data distribution was assumed to be normal, but this was not formally tested. No a priori sample size calculations were performed; sample sizes were based on prior experience with similar experiments and were deemed sufficient to detect biologically meaningful differences. Measurements were not conducted blind to the conditions of the experiment, as flow cytometric outputs were analyzed using standardized, objective gating strategies. No data points were excluded from analysis.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.