De novo identification of CD4+ T cell epitopes

Zdinak, Paul M.; Trivedi, Nishtha; Grebinoski, Stephanie; Torrey, Jessica; Martinez, Eduardo Zarate; Martinez, Salome; Hicks, Louise; Ranjan, Rashi; Makani, Venkata Krishna Kanth; Roland, Mary Melissa; Kublo, Lyubov; Arshad, Sanya; Anderson, Mark S.; Vignali, Dario A. A.; Joglekar, Alok V.

doi:10.1038/s41592-024-02255-0

Download PDF

Article
Open access
Published: 24 April 2024

De novo identification of CD4⁺ T cell epitopes

Paul M. Zdinak^1,2,3,
Nishtha Trivedi^1,2^na1,
Stephanie Grebinoski^1,3^na1,
Jessica Torrey^1,2,
Eduardo Zarate Martinez ORCID: orcid.org/0000-0001-6411-3563^1,2,4,
Salome Martinez^1,2,
Louise Hicks^1,2,
Rashi Ranjan ORCID: orcid.org/0000-0002-8303-7815^1,2,
Venkata Krishna Kanth Makani^1,2,
Mary Melissa Roland^1,2,
Lyubov Kublo^1,2,
Sanya Arshad^1,2,
Mark S. Anderson ORCID: orcid.org/0000-0002-3093-4758⁵,
Dario A. A. Vignali ORCID: orcid.org/0000-0002-2771-5992^1,6,7 &
…
Alok V. Joglekar ORCID: orcid.org/0000-0001-7554-7447^1,2,7

Nature Methods (2024)Cite this article

2321 Accesses
23 Altmetric
Metrics details

Subjects

Abstract

CD4⁺ T cells recognize peptide antigens presented on class II major histocompatibility complex (MHC-II) molecules to carry out their function. The remarkable diversity of T cell receptor sequences and lack of antigen discovery approaches for MHC-II make profiling the specificities of CD4⁺ T cells challenging. We have expanded our platform of signaling and antigen-presenting bifunctional receptors to encode MHC-II molecules presenting covalently linked peptides (SABR-IIs) for CD4⁺ T cell antigen discovery. SABR-IIs can present epitopes to CD4⁺ T cells and induce signaling upon their recognition, allowing a readable output. Furthermore, the SABR-II design is modular in signaling and deployment to T cells and B cells. Here, we demonstrate that SABR-IIs libraries presenting endogenous and non-contiguous epitopes can be used for antigen discovery in the context of type 1 diabetes. SABR-II libraries provide a rapid, flexible, scalable and versatile approach for de novo identification of CD4⁺ T cell ligands from single-cell RNA sequencing data using experimental and computational approaches.

Bispecific T cell engager therapy for refractory rheumatoid arthritis

Article 26 April 2024

Structures of human γδ T cell receptor–CD3 complex

Article 24 April 2024

An autoantibody signature predictive for multiple sclerosis

Article 19 April 2024

Main

A hallmark of the adaptive immune system is the ability to raise antigen-specific responses. This is accomplished for αβT cells through the T cell receptor (TCR), which comprises TCRα and TCRβ chains¹. Specifically, TCRs from CD4⁺ T cells recognize peptide epitopes on MHC-II or human leukocyte antigen (HLA)-II. The estimated size of the mature TCR repertoire is 10⁸–10¹⁰ unique TCRs in mice and 10⁹–10¹² unique TCRs in humans^2,3,4. Recognition of foreign antigens such as those from SARS-CoV-2 and tumor neoantigens by CD4⁺ T cells leads to their protective function^5,6. On the other hand, recognition of self-antigens such as insulin in type 1 diabetes (T1D), leads to pathogenic CD4⁺ T cell responses^7,8. Furthermore, regulatory T cells can bind to self-antigens and prevent autoimmunity⁹. The specificity of CD4⁺ T cells is key to their function, highlighting a need for antigen discovery approaches tailored for MHC-II and HLA-II¹⁰.

Traditionally, antigen-specific CD4⁺ T cells have been studied using functional assays that measure proliferation, cytokine release or cytotoxicity^11,12,13,14. These assays are sensitive but are limited to investigating tens of peptides simultaneously. Techniques such as barcoded tetramers can efficiently detect antigen-specific T cells but are limited to the interrogation of 100s of specificities simultaneously^{15,16,17,18,19,20} and are further limited by the instability of multimers and lower affinities of CD4⁺ TCRs^21,22. Unbiased approaches such as yeast display and combinatorial peptide libraries have been used to identify epitopes de novo, but these methods often identify nonphysiological epitopes (altered peptide ligands or mimotopes), are highly laborious, and in the case of yeast display, rely on soluble TCR generation^23,24,25,26. Cell-based methods are emerging approaches for TCR-directed antigen discovery. These methods preserve physiological TCR–pMHC interactions, can present large and defined epitope libraries and do not require substantial a priori knowledge of antigen specificity^{27,28,29,30,31,32}. The interchangeability between approaches for MHC-I and MHC-II is not trivial. The utility of cell-based, MHC/HLA-II, antigen discovery was demonstrated by Kisielow et al. using pMHC–TCR (MCR-TCR)^28,33,34, which allowed for the identification of cognate epitopes by iterative screening against libraries encoded through complementary DNA or defined libraries³⁴. More recently, TScan-II was deployed for antigen discovery of CD4⁺ T cells but requires separately engineered antigen-presenting cells (APCs)³⁵.

With the increasingly widespread use of single-cell RNA sequencing (scRNA-seq) to interrogate T cell responses, it is paramount that T cell antigen discovery methods can be scaled to investigate tens to 100s of TCRs rapidly³⁶. Recently, several algorithms for computational antigen discovery have been reported, including grouping of lymphocyte interactions by paratope hotspots (GLIPH/GLIPH2), distance measure on space of TCRs that permits clustering and visualization (tcrdist/tcrdist3) and clonotype neighbor graph analysis (CoNGA)^37,38,39. These algorithms identify TCR specificity groups comprising TCRs that share sequence similarity and/or motifs and are therefore predicted to share antigenic specificity. Recently, ‘reverse epitope discovery’ has been explored to leverage large datasets for comparison of TCR amino acid similarity⁴⁰. Ultimately, Rosati et al. were able to identify public, immunodominant CD4⁺ T cell responses across 59 individuals; however, it remains challenging to predict the antigens of private clonotypes in private datasets, highlighting the need for high-throughput methods that synergize both experimental and computational approaches¹⁰.

Here we showcase a combination of several methodological advances in applying experimental and computational tools for antigen discovery. First, we report a modular cell-based method for antigen discovery using signaling and antigen-presenting bifunctional receptors to encode MHC/HLA-II molecules presenting covalently linked peptides (SABR-IIs) for mouse and human CD4⁺ T cells. Second, we show de novo identification of epitope specificities of TCRs derived from scRNA-seq data in a mouse model of T1D. Finally, we demonstrate that experimental antigen discovery can be amplified post hoc by computational approaches. Together, we have developed an experimental and computational workflow to rapidly de-convolute the specificity of scRNA-seq-derived CD4⁺ T cells de novo.

Results

Signaling and antigen-presenting bifunctional receptors II

We have previously described SABRs, which are chimeric receptors containing an extracellular pMHC complex attached to an intracellular CD28-CD3ζ signaling domain. We demonstrated that SABRs can read out TCR–pMHC interactions, allowing the construction of SABR libraries for antigen discovery for class I HLA alleles²⁷. We sought to expand this platform to allow antigen discovery for MHC/HLA-II with seamless integration with class I alleles. Here, we created SABRs to present epitopes in MHC-II alleles, by covalently linking the epitope to the β-chain of MHC-II that is attached to the CD28-CD3ζ signaling domains downstream, along with a 2A peptide-linked MHC-II α-chain (Fig. 1a,b). To test whether SABR-IIs could present epitopes to TCRs and induce a signal, we expressed them using lentiviral vectors in NFAT–GFP Jurkat cells, which express green fluorescent protein (GFP) upon NFAT activation and translocation downstream of CD3ζ activation (a kind gift from Y. Chen and A. Weiss). We constructed murine SABR-IIs presenting epitopes in I-Ab, I-Ad and I-Ag7 (Ova, ISQAVHAAHAEINEAGR⁴¹; ATEG, ATEGRVRVNSAYQDK⁴²; and 2.5mimo, YVRPLWVRME⁴³, respectively). We co-incubated the SABR-II-expressing NFAT–GFP Jurkat cells with a separate population of Jurkat cells expressing either the BDC2.5 TCR (recognizes I-Ag7-2.5mimo), OT-II TCR (recognizes I-Ab-Ova), 5-4-E8 TCR (recognizes I-Ad-ATEG) or no TCR. Robust GFP and CD69 expression in SABR-II-expressing NFAT–GFP Jurkat cells was observed 18–20 h later in only the correctly paired assays (Fig. 1c and Extended Data Fig. 1a,b). The signal from the NFAT–GFP reporter offered minimal background in absence of a cognate TCR and correlated with surface SABR expression in the presence of a cognate TCR (Extended Data Fig. 1c–e). To demonstrate the application of SABR-IIs for human antigen discovery, we generated SABR-IIs to present the InsB9:23 epitope (SHLVEALYLVCGERG) in HLA-DQ8 (DQA1*0301:DQB1*0302, an HLA-II allele that is associated with increased risk of T1D and celiac disease^44,45). We confirmed the ability of the DQ8-InsB9:23 SABR-II to present the epitope to two previously described, T1D patient-derived TCRs GSE.6H9 and GSE.20D11 (ref. ⁴⁶). As expected, a high frequency of GFP⁺CD69⁺ cells were found only when the TCRs interacted with the InsB9:23 epitope and not a control hen egg lysozyme epitope (Fig. 1d and Extended Data Fig. 1f,g).

**Fig. 1: SABR-IIs identify cognate TCR–pMHC interactions for antigen discovery.**

To test the compatibility between human and mouse cells for the function of SABR-IIs, we performed co-incubation assays using 5KC cells (a mouse thymoma line, which was a kind gift from M. Nakayama). We observed that SABR-II–TCR interactions were retained irrespective of the host species (Extended Data Fig. 2a,b). Furthermore, we demonstrated that SABR-IIs consisting of B cell signaling domains (CD79A and CD79B), could also signal through NFAT (Extended Data Fig. 2c,d). As a further demonstration of the modularity of the SABR-II design and its potential for deployment in professional APCs, we expressed SABR-IIs containing either the CD28-CD3ζ or CD79A/B domains in Daudi B cells. We observed that the cognate interaction of both the SABRs with their TCRs resulted in upregulation of surface FAS on Daudi cells, showing that the SABR-II platform can signal in professional APCs^47,48,49 (Extended Data Fig. 2e,f).

We then asked whether SABR-IIs could be used to present a library of epitopes for CD4⁺ T cell antigen discovery. To that end, we constructed a SABR-II library to present epitopes derived from pancreatic islets in I-Ag7 by curating a list of 4,075 published epitopes from the Immune Epitope Database (iedb.org)⁵⁰ and a study by Wan et al.⁵¹ (Supplementary Table 1). Of note, this defined library consisted of unmodified epitopes from endogenous proteins, synthetic mimotopes, deamidated epitopes and hybrid insulin peptides (HIPs) that arise from post-translational fusion and are not genetically encoded in vivo^52,53. The epitope library was inserted into the I-Ag7 SABR-II backbone through pooled oligonucleotide synthesis, amplification and ligation-free cloning (Extended Data Fig. 3a). The I-Ag7 SABR-II-library was then expressed in NFAT–GFP Jurkat cells. We confirmed that after sequencing, the library accounted for a mean of 708 reads per epitope (Extended Data Fig. 3b). As a proof of concept, we performed co-incubation assays with Jurkat cells expressing the BDC2.5 TCR and sorted the top 1–2% of GFP⁺CD69⁺ cells at a rate of ~20 min per replicate with three replicates per TCR. We extracted the genomic DNA from sorted cells, amplified the SABR portion of the integrated proviruses and subjected the amplicons to Illumina sequencing (Extended Data Fig. 3c,d). The 1–2% sort gate represents >50-fold enrichment of cognate epitopes with minimal loss of signal (Extended Data Fig. 3e–h). Sequence reads were aligned to the I-Ag7 SABR-II backbone and the corresponding epitopes were scored based on their read counts. For each TCR under investigation, an enrichment score (ES) was determined for all the epitopes in a library. In each experiment, three replicates of a sort with TCR-expressing Jurkat cells were performed and reads were counted post-sequencing. In addition, three replicates of the unsorted library were also sequenced. A linear regression model was built using the unsorted library counts and used to determine the expected abundance of each epitope in the library. The ES was calculated based on the difference between the measured and the expected abundance of each epitope on a per-TCR basis (Fig. 1e). Based on ES values, two quantitative thresholds were used to determine putative cognate epitopes of a given TCR. A high-confidence zone containing clear outliers with a high ES and a low-confidence zone containing weak outliers with a moderately high ES were determined (Fig. 1e). This two-tiered strategy was used to call putative hits from screens. All the top-scoring epitopes for the BDC2.5 TCR were known BDC2.5 ligands containing the WXRM(D/E) motif (Fig. 1f,g, enriched ligands in red), a well-characterized trait of the BDC2.5 TCR^43,54,55,56. Across several independent experiments there was limited variation in ES values for the same TCRs and several epitopes fell into high- or low-confidence zones consistently (Extended Data Fig. 4a). Using a different TCR that was isolated from NOD mice, 4-8Ins⁵⁷, which recognized the InsB9:23 epitope (SHLVEALYLVCGERG), we observed a similar pattern of ES for cognate epitopes (Extended Data Fig. 4b).

To test whether HLA-DQ8 SABR-II could be used for antigen discovery, we curated a list of insulin B, insulin C and HIP epitopes published by Wiles et al.⁵⁸ and cloned them into the DQ8 SABR-II backbone using the same pooled cloning strategy as the I-Ag7 SABR-II library (Supplementary Table 2). Furthermore, we combined SABR-I (the HLA-A*0201 library reported in our previous work²⁷) and SABR-II libraries at a cellular level and screened against the GSE.20D11 TCR. As expected, the cognate epitope of the GSE.20D11 TCR, SHLVEALYLVCGERG (red), was enriched at a high confidence level from a combined class I and II library (Fig. 1h). This demonstrates that a combined library approach using the SABR platform can be implemented to increase throughput. Together, these results demonstrate the ability of SABR-IIs to successfully read out pMHC-II–TCR interactions across species and cell types and serve as a method for CD4⁺ TCR antigen discovery.

Single-cell profiling of islet-infiltrating CD4⁺ T cells

We sought to apply SABR-II libraries to identify the specificities of islet-infiltrating CD4⁺ T cells in NOD mice. Although NOD mice recapitulate many features of T1D and share several autoantigens with individuals with T1D^53,59,60,61, the overall antigenic landscape of islet-infiltrating CD4⁺ T cells in NOD mice remains undefined. Therefore, we performed scRNA-seq with V(D)J enrichment on T cells from individual pancreatic islets of 6-, 8- and 10-week-old NOD mice. We sorted Thy1.2⁺TCRβ⁺ T cells from 3–4 mice at each time point, combined them using TotalSeq cell-hashing oligonucleotides and proceeded to scRNA-seq using the 10x Genomics platform. In total, T cells from 11 mice were sequenced in three batches and the data were pooled for analysis. Hierarchical clustering in Seurat⁶², followed by bioinformatic gating on CD4⁺ T cells and re-clustering, revealed seven distinct CD4⁺ T cell clusters with no obvious bias between mice (Fig. 2a and Extended Data Fig. 5a,b). Next, we integrated TCR clonotypes with the transcriptomes using scRepertoire⁶³ and identified the clonally expanded populations of CD4⁺ T cells (Fig. 2b). Clonal expansion was categorized as single (one clone per TCR), low (2–9 clones per TCR) or medium (≥10 clones per TCR). Clonal expansion was evident in clusters 0 and 3–6 (Fig. 2c). Generally, clonal expansion correlated with the expression of activation and exhaustion markers (Nkg7, Ccl5, Lag3 and Tigit), whereas naive T cell markers (Sell and Ccr7) coincided with un-expanded populations. We reasoned that clonally expanded cells within the islets were the most likely to target islet antigens and contribute to β-cell destruction. Therefore, we used clonal expansion as the sole criterion for selecting TCRs for antigen discovery. Overall, clonally expanded TCRs showed a slight skew toward certain Vα and Vβ alleles (Extended Data Fig. 5c–e), as has been reported previously^64,65. Notably, expanded clones did not segregate solely based on their gene expression as indicated by the high degree of clonal sharing between CD4⁺ TCR clusters determined by the Morisita–Horn Index (Fig. 2d and Extended Data Fig. 5f). Clonally expanded TCRs showed increased expression of Lag3, similar to a restrained CD8⁺ T cell phenotype that was reported previously in NOD mice⁶⁶. Further investigations into the transcriptional signatures of expanded T cells were reported previously⁶⁷. Specifically, we identified 35 clonally expanded TCRs for screening, corresponding to 19 TCRs from three 8-week-old mice and 16 TCR from two 10-week-old mice (Supplementary Table 3). We reconstructed the TCRs using a home-brewed Python script that reconstructs full TCRα/β chains using the IMGT TCR allele dataset (Methods)⁶⁸. The reconstructed TCR genes were synthesized through commercial vendors and subcloned into the pMIG-II–IRES–GFP vector containing a partial Cβ-chain derived from the BDC2.5 TCR. TCRs in the pMIG-II vector were then packaged intro retroviruses and expressed in Jurkat cells. Surface expression was confirmed by staining for murine TCRβ followed by flow cytometry. For TCRs with low transduction levels, we enriched the TCRβ⁺ population using either fluorescence-activated cell sorting or magnetic selection and proceeded with antigen discovery with SABR-II libraries (Fig. 2e).

**Fig. 2: Single-cell RNA sequencing of islet-infiltrating CD4⁺ T cells.**

Identifying cognate epitopes of CD4⁺ TCRs de novo

We performed systematic screening of the cloned TCRs against the I-Ag7 SABR-II library. Several TCRs along with a positive control (such as BDC2.5 or 4-8Ins) were screened individually against the library for each sort (Extended Data Fig. 6a and Supplementary Table 4). High- and low-confidence ES zones for each screened TCR were defined by the ES values of the control TCR’s cognate epitopes. For all putative cognate epitopes, single SABR-IIs were constructed, expressed in NFAT–GFP Jurkat cells and used for co-incubation with the corresponding TCRs. Co-incubation assays that yielded a GFP signal higher than that obtained in assays with no TCR were determined to be positive and the epitopes deemed true cognate ligands (Extended Data Fig. 6b,c). Using this strategy, we obtained epitopes in the high-confidence zone for eight TCRs (Fig. 3a and Extended Data Fig. 6b,d). Among numerous altered peptide ligands (APLs), these TCRs recognized the physiological InsC-ChgA HIP (LQTLALWSRMD and analogs, recognized by TCR5, TCR6B, TCR9 and TCR34), InsC-Iapp HIP (LQTLALNAARDP and analogs, recognized by TCR4 and TCR15) and InsB9:23 (SHLVEALYLVCGERG and analogs recognized by TCR24 and TCR37). These cognate high-confidence hits were validated using single SABR-II co-incubation assays (Fig. 3b and Extended Data Fig. 7a). Further validations using in vitro mouse interleukin-2 (mIL-2) secretion by TCR-expressing 5KC reporter cells¹³ or CD25 expression by TCR-expressing splenic CD4⁺ T cells upon stimulation with the cognate epitope were performed (Extended Data Fig. 7b,c). Furthermore, low-confidence hits were called for ten TCRs and tested in co-incubation assays. Upon co-incubations, two out of the ten TCRs (TCR11 and TCR30) showed confirmation of reactivity, both recognizing InsB9:23 (SHLVEALYLVCGERG and analogs; Extended Data Fig. 7d and Fig. 3b). Notably, visualization of the cells corresponding to each de-convoluted TCR clone did not reveal overt differences in the transcriptional phenotype of cells recognizing the three different antigens (Fig. 3c). Taken together, these results indicate that SABR-II libraries can successfully identify cognate epitopes of CD4⁺ TCRs among thousands of epitopes for TCR-directed antigen discovery, starting simply from a TCR sequence with little a priori knowledge.

**Fig. 3: De novo identification of cognate epitopes for expanded CD4⁺ T cells.**

TCR similarity predictions amplify antigen discovery

We hypothesized that computational grouping of TCR specificities may reveal closely related TCRs that potentially recognize the same epitope(s), similar to the reverse epitope discovery approach (Fig. 4a). In the absence of experimental antigen discovery, grouping of TCRs is not informative of reactivity; however, we hypothesized that TCRs that co-clustered with SABR-II de-convoluted TCRs bind to the same antigens. To test this, we used three TCR-similarity search algorithms: GLIPH2 (refs. ^38,69), tcrdist3 (ref. ³⁷) and CoNGA³⁹. All three algorithms take slightly different approaches to group TCR sequences and generate clusters of TCR sequences that share high sequence similarity. In addition, CoNGA considers the transcriptional similarities among T cell clones. Using CoNGA, we defined TCR clusters for two TCRs, TCR4 and TCR6B, and identified analogs that slightly differed in sequences. Moreover, for TCR30, we were able to identify six TCR analogs that co-clustered in CoNGA analysis as well as GLIPH2. For TCR11, we first identified a gene expression (GEX) cluster that had ~50 TCRs that clustered based on gene expression. Using tcrdist3, we calculated the relative distance of each of these TCRs from TCR11 and selected the top seven clonotypes for expression. Together, 16 TCRs were identified as analogs of the experimentally de-convoluted TCRs (Extended Data Fig. 8). These TCRs were cloned and expressed in Jurkat cells. We performed co-incubation assays using single SABR-IIs and observed that 5 of 16 TCRs recognized the same epitopes as the parental TCRs (Fig. 4b). As a result, we were able to identify the cognate epitopes of five additional TCRs from our dataset that had otherwise not been selected for SABR-II screening based on our clonal expansion cutoff. Notably, the computationally identified and experimentally validated TCRs shared similar phenotypes as the experimentally de-convoluted TCRs (Fig. 4c). Therefore, we demonstrated that computational TCR similarity determinations could amplify experimental antigen discovery, leading to the deconvolution of 16 private TCRs de novo.

**Fig. 4: Computational prediction of antigen specificity amplifies SABR-II antigen discovery.**

Identifying new HIP epitopes using SABR-II libraries

Given the predominance of HIP-reactive TCRs, we hypothesized that there may be other TCRs that respond to HIPs that were not encoded in our initial library configuration. While the initial I-Ag7 SABR-II library consisted of a number of HIPs, HIP formation is thought to be more widespread in pancreatic β-cells^52,70. Therefore, we sought to construct a defined, HIP-focused library to probe whether there were any undiscovered HIP-reactive TCRs that could be recognized by the clonally expanded TCR in our dataset. To test this, we utilized a published proteomic dataset, which predicted that several proteins that were highly expressed in secretory granules of β-cells may contribute to HIP formation⁵⁸. Using their predictions, we built a theoretical HIP library, in which all possible ‘left’ halves of the insulin C chain derived from natural cleavage products were fused to ‘right’ halves derived from secretory granule proteins (Fig. 5a). This 2,561-epitope library (12–25 amino acids per epitope) consisted of only HIPs and a small number of positive control epitopes (Supplementary Table 5). We screened the top three clonally expanded TCRs (TCR1, TCR2 and TCR3) against this library, as these TCRs had not been de-convoluted using the original library. We did not observe any putative hits for TCR1 and TCR2; however, TCR3 yielded several high- and low-confidence hits, all of which have not previously been reported (Fig. 5b). To confirm that the HIP itself was important for the cognate interaction, we cloned single 14-mer epitopes into SABR-IIs consisting of seven amino acids of the left portion of the HIP and seven amino acids of the right portion of the HIP. In this way, no nine amino acids from either peptide sequence alone could occupy the binding pocket of I-Ag7, ensuring that TCR reactivity spanned the HIP junction⁷¹. Upon single SABR co-incubations, all but one of the tested hits for TCR3 showed reactivity (Fig. 5c). Of note, in all the epitopes that were tested, the ‘left’ half derived from insulin C was conserved, whereas the ‘right’ halves were derived from several other proteins (Fig. 5d). These results show that defined theoretical SABR-II libraries can be deployed for determining non-contiguous epitope reactivity as well as TCR promiscuity. Moreover, the promiscuous binding of TCR3 to HIPs corroborates evidence from other NOD mouse-derived TCRs reacting to multiple HIPs⁷².

**Fig. 5: HIP target library for identification of TCR with new HIP specificity.**

Technical advances afforded by SABR-II screens

Finally, we sought to address two important aspects of antigen discovery techniques. First, we assessed whether SABR-II screens can directly read out the strength of TCR–pMHC binding. To that end, we selected six known BDC2.5 ligands across a range of ES values (Extended Data Fig. 9a) and we measured the functional avidity of their recognition by BDC2.5 TCR in vitro. Bone-marrow-derived dendritic cells were pulsed with a range of concentrations of peptides corresponding to the epitopes and used to present the peptides to BDC2.5 TCR-expressing 5KC cells. Secretion of mIL-2 was measured by ELISA and used to determine the functional avidity as EC₅₀ (concentration of the peptide needed to induce half-maximal mIL-2) (Extended Data Fig. 9b). We observed that there was a modest negative correlation between the EC₅₀ values of the epitopes and their ES values (Extended Data Fig. 9c). These results indicate that ES values can provide a semi-quantitative readout of the strength of interactions between TCRs and their cognate epitopes. Second, we evaluated whether we could increase the throughput of SABR-II screens by multiplexing TCRs and libraries at the cellular level. We combined the two previously described I-Ag7 libraries in equal proportions according to their size and used it as a single library. We also employed a dropout strategy, in which a mixture of seven TCRs was screened in replicate, where one TCR was left out in each replicate. After single enrichment, we determined the mean ES of all replicates that contained a given TCR and used it to identify the cognate epitope of that TCR (Extended Data Fig. 10 and Supplementary Table 6). Using this strategy, we were able to successfully recapitulate the results for four out of four TCRs previously identified in individual screens. The use of such a strategy will greatly enhance the throughput or SABR-II screens by reducing the hands-on sort time from 1 h per TCR to 20 min per TCR. These results show features that have been uniquely demonstrated by SABR-II screens and should increase the throughput of antigen discovery.

Discussion

Here, we report SABR-IIs for CD4⁺ T cell antigen discovery, providing a robust method for screening a large number (1,000s to 10,000s) of epitopes. SABR-IIs can identify TCRs rapidly and can semi-quantitatively read out TCR–pMHC binding strengths. We have also shown that other non-T cell types can also be used to detect cognate interactions, expanding antigen discovery to professional APC-based platforms. Notably, SABR-II libraries can easily encode for deamidation and HIP formation, which are both post-translational modifications. Through this approach, we identified several new HIPs that were targeted by islet-infiltrating T cells and demonstrate an HIP-focused cell-based library strategy.

Moreover, we demonstrate a robust pipeline for reconstructing TCRs from scRNA-seq data and identifying their epitopes. The ability to start from and reconstitute TCRα/β sequences means that precious human samples are not wasted and can be assayed using additional methods. Furthermore, starting from scRNA-seq has the built-in advantage of leveraging the transcriptional information for each clone of an identified specificity, not limited by a few phenotypic surface markers or agnostic of the T cell’s function altogether. While we have chosen to profile the top expanded T cell clones in this study, we envision that future efforts can be focused on specific phenotypes of interest, such as regulatory T cells. In this way, both the environment from which the T cells are sampled and the properties of the T cells themselves will help further shape hypothesis-driven antigen discovery in autoimmune diseases such as T1D.

The ability to amplify antigen discovery using related TCRs by leveraging existing computational methods not only validates their utility but generates a positive-feedback loop for increased repertoire profiling and validation of TCR specificity. This will lead to an overall enlargement of the known epitope-specific TCR repertoire and provide incorporation of orthogonally obtained datasets for de novo antigen discovery. Finally, SABR-IIs in conjunction with SABRs, allow parallel antigen discovery for CD4⁺ and CD8⁺ T cells within the same platform and experiments.

We do wish to highlight the current limitations of our technique. The SABR-II in its current iteration is similar to the MCR-TCR platform^28,33, which encodes for a signal emanating from MHC-II. There are several design differences that confer different capabilities to SABR-IIs, namely, the ability to perform single enrichments on larger libraries, the ability to multiplex TCRs and the ability to screen for both class I and II alleles. Notably, as the signaling domains of SABR-II are modular, SABR-IIs can be expressed and deployed in professional APCs; however, there are also key differences, such as lower library sizes, especially compared to the cDNA-generated libraries. As with the current cell-based epitope discovery methods, SABR-IIs cannot match the scale of yeast display, which can reach up to 10⁸ epitopes for profiling. Techniques such as TScan-II have shown genome-scale antigen discovery; however, they cannot be used for both class I and class II discovery in the same platform³⁵. Therefore, while not required, certain a priori criteria such as MHC binding prediction, tissue expression patterns or known immunopeptidomic datasets greatly enhance SABR-II library design. SABR-II screens are currently performed as ‘few against many’ assays, allowing tens of TCRs to be screened in a single day. The computational prediction tools we used here also pose inherent limitations to our workflow. As shown, 10 of 16 computationally predicted TCRs did not recognize the same antigens as the parental TCRs. This may be due to the erroneous calling of clonotypes or due to the analog-binding variations of the epitopes tested here. Either way, while we were able to amplify experimental antigen discovery, caution must be taken to not presume that prediction equals actual binding.

While we showed de novo identification of the 11 top expanded TCRs out of 36, we did not identify the cognate epitopes of the remaining TCRs. This could be due to several reasons. First, we used a published MHC elution dataset, which inherently has high specificity but low sensitivity for detecting MHC-II-bound epitopes. Building new SABR-II libraries based on tissue-specific gene expression may benefit by casting a wider net in search of cognate epitopes. In addition, a hallmark of numerous autoreactive diseases is the reactivity to post-translationally modified epitopes^73,74. While we were able to encode hybrid and deamidated epitopes in our SABR-II libraries, we are developing approaches to incorporate a wider range of chemical modifications. Finally, the antigen sensitivity of class I SABRs is inherently lower than those of TCRs. We expect that SABR-IIs may also have a similar limitation, where very-low-affinity antigens do not generate a strong SABR signal and remain below the limit of detection without further modification, such as the introduction of a disulfide trap to stabilize the MHC and fix weak binding registers in place.

In summary, this study demonstrates that wielding SABR-IIs for TCR-directed antigen discovery and amplifying discovery with existing computational methods is a powerful combination for understanding CD4⁺ T cell specificities. By increasing the ability to survey the T cell repertoire we envision a more comprehensive catalog of the T cell ‘reactome.’

Methods

Ethics statement

All animal work was performed as per Institutional Animal Care and Use Committee (IACUC) guidelines under an approved IACUC protocol (no. 20037102). All experimental work was performed according to the institutional biosafety committee protocols.

Reagents and oligonucleotide primers

Reagents and oligonucleotide primers methods can be found in Supplementary Table 7. The lists of epitopes in the SABR-II libraries can be found in Supplementary Tables 3, 4 and 6.

Cell lines and peptides

Jurkat cells (ATCC) and Daudi cells (ATCC) were cultured in R10 (RPMI 1640 medium (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml⁻¹ penicillin–streptomycin (Corning)). NFAT–GFP Jurkat cells were a kind gift from A. Weiss and Y. Chen and were cultured in R10 supplemented with 2 mg ml⁻¹ Geneticin (Corning). HEK293T cells (ATCC) were cultured in D10 (DMEM (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml⁻¹ penicillin–streptomycin (Corning)). 5KC cells were a kind gift from M. Nakayama and were cultured in IMDM (Gibco) with 10% FBS (Gemini Bio) and penicillin–streptomycin. All cell culture was performed at 37 °C with 5% CO₂ in a humid cell culture incubator. Primary CD4⁺ T cells were isolated from spleens for NOD mice using a STEMCELL murine CD4⁺ T cell-positive selection kit (STEMCELL Technologies) and cultured in R10 (RPMI 1640 medium (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml⁻¹ penicillin–streptomycin (Corning)) supplemented with 5 U ml⁻¹ IL-2 (R&D Biosciences).

Mice

Mice were housed in microisolator cages with up to five mice per cage in a 14-h light–10-h dark cycle. Temperatures of 65–75 °F (~18–23 °C) with 40–60% humidity were maintained. There was constant access to water. NOD/ShiLtJ (strain 001976, The Jackson Laboratory) mice were purchased at the age of 4 weeks. The mice were fed autoclaved rodent breeder diet (T. R. Last). Female mice were used for scRNA-seq and validation assays. For scRNA-seq, 6-, 8- or 10-week-old female mice were used. All animal work was performed under IACUC protocols in the Association for Assessment and Accreditation of Laboratory Animal Care-certified animal facility at the University of Pittsburgh.

Construction of SABRs

SABRs were designed by assembling the individual component sequences in Snapgene (DNAstar). HLA allele chains were downloaded from IMGT and MHC allele chains were downloaded from UniprotKB. SignalP-5.0 (ref. ⁷⁵) was used to predict the signal sequence and truncate it. The signaling domains were derived from the previously published SABR constructs²⁷. Beta-chain-Signaling-2A-Alpha-chain fragments were assembled and codon-optimized using IDT’s codon optimization tool. BsmBI sites were replaced without affecting the amino acid sequences and EcoRI sites were added at the ends. A 2-kb stuffer fragment was also synthesized according to previously published sequences²⁷. Open reading frames were synthesized as gBlocks (IDT) and assembled using PCR (KOD mastermix, Milipore Sigma) using the following primers: 2kb-Insert-gBlock-F; 2kb-Insert-gBlock-R; BsmBI-Insert-Fwd; and ClassII-Alpha-Rev. The assembled full-length inserts were gel purified (Takara), digested with EcoRI (NEB), ligated in EcoRI-digested pCCLc-MND-X (a kind gift from D.B. Kohn) and transformed using NEB-5α cells (NEB). Inserts were verified using MND_Input_Verify_F and MND_Input_Verify_R primers. Once full-length backbones were cloned, they were used to clone individual epitopes. To insert epitopes, SABR vectors were digested with BsmBI along with alkaline phosphatase (rSAP, NEB) to excise the 2-kb stuffer fragment. Two complementary oligonucleotides, SABR-epitope-F and SABR-epitope-R, were synthesized for each epitope. Oligonucleotides were annealed to each other, phosphorylated and ligated into the BsmBI-digested backbone (T4 Ligase, NEB) and transformed in NEB-5α cells (NEB). For cloning SABR libraries, oligonucleotide pools containing overhangs (oligonucleotide epitope primer) were synthesized via Twist Biosciences. The pool was amplified using ClassII-Oligo-Fwd and ClassII-Oligo-Rev and cloned in a BsmBI-digested backbone using Infusion HD cloning (Takara). Bacteria were plated on LB agar containing 100 μg ml⁻¹ carbenicillin (Life Technologies), grown overnight and single colonies were selected for verification by Sanger sequencing (Azenta). Successful clones were used to inoculate liquid culture for overnight growth followed by plasmid minipreps (Zyppy miniprep kit, Zymo). Pooled libraries were subjected to maxipreps (Nucleobond Maxiprep EF kit, Takara). Library coverage was determined by comparing the number of total colonies transformed to the number of epitopes encoded in the library. For B cell receptor SABRs, the protein sequences CD79A and CD79B domains were obtained from UniprotKB and fused with full-length MHC-II chains and obtained via commercial synthesis (Twist Biosciences). Epitopes were cloned in the B cell receptor SABR backbone as described above, except that the stuffer fragment was removed using XhoI digestion (NEB).

scRNA-seq of islet-infiltrating T cells and analysis

NOD mice were killed by CO₂ asphyxiation and immediately dissected for pancreas perfusion and individual islet picking as previously decsribed⁶⁶. Pancreas perfusion was performed under a dissecting microscope. The pancreatic duct was clamped using surgical clamps and 3 ml 600 U ml⁻¹ Collagenase IV (Gibco) dissolved in HBSS (Gibco) was injected using a 30G needle. Perfused pancreata were collected and incubated at 37 °C for 30 min. After the incubation, HBSS with R10 was added to quench collagenase. After washing twice with HBSS + R10, the tissue was plated on a 10-cm plate and individual islets were picked using a micropipette. Islets were then incubated in dissociation buffer (Gibco), centrifuged and resuspended in the staining mix (1:500 dilution of anti-Thy1.2-BV605 + 1:500 dilution of Live/Dead-APC-Cy7 and 1:100 dilution of cell-hashing TotalSeq antibodies (BioLegend)). After staining, the cells were resuspended in PBS + 0.04% BSA (Millipore Sigma) and sorted on a BD FACS Aria III sorter. After sorting the cells, they were counted and processed for scRNA-seq. Cells were processed using 10× 5′ single-cell gene expression kit v3 in a Chromium controller according to the manufacturer’s protocols. V(D)J enrichment was performed using the single-cell 5′ VDJ enrichment kit according to the manufacturer’s protocols. Libraries were sequenced on a HiSeq4000 (Novogene) with a 70:20:10 mix for gene expression:VDJ:hashing libraries. Sequence data were downloaded on the Joglekar laboratory server and aligned to the mouse genome (Mm10) using CellRanger v.4.0.0 (10x Genomics). TCR annotation was performed using CellRanger vdj using mouse GRCm38 assembly. All three time points were sequenced and processed separately. CellRanger and CellRanger vdj output files were used as inputs in Seurat⁶² for normalization, scaling and dimensionality reduction. The packaged scRepertoire was used for TCR clonotype calling and analyses. The data were normalized using NormalizeData and scaled using ScaleData functions in Seurat. The scRepertoire⁶³ functions combineTCR and combineExpression were used to add TCR clonotypes to each cell. The HTODemux function in Seurat was used to demultiplex cell hashes and assign the correct mouse identity to each cell. At this point, all three time points were merged in Seurat using the merge function. After merging, integration was performed using FindIntegrationAnchors and IntegrateData functions. Principal-component analysis was performed using RunPCA. The top 20 principal components were used for UMAP, followed by cluster identification using FindNeighbors and FindClusters. CD4⁺ T cells were subsetted using FeatureScatter and CellSelector functions and reclustered. Cluster markers were defined by the FindAllMarkers function. Clonotype data were sorted according to expansion and exported as a csv file. UMAP representations with clonotypes were generated using the highlightClonotypes function in scRepertoire. Differentially expressed genes were identified using the FindMarkers function using DESeq2 statistics and represented using EnhancedVolcano function. For the related manuscript⁶⁷ (Xiao, Rohimikollu and Rosengart et al.), single (1), low (2–9) and medium (≥10) clonotypes were subsetted in Seurat and exported as Seurat objects for further analyses. All scRNA-seq analyses were performed using RStudio (v.2023.12.1+402).

TCR reconstruction and synthesis

TCR Vα, Jα, Vβ and Jβ alleles along with CDR3α and CDR3β sequences were used as the input to reconstruct full-length TCR sequences using the TCRgen_mouse.opt_v2.py script (available on GitHub at https://github.com/joglekar-lab/SABR-II). Mouse reference sequences were downloaded from IMGT. Full-length TCR sequences (TCRα-2A-TCRβ) flanked by EcoRI site and truncated at the BlpI site in Cb were synthesized as gene fragments via Twist Biosciences. TCR gene fragments were amplified using TCR-gene-fwd and TCR-gene-rev primers and subcloned using a pMIG-II vector containing BDC2.5 TCR (Vignali laboratory) using EcoRI-BlpI. Successful cloning was verified using Sanger sequencing (Azenta).

TCR similarity determinations

Exported clonotypes were used as inputs for GLIPH2 (ref. ⁶⁹). For CoNGA, the merged dataset was exported as a .h5ad file and used as an input along with the CellRanger vdj output file. CoNGA analysis was performed using default parameters³⁹. Pairwise relative distances among TCRs were calculated using tcrdist3 (ref. ³⁷). CoNGA, tcrdist3 and GLIPH2 output files were searched manually for analogs that co-cluster with experimentally de-convoluted TCRs. Analogs were synthesized and cloned as described above.

Generation and cloning of SABR libraries

To generate the I-Ag7 restricted SABR library, we combined all Immune Epitope Database epitopes with a published immunopeptidome generated by Wan et al.⁵¹. Sequences were filtered remove all post-translational modifications except deamidation and HIPs and trimmed between 9–25 amino acid lengths. For the insulin C HIP and HLA-DQ8 library, non-contiguous epitopes from Wiles et al.⁶⁶ as well as all Immune Epitope Database epitopes were combined to generate the epitope list. Epitope sequences were back-translated using the backtranslate_fast.py script.

Lentiviral vector production and transduction

Lentiviral vectors to express SABRs or TCRs were packaged via previously described procedures^27,76. In brief, HEK293T cells were plated in six-well plates at 1 × 106 cells per well. After 24 h, cells were transfected with a mixture of the lentiviral shuttle plasmid (1 μg per well), pMDG-VSVG (0.2 μg per well) and pCMV-RD8.9 (1 μg per well) (both kind gifts from D.B. Kohn) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies) using the TransIT-293 manufacturer’s protocol. After 3 days, viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). When possible, the freshly filtered virus was used to transduce 1 × 10⁶ Jurkat cells per ml of the virus. Occasionally, the virus was stored at −80 °C until use. For NFAT–GFP Jurkat cells, Geneticin was added 24 h following transduction.

Retroviral vector production and transduction

Retroviral vectors (pMIG-II) to express TCRs were packaged via previously described procedures⁷⁷. In brief, HEK293T cells were plated in six-well plates at 1 × 10⁶ cells per well. After 24 h, the cells were transfected with a mixture of the retroviral shuttle (1 μg per well), pRD114 (0.8 μg per well) and pHIT60 (1 μg per well) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies). The following day, viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). Transduction of 2.5 × 10⁵ Jurkat cells was performed using RetroNectin (Takara) binding according to the manufacturer’s protocol using the filtered virus. For primary murine CD4⁺ T cells and 5KC cells, Phoenix-ECO cells (ATCC) were plated in six-well plates at 1 × 10⁶ cells per well. After 24 h, the cells were transfected with the retroviral shuttle (2.5 μg per well) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies) using the TransIT-293 manufacturer’s protocol. At 48 h after transfection viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). Transduction of 2.5 × 10⁵ 5KC or primary murine CD4⁺ T cells was performed using RetroNectin (Takara) binding according to the manufacturer’s protocol using the filtered virus. Before transduction, primary murine CD4⁺ T cells were stimulated and grown for 24 h on 24-well plates coated with RetroNectin, 2 μg ml⁻¹ anti-CD3ε (BioLegend) and 1 μg ml⁻¹ anti-CD28 (BioLegend).

Co-culture assays

For SABR library screens, 3 × 10⁶ NFAT–GFP Jurkat cells expressing the SABR library were labeled with CellTrace Violet (BioLegend) according to the manufacturer’s protocol before incubation with 3 × 10⁶ Jurkat cells expressing the TCR of interest. These mixtures were incubated in a six-well plate for 16–20 h. Cells were stained with anti-CD69-APC-Cy7 where indicated (BioLegend) and the top 1–2% of GFP⁺CD69⁺ cells were sorted for genomic DNA extraction, indexing and sequencing (see below). Multiplexed assays were scaled on a per-TCR basis (for example 3 × 10⁶ for each of three TCRs against 9 × 10⁶ library cells). For single SABR assays, unless otherwise defined, 5 × 10⁵ SABR expressing NFAT–GFP Jurkat cells (or 5KC cells) were labeled with CellTrace Violet (BioLegend) according to the manufacturer’s protocol before incubation with 5 × 10⁵ TCR-expressing Jurkat cells (or 5KC cells) in a round-bottom 96-well plate for 16–20 h. Cells were stained with anti-CD69-APC-Cy7 when indicated and acquired on the Attune NxT flow cytometer (Thermo Fisher Scientific). All flow analysis was performed using FlowJo (BD). For Daudi cell co-culture, 1 × 10⁶ Jurkat cells expressing the TCR of interest were incubated with 1 × 10⁶ Daudi cells expressing a SABR of interest for 3 days. On day 3, cells were labeled with anti-RT1B-PE and anti-Fas-APC-Cy7 before being acquired on the Attune NxT flow cytometer.

High-throughput sequencing and analysis

Genomic DNA was extracted from sorted cells immediately after sorting, using the PureLink genomic DNA extraction kit (Life Technologies). The integrated SABR vectors were amplified with KOD polymerase (Millipore) and two rounds of amplification. In the first round, IDT-UD-SABR-C2-F and IDT-UD-SABR-C2-R primers were used to amplify the epitope. In the second round, UDI0001-R and UDI0001-F primers (representative of index 1) were used to add Illumina unique dual indexes (UDIs) to the amplicons. A different UDI was used for each sample. The reactions were pooled and purified with the NucleoSpin gel and PCR purification kit (Takara). The purified PCR product was checked before sequencing using 2% agarose gel and subjected to sequencing on a HiSeq4000 (Fulgent Genetics). Unaligned reads generated by the sequencer were stored in FASTQ files. FASTQ files were concatenated to generate one file for read1 and read2 each. The sequences were demultiplexed into individual indexes using demultiplex_dual.py. Epitopes were extracted and scored using epitope_extract_fastq_v1.1.py and merge_counts_split_v2.1.py. The ES was calculated using Microsoft Excel (Microsoft) workbooks.

Peptide pulsing assays

Bone-marrow-derived dendritic cells (BMDCs) were generated according to Abcam’s protocol (https://www.abcam.com/protocols/bmdc-isolation-protocol) by isolating bone marrow from NOD mice and differentiating these cells in granulocyte–macrophage colony-stimulating factor (R&D Systems) for 7 days. On day 7, 2 × 10⁴ BMDCs were resuspended in R10 and plated in a flat-bottom 96-well plate. Tenfold serial dilutions of each peptide were added to the BMDCs and left to incubate for 1 h. After 1 h, 5 × 10⁴ 5KC or primary murine CD4⁺ T cells were added to the peptide-pulsed BMDCs. The assay was left to incubate for 24 h, at which point cells were spun down, supernatant was collected and used for mIL-2 detection with the LEGEND MAX Mouse IL-2 ELISA kit (BioLegend). Peptides were custom ordered from GenScript.

Statistical analysis

Flow cytometry plots were analyzed with FlowJo v.10. Statistical analyses and graphical representations were generated by Microsoft Excel and GraphPad Prism v.9 and v.10 (GraphPad).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Sequencing data are available on the Gene Expression Omnibus under accession ID GSE247410. SABR-II plasmids, SABR-II libraries and TCRs will be made available upon request, given the large number of them. The individual sequences of epitopes as well as sufficient information to reconstruct the TCRs are provided in supplementary files. Source data are provided with this paper.

Code availability

All necessary scripts are deposited to GitHub at https://github.com/joglekar-lab/SABR-II.

References

Davis, M. M. & Bjorkman, P. J. T-cell antigen receptor genes and T-cell recognition. Nature 334, 395–402 (1988).
Article CAS PubMed Google Scholar
Robins, H. S. et al. Comprehensive assessment of T-cell receptor β-chain diversity in αβ T cells. Blood 114, 4099–4107 (2009).
Article CAS PubMed PubMed Central Google Scholar
Qi, Q. et al. Diversity and clonal selection in the human T-cell repertoire. Proc. Natl Acad. Sci. USA 111, 13139–13144 (2014).
Article CAS PubMed PubMed Central Google Scholar
de Greef, P. C. et al. The naive T-cell receptor repertoire has an extremely broad distribution of clone sizes. eLife 9, e49900 (2020).
Article PubMed PubMed Central Google Scholar
Oh, D. Y. & Fong, L. Cytotoxic CD4(+) T cells in cancer: expanding the immune effector toolbox. Immunity 54, 2701–2711 (2021).
Article CAS PubMed PubMed Central Google Scholar
Moss, P. The T cell immune response against SARS-CoV-2. Nat. Immunol. 23, 186–193 (2022).
Article CAS PubMed Google Scholar
James, E. A., Pietropaolo, M. & Mamula, M. J. Immune recognition of β-cells: neoepitopes as key players in the loss of tolerance. Diabetes 67, 1035–1042 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pugliese, A. Autoreactive T cells in type 1 diabetes. J. Clin. Invest. 127, 2881–2891 (2017).
Article PubMed PubMed Central Google Scholar
Spence, A. et al. Revealing the specificity of regulatory T cells in murine autoimmune diabetes. Proc. Natl Acad. Sci. USA 115, 5265–5270 (2018).
Article CAS PubMed PubMed Central Google Scholar
Joglekar, A. V. & Li, G. T cell antigen discovery. Nat. Methods 18, 873–880 (2021).
Article CAS PubMed Google Scholar
Williams, T. et al. Development of T cell lines sensitive to antigen stimulation. J. Immunol. Methods 462, 65–73 (2018).
Article CAS PubMed PubMed Central Google Scholar
Parish, C. R., Glidden, M. H., Quah, B. J. & Warren, H. S. Use of the intracellular fluorescent dye CFSE to monitor lymphocyte migration and proliferation. Curr. Protoc. Immunol. https://doi.org/10.1002/0471142735.im0409s84 (2009).
Article PubMed Google Scholar
Mann, S. E. et al. Multiplex T cell stimulation assay utilizing a T cell activation reporter-based detection system. Front. Immunol. 11, 633 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bercovici, N., Duffour, M. T., Agrawal, S., Salcedo, M. & Abastado, J. P. New methods for assessing T-cell responses. Clin. Diagn. Lab Immunol. 7, 859–864 (2000).
Article CAS PubMed PubMed Central Google Scholar
Zhang, S. Q. et al. High-throughput determination of the antigen specificities of T cell receptors in single cells. Nat. Biotechnol. https://doi.org/10.1038/nbt.4282 (2018).
Article PubMed PubMed Central Google Scholar
Newell, E. W., Klein, L. O., Yu, W. & Davis, M. M. Simultaneous detection of many T-cell specificities using combinatorial tetramer staining. Nat. Methods 6, 497–499 (2009).
Article CAS PubMed PubMed Central Google Scholar
Klenerman, P., Cerundolo, V. & Dunbar, P. R. Tracking T cells with tetramers: new tales from new tools. Nat. Rev. Immunol. 2, 263–272 (2002).
Article CAS PubMed Google Scholar
Dolton, G. et al. More tricks with tetramers: a practical guide to staining T cells with peptide-MHC multimers. Immunology 146, 11–22 (2015).
Article CAS PubMed PubMed Central Google Scholar
Novak, E. J., Liu, A. W., Nepom, G. T. & Kwok, W. W. MHC class II tetramers identify peptide-specific human CD4(+) T cells proliferating in response to influenza A antigen. J. Clin. Invest. 104, R63–R67 (1999).
Article CAS PubMed PubMed Central Google Scholar
Nepom, G. T. MHC class II tetramers. J. Immunol. 188, 2477–2482 (2012).
Article CAS PubMed Google Scholar
Vollers, S. S. & Stern, L. J. Class II major histocompatibility complex tetramer staining: progress, problems, and prospects. Immunology 123, 305–313 (2008).
Article CAS PubMed PubMed Central Google Scholar
Rius, C. et al. Peptide-MHC class I tetramers can fail to detect relevant functional T cell clonotypes and underestimate antigen-reactive T cell populations. J. Immunol. 200, 2263–2279 (2018).
Article CAS PubMed PubMed Central Google Scholar
Boder, E. T. & Wittrup, K. D. Yeast surface display for screening combinatorial polypeptide libraries. Nat. Biotechnol. 15, 553–557 (1997).
Article CAS PubMed Google Scholar
Wen, F. & Zhao, H. Construction and screening of an antigen-derived peptide library displayed on yeast cell surface for CD4⁺ T cell epitope identification. Methods Mol. Biol. 1061, 245–264 (2013).
Article CAS PubMed Google Scholar
Wen, F., Esteban, O. & Zhao, H. Rapid identification of CD4⁺ T-cell epitopes using yeast displaying pathogen-derived peptide library. J. Immunol. Methods 336, 37–44 (2008).
Article CAS PubMed Google Scholar
Birnbaum, M. E. et al. Deconstructing the peptide-MHC specificity of T cell recognition. Cell 157, 1073–1087 (2014).
Article CAS PubMed PubMed Central Google Scholar
Joglekar, A. V. et al. T cell antigen discovery via signaling and antigen-presenting bifunctional receptors. Nat. Methods 16, 191–198 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kisielow, J., Obermair, F.-J. & Kopf, M. Deciphering CD4⁺ T cell specificity using novel MHC–TCR chimeric receptors. Nat. Immunol. 20, 652–662 (2019).
Article CAS PubMed Google Scholar
Kula, T. et al. T-Scan: a genome-wide method for the systematic discovery of T cell epitopes. Cell 178, 1016–1028 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, G. et al. T cell antigen discovery via trogocytosis. Nat. Methods 16, 183–190 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sharma, G., Rive, C. M. & Holt, R. A. Rapid selection and identification of functional CD8⁺ T cell epitopes from large peptide-coding libraries. Nat. Commun. 10, 4553 (2019).
Article PubMed PubMed Central Google Scholar
Dobson, C. S. et al. Antigen identification and high-throughput interaction mapping by reprogramming viral entry. Nat. Methods 19, 449–460 (2022).
Article CAS PubMed PubMed Central Google Scholar
Jyothi, M. D., Flavell, R. A. & Geiger, T. L. Targeting autoantigen-specific T cells and suppression of autoimmune encephalomyelitis with receptor-modified T lymphocytes. Nat. Biotechnol. 20, 1215–1220 (2002).
Article CAS PubMed Google Scholar
Obermair, F. J. et al. High-resolution profiling of MHC II peptide presentation capacity reveals SARS-CoV-2 CD4 T cell targets and mechanisms of immune escape. Sci. Adv. 8, eabl5394 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dezfulian, M. H. et al. TScan-II: a genome-scale platform for the de novo identification of CD4(+) T cell epitopes. Cell 186, 5569–5586 (2023).
Article CAS PubMed Google Scholar
Yu, B. et al. Engineered cell entry links receptor biology with single-cell genomics. Cell 185, 4904–4920 (2022).
Article CAS PubMed Google Scholar
Dash, P. et al. Quantifiable predictive features define epitope-specific T cell receptor repertoires. Nature 547, 89–93 (2017).
Article CAS PubMed PubMed Central Google Scholar
Glanville, J. et al. Identifying specificity groups in the T cell receptor repertoire. Nature 547, 94–98 (2017).
Article CAS PubMed PubMed Central Google Scholar
Schattgen, S. A. et al. Integrating T cell receptor sequences and transcriptional profiles by clonotype neighbor graph analysis (CoNGA). Nat. Biotechnol. 40, 54–63 (2022).
Article CAS PubMed Google Scholar
Pogorelyy, M. V. et al. Resolving SARS-CoV-2 CD4⁺ T cell specificity via reverse epitope discovery. Cell Rep. Med. 3, 100697 (2022).
Article CAS PubMed PubMed Central Google Scholar
Robertson, J. M., Jensen, P. E. & Evavold, B. D. DO11.10 and OT-II T cells recognize a C-terminal ovalbumin 323-339 epitope. J. Immunol. 164, 4706–4712 (2000).
Article CAS PubMed Google Scholar
Buzas, E. I. et al. A proteoglycan (aggrecan)-specific T cell hybridoma induces arthritis in BALB/c mice. J. Immunol. 155, 2679–2687 (1995).
Article CAS PubMed Google Scholar
Judkowski, V. et al. Identification of MHC class II-restricted peptide ligands, including a glutamic acid decarboxylase 65 sequence, that stimulate diabetogenic T cells from transgenic BDC2.5 nonobese diabetic mice. J. Immunol. 166, 908–917 (2001).
Article CAS PubMed Google Scholar
Tait, B. D. Genetic susceptibility to type I diabetes: a review. J. Autoimmun. 3, 3–11 (1990).
Article PubMed Google Scholar
Noble, J. A. et al. The role of HLA class II genes in insulin-dependent diabetes mellitus: molecular analysis of 180 Caucasian, multiplex families. Am. J. Hum. Genet 59, 1134–1148 (1996).
CAS PubMed PubMed Central Google Scholar
Michels, A. W. et al. Islet-derived CD4 T cells targeting proinsulin in human autoimmune diabetes. Diabetes 66, 722–734 (2017).
Article CAS PubMed Google Scholar
Hao, Z. et al. Fas receptor expression in germinal-center B cells is essential for T and B lymphocyte homeostasis. Immunity 29, 615–627 (2008).
Article CAS PubMed PubMed Central Google Scholar
Matou-Nasri, S. et al. CD95-mediated apoptosis in Burkitt’s lymphoma B-cells is associated with Pim-1 down-regulation. Biochim. Biophys. Acta Mol. Basis Dis. 1863, 239–252 (2017).
Article CAS PubMed Google Scholar
Rathmell, J. C. et al. CD95 (Fas)-dependent elimination of self-reactive B cells upon interaction with CD4+ T cells. Nature 376, 181–184 (1995).
Article CAS PubMed Google Scholar
Vita, R. et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47, D339–D343 (2019).
Article CAS PubMed Google Scholar
Wan, X. et al. The MHC-II peptidome of pancreatic islets identifies key features of autoimmune peptides. Nat. Immunol. 21, 455–463 (2020).
Article CAS PubMed PubMed Central Google Scholar
Baker, R. L. et al. CD4 T cells reactive to hybrid insulin peptides are indicators of disease activity in the NOD mouse. Diabetes 67, 1836–1846 (2018).
Article CAS PubMed PubMed Central Google Scholar
Amdare, N., Purcell, A. W. & DiLorenzo, T. P. Noncontiguous T cell epitopes in autoimmune diabetes: From mice to men and back again. J. Biol. Chem. 297, 100827 (2021).
Article CAS PubMed PubMed Central Google Scholar
Stadinski, B. D. et al. Chromogranin A is an autoantigen in type 1 diabetes. Nat. Immunol. 11, 225–231 (2010).
Article CAS PubMed PubMed Central Google Scholar
Parras, D., Sole, P., Delong, T., Santamaria, P. & Serra, P. Recognition of multiple hybrid insulin peptides by a single highly diabetogenic T-cell receptor. Front. Immunol. 12, 737428 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ramirez, L. & Hamad, A. R. Status of autoimmune diabetes 20-year after generation of BDC2.5-TCR transgenic non-obese diabetic mouse. World J. Diabetes 4, 88–91 (2013).
Article PubMed PubMed Central Google Scholar
Lee, T., Sprouse, M. L., Banerjee, P., Bettini, M. & Bettini, M. L. Ectopic expression of self-antigen drives regulatory T cell development and not deletion of autoimmune T cells. J. Immunol. 199, 2270–2278 (2017).
Article CAS PubMed Google Scholar
Wiles, T. A. et al. Identification of hybrid insulin peptides (HIPs) in mouse and human islets by mass spectrometry. J. Proteome Res. 18, 814–825 (2019).
Article PubMed PubMed Central Google Scholar
Pearson, J. A., Wong, F. S. & Wen, L. The importance of the non obese diabetic (NOD) mouse model in autoimmune diabetes. J. Autoimmun. 66, 76–88 (2016).
Article CAS PubMed Google Scholar
Prasad, S., Kohm, A. P., McMahon, J. S., Luo, X. & Miller, S. D. Pathogenesis of NOD diabetes is initiated by reactivity to the insulin B chain 9-23 epitope and involves functional epitope spreading. J. Autoimmun. 39, 347–353 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zakharov, P. N., Hu, H., Wan, X. & Unanue, E. R. Single-cell RNA sequencing of murine islets shows high cellular complexity at all stages of autoimmune diabetes. J. Exp. Med. 217, e20192362 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).
Article CAS PubMed PubMed Central Google Scholar
Borcherding, N., Bormann, N. L. & Kraus, G. scRepertoire: an R-based toolkit for single-cell immune receptor analysis. F1000Res 9, 47 (2020).
Article CAS PubMed PubMed Central Google Scholar
Baker, F. J., Lee, M., Chien, Y. H. & Davis, M. M. Restricted islet-cell reactive T cell repertoire of early pancreatic islet infiltrates in NOD mice. Proc. Natl Acad. Sci. USA 99, 9374–9379 (2002).
Article CAS PubMed PubMed Central Google Scholar
Galley, K. A. & Danska, J. S. Peri-islet infiltrates of young non-obese diabetic mice display restricted TCR β-chain diversity. J. Immunol. 154, 2969–2982 (1995).
Article CAS PubMed Google Scholar
Grebinoski, S. et al. Autoreactive CD8(+) T cells are restrained by an exhaustion-like program that is maintained by LAG3. Nat. Immunol. 23, 868–877 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rahimikollu, J. et al. SLIDE: significant latent factor interaction discovery and exploration across biological domains. Nat. Methods https://doi.org/10.1038/s41592-024-02175-z (2024).
Article PubMed Google Scholar
Giudicelli, V., Chaume, D. & Lefranc, M. P. IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res. 33, D256–D261 (2005).
Article CAS PubMed Google Scholar
Chiou, S. H. et al. Global analysis of shared T cell specificities in human non-small cell lung cancer enables HLA inference and antigen discovery. Immunity 54, 586–602 (2021).
Article CAS PubMed PubMed Central Google Scholar
Baker, R. L., Jamison, B. L. & Haskins, K. Hybrid insulin peptides are neo-epitopes for CD4 T cells in autoimmune diabetes. Curr. Opin. Endocrinol. Diabetes Obes. 26, 195–200 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gioia, L. et al. Position beta57 of I-A(g7) controls early anti-insulin responses in NOD mice, linking an MHC susceptibility allele to type 1 diabetes onset. Sci. Immunol. 4, eaaw6329 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wenzlau, J. M. et al. Insulin B-chain hybrid peptides are agonists for T cells reactive to insulin B:9-23 in autoimmune diabetes. Front. Immunol. 13, 926650 (2022).
Article CAS PubMed PubMed Central Google Scholar
Balakrishnan, S., Kumar, P. & Prabhakar, B. S. Post-translational modifications contribute to neoepitopes in Type-1 diabetes: challenges for inducing antigen-specific tolerance. Biochim Biophys. Acta Proteins Proteom. 1868, 140478 (2020).
Article CAS PubMed Google Scholar
Buitinga, M. et al. Inflammation-Induced citrullinated glucose-regulated protein 78 elicits immune responses in human type 1 diabetes. Diabetes 67, 2337–2348 (2018).
Article CAS PubMed PubMed Central Google Scholar
Almagro Armenteros, J. J. et al. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 37, 420–423 (2019).
Article CAS PubMed Google Scholar
Cooper, A. R. et al. Highly efficient large-scale lentiviral vector concentration by tandem tangential flow filtration. J. Virol. Methods 177, 1–9 (2011).
Article CAS PubMed PubMed Central Google Scholar
Szymczak, A. L. et al. Correction of multi-gene deficiency in vivo using a single ‘self-cleaving’ 2A peptide-based retroviral vector. Nat. Biotechnol. 22, 589–594 (2004).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank A.R. Cillo, C. Workman, J. Bridge, P. Thomas, S. Schattgen, S. Chiou, J. Das, H. Xiao and H. Singh for scientific discussions and advice on experimental and computational techniques. We thank M.T. Leonard, K. Ford, K. Rankin, S. Rathod, K. Adam and A. Parikh for technical assistance in the experimental antigen discovery pipeline. We thank T. Tabib and R. Lafyatis at the Single Cell Core, The Unified Flow Cytometry Core and the Division of Laboratory Animal Research for aiding with scRNA-seq, flow cytometry and animal husbandry. NFAT–GFP Jurkat cells were a gift from A. Weiss (University of California San Francisco) and Y. Chen (University of California Los Angeles). The pCCLc-MND-X backbone and pCMV-RD8.9 were gifts from D.B. Kohn (University of California Los Angeles). The 5KC cells were a kind gift from M. Nakayama (Barbara Davis Center for Diabetes, University of Colorado Anschutz Medical Campus). P.M.Z. was funded by an Autoimmunity and Immunopathology training grant (5T32AI089443-14). This research was funded by National Institutes of Health (NIH)/National Institute of Diabetes and Digestive and Kidney Diseases New Investigator Gateway Award (1R03DK127447-01 to A.V.J.), NIH/National Institute of Diabetes and Digestive and Kidney Diseases/dkNET New Investigator in Bioinformatics Award (to A.V.J.), Pittsburgh Autoimmunity Center for Excellence in Rheumatology Innovative Discovery Award (to A.V.J.); NIH Director’s New Innovator Award (DP2 OD033187-01 to A.V.J.) and Juvenile Diabetes Research Foundation Strategic Research Agreement (3-SRA-2023-1354-S-B to A.V.J.).

Author information

These authors contributed equally: Nishtha Trivedi, Stephanie Grebinoski.

Authors and Affiliations

Department of Immunology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Paul M. Zdinak, Nishtha Trivedi, Stephanie Grebinoski, Jessica Torrey, Eduardo Zarate Martinez, Salome Martinez, Louise Hicks, Rashi Ranjan, Venkata Krishna Kanth Makani, Mary Melissa Roland, Lyubov Kublo, Sanya Arshad, Dario A. A. Vignali & Alok V. Joglekar
Center for Systems Immunology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Paul M. Zdinak, Nishtha Trivedi, Jessica Torrey, Eduardo Zarate Martinez, Salome Martinez, Louise Hicks, Rashi Ranjan, Venkata Krishna Kanth Makani, Mary Melissa Roland, Lyubov Kublo, Sanya Arshad & Alok V. Joglekar
Program in Microbiology and Immunology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Paul M. Zdinak & Stephanie Grebinoski
Microbiology and Immunology Diversity Scholars Program, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Eduardo Zarate Martinez
Diabetes Center, University of California San Francisco, San Francisco, CA, USA
Mark S. Anderson
Tumor Microenvironment Center, UPMC Hillman Cancer Center, Pittsburgh, PA, USA
Dario A. A. Vignali
Cancer Immunology and Immunotherapy Program, UPMC Hillman Cancer Center, Pittsburgh, PA, USA
Dario A. A. Vignali & Alok V. Joglekar

Authors

Paul M. Zdinak
View author publications
You can also search for this author in PubMed Google Scholar
Nishtha Trivedi
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Grebinoski
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Torrey
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo Zarate Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Salome Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Louise Hicks
View author publications
You can also search for this author in PubMed Google Scholar
Rashi Ranjan
View author publications
You can also search for this author in PubMed Google Scholar
Venkata Krishna Kanth Makani
View author publications
You can also search for this author in PubMed Google Scholar
Mary Melissa Roland
View author publications
You can also search for this author in PubMed Google Scholar
Lyubov Kublo
View author publications
You can also search for this author in PubMed Google Scholar
Sanya Arshad
View author publications
You can also search for this author in PubMed Google Scholar
Mark S. Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Dario A. A. Vignali
View author publications
You can also search for this author in PubMed Google Scholar
Alok V. Joglekar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.M.Z. designed and performed experiments, analyzed and interpreted the data and wrote the paper. S.G. designed and performed the scRNA-seq experiments. N.T. designed and performed SABR screens, validation experiments and SABR B cell experiments. J.T., E.Z.M., L.H., R.R., V.K.K.M., S.M., M.M.R. and S.A., performed experiments and assisted with technical procedures in TCR cloning, SABR screens and validations. M.S.A. and D.A.A.V. provided reagents, guidance and scientific discussions. A.V.J. conceptualized the study, designed and performed experiments, analyzed and interpreted the data and wrote the paper.

Corresponding author

Correspondence to Alok V. Joglekar.

Ethics declarations

Competing interests

D.A.A.V. is a cofounder and stockholder for Novasenta, Tizona and Trishula; stockholder for Oncorus and Werewolf; has patents licensed and obtains royalties from Astellas, BMS and Novasenta; is a scientific advisory board member for Tizona, Werewolf, F-Star, Bicara, Apeximmune and T7/Imreg Bio; is a consultant for Astellas, BMS, Almirall, Incyte, G1 Therapeutics and Inzen Therapeutics; and receives research funding from BMS, Astellas and Novasenta. M.S.A. Is a consultant for Imcyse and Novartis and owns stock in Merck and Medtronic. A.V.J. is a co-inventor on a patent application concerning the described platform, has received research funding from Mitsubishi-Tanabe Pharma and has served as a consultant for Pfizer. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Michael Birnbaum, Encarnita Mariotti-Ferrandiz and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Madhura Mukhopadhyay, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 SABR expression and signaling.

A. Representative expression of murine TCRβ (Bdc2.5 TCR) and I-Ag7-2.5mimo SABR-II after transduction of Jurkat and NFAT-GFP-Jurkat cells respectively. B. Representative flow cytometry plots of SABR-II expressing NFAT-GFP-Jurkat cells after co-incubation with TCR-expressing Jurkats. The TCRs and SABRs are indicated by rows and column names respectively. C. TCR3 expressing or mock Jurkats were co-incubated against NFAT-GFP-Jurkats cells expressing the I-Ag7 SABR presenting the QVEQLELNAARDPN HIP (SABR1). The GFP MFI was plotted (y-axis) as dots with s.d. (error bars) from technical duplicates against the I-Ag7 levels binned by half logs of the I-Ag7 labeling MFI (x-axis, bins depicted in D). D. Gating strategy to generate the half log bins of I-Ag7 expression in panel C E. Representative pseudo color dot plot depicting the I-Ag7 levels (x-axis) of SABR1 NFAT-GFP-Jurkats co-incubated against TCR3 expressing Jurkats depicted in panel A. The pseudo coloring depicts the GFP MFI for each given event. F. Representative expression of the HLA-DQ8 SABR-II after transduction of NFAT-GFP-Jurkat cells. G. Representative flow cytometry plots of SABR-II expressing NFAT-GFP Jurkat cells after co-incubation with TCR-expressing Jurkats. The respective TCRs and SABRs are indicated by rows and columns respectively.

Source data

Extended Data Fig. 2 Modularity of SABR-IIs.

A. Schematic (color coded by cell type) for corresponding co-incubation assays demonstrating SABR-IIs in both human (Jurkat) and murine (5KC) cell lines. B. OT-II TCR-expressing Jurkat cells against NFAT-GFP-Jurkats expressing the I-Ab-OVA SABR (left) and OT-II expressing 5KC cells against 5KC cells expressing the I-Ab-OVA SABR (right). Bars represent means from two biological replicates (dots). C. Schematic for corresponding co-incubation assays in NFAT-GFP-Jurkat cells demonstrating SABR-IIs with CD28-CD3z or CD79A and CD79B signaling domains. D. Representative flow plots (left) of OT-II TCR Jurkats co-incubated against NFAT-GFP-Jurkats expressing SABR-IIs with either CD28-CD3z(top) or CD79A and CD79B signaling domains. GFP expression was quantified after 24 hr co-incubation (right) with bars depicting the mean and s.d. (error bars) from 3 biological replicates (dots). E. C. Schematic for corresponding co-incubation assays in Daudi Cells demonstrating SABR-IIs with CD28-CD3z or CD79A and CD79B signaling domains. F. Representative dot plots (left) of Bdc2.5 TCR Jurkats co-incubated against Daudi cells expressing SABR-IIs with either CD28-CD3ζ or CD79A and CD79B signaling domains. The percentage of SABR-II+ Daudi cells expressing FAS was quantified by flow cytometry after 72 hr with bars depicting the mean and s.d. (error bars) from 3 biological replicates (dots).

Source data

Extended Data Fig. 3 SABR-II library screening.

A. Schematics of the SABR-II library cloning and PCR strategy used for targeted reamplification of gDNA from sorted SABR-II library cells. B. The average read count in the libraries across 8 independent experiments is shown (blue dots) with s.d. (error bars). The x-axis denotes the epitope number (ordered in a descending order of mean read counts). C. Representative flow cytometry plots for SABR-II screen sorts. NFAT-GFP-Jurkat cells expressing the SABR-II library were labeled with cell trace violet, gated, and subsequently used to select top 1–2% of GFP/CD69 double positive cells for sorting. D. PCR indexing strategy for epitopes from gDNA for sequencing of SABR-II screens. E-H. NFAT-GFP-Jurkat cells expressing the I-Ag7 library, which contains no TCR3 target epitopes, were labeled with cell trace far red (CTFR+), and NFAT-GFP-Jurkat cells expressing the TCR3 targeted single SABR-II (QVEQLELNAARDPN) were labeled with cell trace violet (CTV+). The two cell types were mixed at decreasing ratios of the TCR3 targeted SABR-II and incubated against TCR3 expressing Jurkats at a 1:1 Jurkat to NFAT-GFP-Jurkat ratio. E. Gating strategy to identify sensitivity and enrichment of target cells in SABR-II library sort gate. Cells were partitioned by cell trace label then separately analyzed for proportion that fall into the sort gate (1–2% CD69+GFP+ NFAT-GFP-Jurkat cells) in the far-right panel. F. Relative proportion of TCR3 targeted SABR-II cells and library cells in the assay that are captured in the sort gate where bars indicate mean from 3 technical replicates. The x-axis indicates the level of spike in of the TCR3 target single SABR-II. G. Left panel shows the fold enrichment (mean of 3 technical replicates) of spiked in TCR3 target single SABR-II cells after sorting. The x-axis indicates the proportion of target cells spike in as in panel F. The right panel shows the pre- and post-sort proportion of spiked in TCR3 target single SABR-II cells. H. Gating strategy to specifically identify specificity of sort gate. Cells are partitioned by cell trace labeling after their appearance in the sort gate. Right panel shows quantification of the proportion of the TCR3 targeted single SABR-II cells that makeup the total sorted cells vs the untargeted library where bars indicated the mean percentage of total sorted cells from two technical duplicates (dots) across the % of target cells spiked in pre-sort (x-axis).

Source data

Extended Data Fig. 4 Enrichment score plots from I-Ag7 Library Validations.

A. The enrichment score plots for each of 8-independent replicate screen of the BDC2.5 TCR against the I-Ag7-SABR-II library. The resulting high and low confidence thresholds are denoted by green and orange lines respectively. B. The enrichment score for each of 3-independet replicate screen of the 4–8Ins TCR against the SABR-II library. The same high and low confidence thresholds as maintained from experiments in panel A. For every plot each dot represents a single epitope spread across the x-axis and red dots indicate putative cognate epitopes selected for validation.

Source data

Extended Data Fig. 5 Single-cell RNA-Sequencing of islet-infiltrating CD4+ T cells from NOD mice.

A. Hierarchical clustering of total T cells across 11 mice from 6-, 8-, and 10-week-old time points. B. Hierarchical clustering of CD4+ T cells from individual mice across 6-, 8-, and 10-week time points. C. Projection of top 40 expanded CD4+ T cell clones from 8-, and 10-week-old NOD mice onto Seurat clusters using scRepertoire. D. Distribution of top 40 expanded CD4+ TCR sequences across all mice. E. TRAV (top) and TRBV (bottom) usage from top 40 expanded CD4+ TCR sequences across all mice. F. Morisita-Horn index comparing expanded TCR clones across each mouse individually.

Source data

Extended Data Fig. 6 Representative SABR-II screens and hit validation.

A. Representative flow sort gating for cell trace violet labeled NFAT-GFP-Jurkat cells expressing the I-Ag7-SABR-II library after co-incubation with Jurkat cells expressing TCRs. Top 1–2% of cells expressing GFP and CD69 were sorted as is shown in the two rightmost panels (gate is constant across panels). B. ES plots from a single sort of 6-TCRs individually. High and low confidence thresholds are denoted by green and orange lines respectively. The colored arrows indicate putative hits tested for validation with single SABR-II assays depicted in panel C. C. Single SABR-II co-incubations for validation of putative hits from the screens in panel B where single SABR-II expressing NFAT-GFP-Jurkat cells were co-incubated against Jurkats expressing the TCR of interest and assayed 18–20 hr later by flow cytometry. Bars indicate means from 2 technical replicates (dots). Top panel depicts non-validated epitope, middle and bottom depict validated synthetic altered peptide ligands (APLs) and physiological epitopes. D. ES plots for TCRs screened against the I-Ag7-SABR-II library that yielded high-confidence putative hits grouped by the highest non-APL epitope. The same high and low confidence thresholds are used from plots generated in Extended Data Fig. 4.

Source data

Extended Data Fig. 7 Putative hit validation for high and low confidence hits.

A. Representative flow plots from the single SABR-II validations of the putative hits for high-confidence, non-APL hits. B. Murine IL-2 ELISA from 24 hr co-incubation of 5KC cells expressing TCR37 with NFAT-GFP-Jurkat cells expressing the InsB9:23 epitope where bars represent mean of IL-2 sectretion into supernatant from two technical duplicates (dots). C. CD25 expression measured on primary murine CD4+ T cells expressing TCR15 after 24 hr co-incubation with Bone Marrow Dendritic Cells pulsed with either 1μg InsC-IAPP peptide (LQTLALNAARDP) or no peptide. D. ES plots with arrows denoting epitopes tested in the corresponding single SABR-II co-incubations. The same high and low confidence thresholds are taken from Extended Data Fig. 4. The inset plots show single SABR-II validation assays where bars indicate the mean from 2–3 technical replicates (dots). The arrow colors match the epitopes within each inset plot.

Source data

Extended Data Fig. 8 CoNGA, TCRdist results and identification of TCR analogs.

The top three row of the CoNGA panel shows the GEX and TCR clusters with phenotype marker expression in each cluster. The TCR logo panel shows TCR clusters with logo representations of the TCRs that form these clusters. The purple, red, and blue rectangles indicate the TCR4 CoNGA cluster, TCR30 CoNGA cluster, and TCR11 gene expression clusters respectively. Each cluster’s TCRs which were selected for validation are listed in the tables below with the parental TCR. TCRs which lack GEX clusters in the TCR30 analog table were selected for validation by similarity determined through GLIPH2. TCR6B is not depicted in the figure because of the minimum clone size requirement for CoNGA, TCR11 mouse IDs are lost during CoNGA processing.

Extended Data Fig. 9 SABR-II screens are semi-quantitative readouts of functional avidity.

A. Selected BDC2.5 epitopes for functional avidity measurements. As shown by blue arrows, 6 epitopes across range of ES were chosen from mean ES (bars) across 8 independent experiments (dots). B. Individual biological replicates of peptide pulsing experiments with BDC2.5 TCR-expressing 5KCs against selected peptides. The y-axis shows normalized murine IL-2 secretion across a range of peptide concentrations (x-axis). C. The mean (dots) and s.d. (error bars) of Log EC50 values (x-axis) from the 3 biological replicates in panel B plotted against the mean (dots) and s.d. of ES values across 8 biological replicates (y-axis). The r and p values for two-sided Pearson correlation (dashed line) are reported.

Source data

Extended Data Fig. 10 ES plots from TCR and library multiplexing screens.

ES plots for each multiplexed sort where the dots represent the mean ES for each epitope in replicates (n = 6) where the target TCR was not included (dropout). The red dots indicate the cognate epitope of each TCR that were previously de-convoluted using individual TCR screens against the library.

Source data

Supplementary information

Reporting Summary

Supplementary Table 1

Lists of epitopes for I-Ag7 SABR-II library.

Supplementary Table 2

Lists of epitopes for DQ8 SABR-II library and ES.

Supplementary Table 3

TCR alleles and CDR3 sequences for top clonally expanded TCRs.

Supplementary Table 4

ES calculations for all TCRs.

Supplementary Table 5

Lists of epitopes for I-Ag7 HIP library and ES.

Supplementary Table 6

ES from dropout analysis.

Supplementary Table 7

Reagents used in the study.

Source data

Source Data Fig. 1

Statistical source data.

Source Data Fig. 2

Statistical source data.

Source Data Fig. 3

Statistical source data.

Source Data Fig. 4

Statistical source data.

Source Data Fig. 5

Statistical source data.

Source Data Extended Data Fig. 1

Statistical source data.

Source Data Extended Data Fig. 2

Statistical source data.

Source Data Extended Data Fig. 3

Statistical source data.

Source Data Extended Data Fig. 4

Statistical source data.

Source Data Extended Data Fig. 5

Statistical source data.

Source Data Extended Data Fig. 6

Statistical source data.

Source Data Extended Data Fig. 7

Statistical source data.

Source Data Extended Data Fig. 9

Statistical source data.

Source Data Extended Data Fig. 10

Statistical source data.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zdinak, P.M., Trivedi, N., Grebinoski, S. et al. De novo identification of CD4⁺ T cell epitopes. Nat Methods (2024). https://doi.org/10.1038/s41592-024-02255-0

Download citation

Received: 20 November 2022
Accepted: 22 March 2024
Published: 24 April 2024
DOI: https://doi.org/10.1038/s41592-024-02255-0

Subjects

Abstract

Similar content being viewed by others

Main

Results

Signaling and antigen-presenting bifunctional receptors II

Single-cell profiling of islet-infiltrating CD4+ T cells

Identifying cognate epitopes of CD4+ TCRs de novo

TCR similarity predictions amplify antigen discovery

Identifying new HIP epitopes using SABR-II libraries

Technical advances afforded by SABR-II screens

Discussion

Methods

Ethics statement

Reagents and oligonucleotide primers

Cell lines and peptides

Mice

Construction of SABRs

scRNA-seq of islet-infiltrating T cells and analysis

TCR reconstruction and synthesis

TCR similarity determinations

Generation and cloning of SABR libraries

Lentiviral vector production and transduction

Retroviral vector production and transduction

Co-culture assays

High-throughput sequencing and analysis

Peptide pulsing assays

Statistical analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

Single-cell profiling of islet-infiltrating CD4⁺ T cells

Identifying cognate epitopes of CD4⁺ TCRs de novo