Genetic alteration of human MYH6 is mimicked by SARS-CoV-2 polyprotein: mapping viral variants of cardiac interest

Anand, Praveen; Lenehan, Patrick J.; Niesen, Michiel; Yoo, Unice; Patwardhan, Dhruti; Montorzi, Marcelo; Venkatakrishnan, A. J.; Soundararajan, Venky

doi:10.1038/s41420-022-00914-9

Download PDF

Article
Open access
Published: 21 March 2022

Genetic alteration of human MYH6 is mimicked by SARS-CoV-2 polyprotein: mapping viral variants of cardiac interest

Praveen Anand¹^na1,
Patrick J. Lenehan²^na1,
Michiel Niesen²,
Unice Yoo²,
Dhruti Patwardhan¹,
Marcelo Montorzi^2,3,
A. J. Venkatakrishnan² &
…
Venky Soundararajan ORCID: orcid.org/0000-0001-7434-9211^1,2

Cell Death Discovery volume 8, Article number: 124 (2022) Cite this article

2547 Accesses
6 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Acute cardiac injury has been observed in a subset of COVID-19 patients, but the molecular basis for this clinical phenotype is unknown. It has been hypothesized that molecular mimicry may play a role in triggering an autoimmune inflammatory reaction in some individuals after SARS-CoV-2 infection. Here we investigate if linear peptides contained in proteins that are primarily expressed in the heart also occur in the SARS-CoV-2 proteome. Specifically, we compared the library of 136,704 8-mer peptides from 144 human proteins (including splicing variants) to 9926 8-mers from all the viral proteins in the reference SARS-CoV-2 proteome. No 8-mers were exactly identical between the reference human proteome and the reference SARS-CoV-2 proteome. However, there were 45 8-mers that differed by only one amino acid when compared to the reference SARS-CoV-2 proteome. Interestingly, analysis of protein-coding mutations from 141,456 individuals showed that one of these 8-mers from the SARS-CoV-2 Replicase polyprotein 1a/1ab (KIALKGGK) is identical to an MYH6 peptide encoded by the c.5410 C > A (Q1804K) genetic variation, which has been observed at low prevalence in Africans/African Americans (0.08%), East Asians (0.3%), South Asians (0.06%), and Latino/Admixed Americans (0.003%). Furthermore, analysis of 4.85 million SARS-CoV-2 genomes from over 200 countries shows that viral evolution has already resulted in 20 additional 8-mer peptides that are identical to human heart-enriched proteins encoded by reference sequences or genetic variants. Whether such mimicry contributes to cardiac inflammation during or after COVID-19 illness warrants further experimental evaluation. We suggest that SARS-CoV-2 variants harboring peptides identical to human cardiac proteins should be investigated as “viral variants of cardiac interest”.

Predicting human and viral protein variants affecting COVID-19 susceptibility and repurposing therapeutics

Article Open access 20 June 2024

A proteome-scale map of the SARS-CoV-2–human contactome

Article Open access 10 October 2022

SARS-CoV-2 receptor ACE2 is co-expressed with genes related to transmembrane serine proteases, viral entry, immunity and cellular stress

Article Open access 08 December 2020

Introduction

Cardiac injury is a prevalent complication associated with COVID-19 [1]. In a study of 100 recently recovered COVID-19 patients, cardiovascular magnetic resonance imaging revealed cardiac involvement or ongoing myocardial inflammation in 78 and 60 patients, respectively [2]. In another study of 39 consecutive autopsies from patients who died of COVID-19, viral RNA was detectable in the heart of 24 (62%) patients. A large nationwide study from Israel reported that SARS-CoV-2 infection is associated with increased rates of myocarditis, arrhythmia, myocardial infarction, and pericarditis [3]. Myocarditis has also been reported in a small fraction of individuals after receiving an mRNA COVID-19 vaccine [4,5,6].

Despite these phenotypic associations, the mechanisms underlying myocardial inflammation in the setting of COVID-19 infection and vaccination remain unclear. A prevalent hypothesis, known as molecular mimicry, posits that T lymphocytes and/or antibodies that recognize SARS-CoV-2 antigens and mediate virus neutralization may also cross-react against host cardiac proteins and trigger an autoimmune response against cardiomyocytes [7]. This mechanism has also been suggested to contribute to other inflammatory conditions seen in the context of COVID-19 infection [8,9,10]. Indeed, autoimmune sequelae of other infectious diseases have been attributed to mimicry between host and microbial antigens [11,12,13,14,15,16].

Advances in next-generation sequencing technologies have facilitated the rapid development of large-scale multi-omic datasets and genomic epidemiology resources to better understand the COVID-19 pandemic. Bulk and single-cell RNA-sequencing datasets have elucidated the transcriptional signatures of most healthy human tissues and cell types [17,18,19,20]. Amino acid sequences of human proteins, including genetic variants and immunologic epitopes, are available in UniProt [21], gnomAD [22], and Immune Epitope Database (IEDB) [23]. The GISAID database currently hosts 4.85 million SARS-CoV-2 genomes from more than 200 countries[24]. The availability of such genome-scale data enables us to investigate the potential for molecular mimicry between SARS-CoV-2 and human cardiac proteins.

Here we present a systematic comparison of peptides from human cardiac proteins and SARS-CoV-2 proteins. We show that no 8-mer peptides are identical between the reference sequences of these two groups of proteins. However, when including human and viral genetic variants in this comparison, we found 21 8-mer peptides to be identical between human cardiac proteins and SARS-CoV-2 proteins. Among these, a human genetic variant of MYH6 (c.5410 C > A; Q1804K) is identical to a peptide of the reference SARS-CoV-2 replicase polyprotein. Finally, we propose that the SARS-CoV-2 variants that have peptides identical to human cardiac proteins should be studied as potential “viral variants of cardiac interest”.

Results

Identification of genes that are overexpressed in cardiac tissue

To identify heart-enriched proteins, we compared the expression of all human protein-coding genes in heart samples (n = 861) versus all non-striated muscle samples (n = 15,718) from the Genotype-Tissue Expression (GTEx) project. There were 137 genes expressed at least fivefold higher in the heart with a Cohen’s D value greater than or equal to 0.5 (Fig. 1 and Supplementary Table 1). Similarly, we compared the expression of all genes between cardiomyocytes (n = 8.9 K cells) and non-cardiomyocytes (n = ~2.5 million cells) across a database of 52 single-cell RNA-sequencing studies covering 62 tissues[18]. There were 46 genes overexpressed in cardiomyocytes based on the same criteria outlined above (Fig. 1 and Supplementary Table 1). Combining the lists of genes identified from bulk and single-cell RNA-sequencing analyses, we identified a total of 144 candidate cardiac proteins.

**Fig. 1: Identification of mimicked peptides between SARS-CoV-2 and human proteins.**

MYH6 variant is mimicked by an epitope of SARS-CoV-2 Replicase polyprotein 1a/1ab

We computed peptide 8-mers from the reference sequences of the 144 cardiac proteins (including isoforms) and all 17 proteins from the reference SARS-CoV-2 sequence. We then systematically compared the pairwise sequence identity (using Hamming Distance; see Methods) for the 136,704 cardiac protein 8-mers with the 9,926 8-mers derived from the SARS-CoV-2 proteins. No peptides were fully identical between these two groups. However, 45 8-mers were nearly identical, with only a single mismatched amino acid (Table 1). To determine whether human genetic variation results in any 8-mers, which exactly match the reference SARS-CoV-2 proteome, we then analyzed amino acid mutations from 141,456 individuals using the gnomAD database (including 83,623 mutations in the 144 cardiac proteins) [22].

Table 1 List of peptide pairs from SARS-CoV-2 proteins and human cardiac proteins that have a Hamming distance less than or equal to 1.

Full size table

Interestingly, one specific 8-mer from the SARS-CoV-2 Replicase polyprotein 1a/1ab (KIALKGGK) is identical to a mutant peptide encoded by the c.5410 C > A(Gln1804Lys) variation in human MYH6 (KIALKGGK), which is a subunit of a cardiac motor protein. This genetic variation has been identified in Africans/African Americans (0.08% prevalence), East Asians (0.3% prevalence), South Asians (0.06% prevalence) and Latino/Admixed Americans (0.003% prevalence) [22]. Analysis of peptides from IEDB shows that the non-mutated 7-mer in this peptide (IALKGGK) also overlaps with a known B-cell epitope from SARS-CoV-2 (IALKGGKIVNNWLKQ, IEDB ID:1039277) [25]. Whether the mimicry between SARS-CoV-2 Replicase polyprotein 1a/1ab and wild-type or mutant MYH6 contributes to cardiac inflammation in the setting of COVID-19 warrants further investigation.

SARS-CoV-2 variants harbor epitopes that are identical to peptides of cardiac proteins

We next examined whether SARS-CoV-2 evolution has given rise to variants that harbor peptides identical to human cardiac proteins. To this end, we analyzed 4.85 million SARS-CoV-2 genomes from over 200 countries obtained from the GISAID database [24]. We identified 21 8-mer peptides from SARS-CoV-2 variants that are identical to cardiac proteins encoded by reference sequences or genetic variants (Tables 2 and 3). Of these 21 peptides, the one present in the highest number of viral sequences was STAVLGVL, which mapped to the NSP3 protein (part of Replicase polyprotein sequence) in 4501 (0.09%) SARS-CoV-2 genomes. This sequence is present in a transmembrane helix of all three described isoforms of the cardiac protein SLC4A3 (UniProtKB: P48751), an anion exchange protein that is associated with short QT syndrome (Tables 2 and 3) [21, 26]. Although no epitopes containing this full sequence were present in IEDB, a BLAST search at 90% similarity identified a similar epitope (HEAQAVLGVLL) that is reported to bind to both HLA-B*40:1 and HLA-B*58:01 [25, 27]. We additionally analyzed the temporal and geographical emergence of the five variant SARS-CoV-2 8-mers that are identical to cardiac protein 8-mers and that occur in over 100 reported SARS-CoV-2 genomes (Supplementary Fig. S1). Variants with exactly matching 8-mers have occurred sporadically in different geographic locations, but they have not persisted over time (Supplementary Fig. S1).

Table 2 List of identical cardiac peptides found in SARS-CoV-2 variants in GISAID.

Full size table

Table 3 List of mutated cardiac peptide n-mers from human genetic variants identical to SARS-CoV-2 variants.

Full size table

Discussion

Understanding the mechanistic basis of acute cardiac injury in COVID-19 patients is important to develop countermeasures. Viral infections have previously been proposed to trigger autoimmune reactions, and it has been hypothesized that molecular mimicry plays a role in mediating autoimmune reactions [7,8,9, 11,12,13, 28]. Here, a systematic analysis of gene expression from cardiac tissues, based on bulk RNA-seq data and single-cell RNA-seq data, combined with analysis of human genetic variation and SARS-CoV-2 genomes has led to the identification of candidate proteins and peptide regions therein that might be involved in immune cross-reactivity (Tables 2 and 3). These newly identified identical peptides expand the set of known shared peptides between the human proteome and SARS-CoV-2, such as the furin cleavage site “RRARSVAS” present in both human ENaC-ɑ and the Spike glycoprotein[10, 29, 30]. Further research is warranted to ascertain whether these mimicked peptides contribute to autoinflammatory pathology in the context of COVID-19 infection. Additionally, given the occurrence of myocarditis in some individuals shortly after receiving an mRNA COVID-19 vaccine, the potential for molecular mimicry between the antigen encoded by these vaccines (i.e., the pre-fusion stabilized Spike glycoprotein) and human cardiac proteins should be evaluated[4,5,6, 31, 32].

There are a few limitations to this study. First, the human proteins that we surveyed were shortlisted based on their overexpression in cardiac tissue. There could be mimicked proteins that are shared between cardiac tissues and other tissues that are not accounted for in the current analysis. Second, there are other mechanisms that could contribute to autoimmunity after a viral infection such as bystander activation, epitope spreading, and viral persistence [33]. Third, the presence of identical peptides in cardiac proteins and the SARS-CoV-2 proteome could occur due to chance. Comparing all SARS-CoV-2 8-mers to a set of brain enriched proteins and a set of skin enriched proteins shows a similar probability distribution of Hamming distance (Supplementary Fig. S2), suggesting that the observed similarity with SARS-CoV-2 peptides is not specific to human cardiac proteins. Fourth, it is possible that peptides with lower degrees of similarity could contribute to immunologic mimicry, as T cells can be highly cross-reactive against different major histocompatibility complex (MHC)-presented peptides [34,35,36,37,38].

Taken together, by studying the intersection of human genetic variation in cardiac proteins and SARS-CoV-2 evolution, we have identified candidates of molecular mimicry that have the potential to contribute to cardiac inflammation in the context of COVID-19. It will be important to perform follow-up functional studies evaluating the potential of SARS-CoV-2 reactive T cells and antibodies (e.g., from active or recovering COVID-19 patients) to cross-react with these peptides. Thus, we propose that SARS-CoV-2 variants harboring peptides identical to host heart-enriched proteins should be studied as “viral variants of cardiac interest”. We highlight that a similar strategy can be applied to identify and categorize plausible mimicry candidates from any human tissues that are targeted by other autoimmune responses in COVID-19 patients.

Methods

Identification of proteins enriched in cardiac tissue

Bulk RNA-sequencing (RNA-seq) data was accessed from the Genotype-Tissue Expression (GTEx) project V8 [17]. For each sample, FASTQ files were processed using Salmon (in mapping-based mode) to quantify gene expression in transcripts per million (TPM). Specifically, the expression of each transcript isoform was first determined by passing FASTQ files to Salmon quant with the following parameters passed: validateMappings, rangeFactorizationBins 4, gcBias, biasSpeedSamp 10. All isoforms are then summed via a transcript-to-gene map, generating a gene-level expression value. GRCh38 was used as the reference, including cDNA and non-coding RNA.

For single-cell RNA-seq studies, processed count matrices were accessed from Gene Expression Omnibus or other publicly available data repositories. There were two datasets analyzing heart tissues which captured cardiomyocytes, our main cell type of interest for this report [20, 39]. Other datasets captured a wide variety of immune, stromal, and parenchymal cell types from tissues including the respiratory tract, gastrointestinal tract, genitourinary tract, hepatobiliary system, skeletal muscle, brain, skin, eyes, and endocrine organs. Each dataset was processed using Scrublet and Seurat v3.0 as described previously [18, 40,41,42]. Cell type annotations were obtained from associated metadata files if available; otherwise, annotation was performed manually, guided by the cell types reported in the associated publication.

To identify genes that are overexpressed in cardiac tissue, we calculated fold change and Cohen’s D values between defined sample cohorts. For bulk RNA-seq data, Cohort A was defined as all GTEx heart samples (n = 861), and Cohort B was defined as all remaining GTEx samples except for those derived from skeletal muscle (n = 15,718). For single-cell RNA-seq data, Cohort A was defined as all cells annotated as cardiomyocytes (n ~8900 cells), and Cohort B was defined as all other cells from all processed studies (n ~2.5 million cells). Fold change and Cohen’s D were calculated as follows:

$${\mathbf{Fold}}\,{\mathbf{Change}} = \frac{{{\mathrm{TPM}}_{{\mathrm{Cohort}}\,{\mathrm{A}}} + 1}}{{{\mathrm{TPM}}_{{\mathrm{Cohort}}\,{\mathrm{B}}} + 1}}$$

Cohen’s D = $\frac{{{\mathrm{TPM}}_{{\mathrm{Cohort}}\,{\mathrm{A}}} - {\mathrm{TPM}}_{{\mathrm{Cohort}}\,{\mathrm{B}}}}}{{{\mathrm{SD}}_{{\mathrm{pooled}}}}}$, where the pooled standard deviation SD_pooled is defined as: SD_pooled = $\sqrt {\frac{{(N_{{\mathrm{Cohort}}\,A} - 1) \times {\mathrm{SD}}_{Cohort\,A}^2 + (N_{{\mathrm{Cohort}}\,B} - 1) \times {\mathrm{SD}}_{{\mathrm{Cohort}}\,B}^2}}{{(N_{{\mathrm{Cohort}}\,A} + N_{{\mathrm{Cohort}}\,B} - 2)}}}$, where N_{Cohort A} and N_{Cohort B} are the number of samples in Cohorts A and B, respectively, and SD_{Cohort A} and SD_{Cohort B} are the standard deviation of TPM values for the given gene in Cohorts A and B, respectively.

Genes with fold change ≥5 and Cohen’s D ≥ 0.5 from either the bulk or single-cell RNA-seq analysis were considered to be enriched in cardiac tissue. In the volcano plots used to visualize these analyses, we filtered to genes with a TPM or CP10K value ≥1 in either Cohort A or Cohort B, and genes meeting the criteria for overexpression in heart or cardiomyocytes are colored in red.

For a control analysis, we also identified genes overexpressed in the brain or skin by bulk RNA-sequencing from the GTEx project. We used the same approach as described above, except that Cohort A was defined as either all brain samples (n = 2351) or all skin samples (n = 1305), and Cohort B was defined as all other samples.

Comparison of 8-mers from reference sequences of cardiac proteins and SARS-CoV-2 proteins

The translated proteome from reference to the SARS-CoV-2 genome (NC_045512.2) was downloaded from UniProt (https://covid-19.uniprot.org/) [21, 43]. A sliding window approach was used to enumerate all 8-mers from the 17 proteins in this viral proteome. Similarly, we used a sliding window approach to generate all 8-mers from the reference amino acid sequences of the previously defined 144 cardiac proteins, including the canonical isoforms and all described isoforms indicated in UniProt. We then performed a pairwise comparison of all 8-mers in these two groups by calculating the Hamming distance using the stringdist function from the stringdist package (version 0.9.8) in R (version 4.0.3). In a control analysis, we used the same approach to calculate the Hamming distance between all SARS-CoV-2 8-mers and the control sets of 369 human proteins enriched in the brain or 198 human proteins enriched in the skin (described above).

Assessing the impact of human and SARS-CoV-2 variants on cardiac peptide matches

To assess the impact of human genetic variation on potential molecular mimicry, we retrieved all missense variants from the gnomAD database for the previously identified cardiac proteins that had at least one 8-mer similar to a peptide in the SARS-CoV-2 reference proteome (Hamming distance = 1) [22]. We used the gnomad-api (https://gnomad.broadinstitute.org/api) to fetch the variant calls from the gnomad_r2_1 version from the Human GRCh37 genome assembly. The variants in this gnomad version (GRCh37/hg19) are derived from 125,748 exome sequences and 15,708 whole-genome sequences from unrelated individuals sequenced as part of various disease-specific and population genetic studies. For any variants that alter the amino acid sequence of a potentially mimicked peptide, we determined whether the mutation resulted in an exact match (Hamming distance = 0) to the corresponding 8-mer from the SARS-CoV-2 reference proteome.

To assess the impact of viral evolution on potential molecular mimicry, we queried the cardiac 8-mers with Hamming distance of 1 (including any alterations of these 8-mers arising from human genetic variation as described above) against all protein variants encoded in 4,854,709 SARS-CoV-2 genomes deposited in the GISAID database (last accessed 11/5/2021) [24]. Here, we determined whether any mutations in viral genomes (relative to the reference sequence) resulted in 8-mers which exactly match one or more cardiac peptides.

Evaluation of mimicked peptides for inclusion in immune epitopes

For any 8-mers which showed an exact match between a cardiac peptide (reference or variant sequences) and a SARS-CoV-2 peptide (reference or variant sequences), we queried the 8-mer using the Immune Epitope Database (IEDB; www.iedb.org) and Analysis Resource [23]. We searched for any linear peptide epitope with a Blast similarity of at least 90% from any human host that had positive experimental evidence in any assay (T cell, B cell, or MHC Ligand). No MHC class restrictions or disease filters were applied.

Data availability

The expression profiling analyses for the cardiac proteins were carried out using the count matrix derived from Genotype-Tissue Expression (GTEx) project V8 dataset (https://gtexportal.org/home/datasets). The data for single-cell datasets from heart tissue were reanalyzed from two different publicly available studies with the following GEO accession IDs: GSE134355 [20], GSE109819 [39], and GSE121893 [39]. The human variants were obtained from the gnomad 2.1.1 version that is publicly available for download at http://gnomad.broadinsitute.org. The antigenicity potential for the peptide matches were evaluated using the publicly accessible IEDB database (www.iedb.org).

References

Chung MK, Zidar DA, Bristow MR, Cameron SJ, Chan T, Harding CV, et al. COVID-19 and cardiovascular disease: from bench to bedside. Circ Res. 2021;128:1214–36.
Article CAS Google Scholar
Puntmann VO, Carerj ML, Wieters I, Fahim M, Arendt C, Hoffmann J, et al. Outcomes of cardiovascular magnetic resonance imaging in patients recently recovered from coronavirus disease 2019 (COVID-19). JAMA Cardiol. 2020;5:1265.
Article Google Scholar
Barda N, Dagan N, Ben-Shlomo Y, Kepten E, Waxman J, Ohana R, et al. Safety of the BNT162b2 mRNA Covid-19 vaccine in a nationwide setting. N Engl J Med. 2021;385:1078–90.
Article CAS Google Scholar
CDC. COVID-19 vaccination centers for disease control and prevention. 2020. https://www.cdc.gov/coronavirus/2019-ncov/vaccines/safety/myocarditis.html. Accessed 22 Feb 2022.
Witberg G, Barda N, Hoss S, Richter I, Wiessman M, Aviv Y, et al. Myocarditis after Covid-19 vaccination in a large health care organization. N Engl J Med. 2021;385:2132–9.
Article CAS Google Scholar
Simone A, Herald J, Chen A, Gulati N, Shen AY-J, Lewin B, et al. Acute myocarditis following COVID-19 mRNA vaccination in adults aged 18 years or older. JAMA Intern Med. 2021;181:1668.
Article CAS Google Scholar
Bozkurt B, Kamat I, Hotez PJ. Myocarditis with COVID-19 mRNA vaccines. Circulation. 2021;144:471–84.
Article CAS Google Scholar
Proal AD, VanElzakker MB. Long COVID or post-acute sequelae of COVID-19 (PASC): an overview of biological factors that may contribute to persistent symptoms. Front Microbiol. 2021;12:698169.
Article Google Scholar
Galeotti C, Bayry J. Autoimmune and inflammatory diseases following COVID-19. Nat Rev Rheumatol. 2020;16:413–4.
Article CAS Google Scholar
Kanduc D. From anti-SARS-CoV-2 immune responses to COVID-19 via molecular mimicry. Antibodies. 2020;9:E33.
Article Google Scholar
Gowthaman U, Eswarakumar VP. Molecular mimicry: good artists copy, great artists steal. Virulence. 2013;4:433–4.
Article Google Scholar
Cusick MF, Libbey JE, Fujinami RS. Molecular mimicry as a mechanism of autoimmune disease. Clin Rev Allerg Immunol. 2012;42:102–11.
Article CAS Google Scholar
Oldstone MBA. Molecular mimicry, microbial infection, and autoimmune disease: evolution of the concept. In: Oldstone MBA, editor. Molecular mimicry: infection-inducing autoimmune disease. Berlin Heidelberg: Springer; 2005. p. 1–17.
Adderson EE, Shikhman AR, Ward KE, Cunningham MW. Molecular analysis of polyreactive monoclonal antibodies from rheumatic carditis: human anti-N-acetylglucosamine/anti-myosin antibody V region genes. J Immunol. 1998;161:2020–31.
CAS PubMed Google Scholar
Cunningham MW, Antone SM, Smart M, Liu R, Kosanke S. Molecular analysis of human cardiac myosin-cross-reactive B- and T-cell epitopes of the group A streptococcal M5 protein. Infect Immun. 1997;65:3913–23.
Article CAS Google Scholar
Cunningham MW. Autoimmunity and molecular mimicry in the pathogenesis of post streptococcal heart disease. Front Biosci. 2003;8:s533–543.
Article CAS Google Scholar
Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, et al. The genotype-tissue expression (GTEx) project. Nat Genet. 2013;45:580–5.
Article CAS Google Scholar
Venkatakrishnan AJ, Puranik A, Anand A, Zemmour D, Yao X, Wu X, et al. Knowledge synthesis of 100 million biomedical documents augments the deep expression profiling of coronavirus receptors. eLife. 2020. https://doi.org/10.7554/eLife.58040.
Rozenblatt-Rosen O, Stubbington MJT, Regev A, Teichmann SA. The human cell atlas: from vision to reality. Nature. 2017;550:451–3.
Article CAS Google Scholar
Han X, Zhou Z, Fei L, Sun H, Wang R, Chen Y, et al. Construction of a human cell landscape at single-cell level. Nature. 2020;581:303–9.
Article CAS Google Scholar
UniProt Consortium. The universal protein resource (UniProt). Nucleic Acids Res. 2008;36:D190–195.
Article Google Scholar
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–43.
Article CAS Google Scholar
Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The immune epitope database (IEDB): 2018 update. Nucleic Acids Res. 2019;47:D339–D343.
Article CAS Google Scholar
Shu Y, McCauley J. GISAID: global initiative on sharing all influenza data – from vision to reality. Eur Surveill. 2017;22:30494.
Article Google Scholar
Schwarz T, Heiss K, Mahendran Y, Casilag F, Kurth F, Sander LE et al. SARS-CoV-2 Proteome-Wide Analysis Revealed Significant Epitope Signatures in COVID-19 Patients. Front Immunol 2021;12:629185.
Thorsen K, Dam VS, Kjaer-Sorensen K, Pedersen LN, Skeberdis VA, Jurevičius J, et al. Loss-of-activity-mutation in the cardiac chloride-bicarbonate exchanger AE3 causes short QT syndrome. Nat Commun. 2017;8:1696.
Article Google Scholar
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Article CAS Google Scholar
Ehrenfeld M, Tincani A, Andreoli L, Cattalini M, Greenbaum A, Kanduc D, et al. Covid-19 and autoimmunity. Autoimmun Rev. 2020;19:102597.
Article CAS Google Scholar
Anand P, Puranik A, Aravamudan M, Venkatakrishnan A, Soundararajan V. SARS-CoV-2 strategically mimics proteolytic activation of human ENaC. eLife. 2020;9:e58603.
Article CAS Google Scholar
Venkatakrishnan AJ, Kayal N, Anand P, Badley AD, Church GM, Soundararajan V. Benchmarking evolutionary tinkering underlying human-viral molecular mimicry shows multiple host pulmonary-arterial peptides mimicked by SARS-CoV-2. Cell Death Disco. 2020;6:96.
Article CAS Google Scholar
Corbett KS, Edwards DK, Leist SR, Abiona OM, Boyoglu-Barnum S, Gillespie RA, et al. SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature. 2020;586:567–71.
Article CAS Google Scholar
Walsh EE, Frenck RW, Falsey AR, Kitchin N, Absalon J, Gurtman A, et al. Safety and immunogenicity of two RNA-based Covid-19 vaccine candidates. N Engl J Med. 2020;383:2439–50.
Article CAS Google Scholar
Fujinami RS, von Herrath MG, Christen U, Whitton JL. Molecular mimicry, bystander activation, or viral persistence: infections and autoimmune disease. Clin Microbiol Rev. 2006;19:80–94.
Article CAS Google Scholar
Sewell AK. Why must T cells be cross-reactive? Nat Rev Immunol. 2012;12:669–77.
Article CAS Google Scholar
Borbulevych OY, Santhanagopolan SM, Hossain M, Baker BM. TCRs used in cancer gene therapy cross-react with MART-1/Melan-A tumor antigens via distinct mechanisms. J Immunol. 2011;187:2453–63.
Article CAS Google Scholar
Scott DR, Borbulevych OY, Piepenbrink KH, Corcelli SA, Baker BM. Disparate degrees of hypervariable loop flexibility control T-cell receptor cross-reactivity, specificity, and binding mechanism. J Mol Biol. 2011;414:385–400.
Article CAS Google Scholar
Garcia KC, Degano M, Pease LR, Huang M, Peterson PA, Teyton L, et al. Structural basis of plasticity in T cell receptor recognition of a self peptide-MHC antigen. Science. 1998;279:1166–72.
Article CAS Google Scholar
Wooldridge L, Ekeruche-Makinde J, van den Berg HA, Skowera A, Miles JJ, Tan MP, et al. A single autoimmune T cell receptor recognizes more than a million different peptides. J Biol Chem. 2012;287:1168–77.
Article CAS Google Scholar
Wang L, Yu P, Zhou B, Song J, Li Z, Zhang M, et al. Single-cell reconstruction of the adult human heart during heart failure and recovery reveals the cellular landscape underlying cardiac function. Nat Cell Biol. 2020;22:108–19.
Article CAS Google Scholar
Doddahonnaiah D, Lenehan PJ, Hughes TK, Zemmour D, Garcia-Rivera E, Venkatakrishnan AJ, et al. A literature-derived knowledge graph augments the interpretation of single cell RNA-seq datasets. Genes. 2021;12:898.
Article CAS Google Scholar
Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, et al. Comprehensive integration of single-cell data. Cell. 2019;177:1888–1902.e21.
Article CAS Google Scholar
Wolock SL, Lopez R, Klein AM. Scrublet: computational identification of cell doublets in single-cell transcriptomic data. Cell Syst. 2019;8:281–291.e9.
Article CAS Google Scholar
Wu F, Zhao S, Yu B, Chen Y-M, Wang W, Song Z-G et al. A new coronavirus associated with human respiratory disease in China. Nature 2020;579:265–269.

Download references

Author information

These authors contributed equally: Praveen Anand, Patrick J. Lenehan.

Authors and Affiliations

nference Labs, Bengaluru, Karnataka, 560017, India
Praveen Anand, Dhruti Patwardhan & Venky Soundararajan
nference, Cambridge, MA, 02139, USA
Patrick J. Lenehan, Michiel Niesen, Unice Yoo, Marcelo Montorzi, A. J. Venkatakrishnan & Venky Soundararajan
Southcoast Health, Fairhaven, MA, 02719, USA
Marcelo Montorzi

Authors

Praveen Anand
View author publications
Search author on:PubMed Google Scholar
Patrick J. Lenehan
View author publications
Search author on:PubMed Google Scholar
Michiel Niesen
View author publications
Search author on:PubMed Google Scholar
Unice Yoo
View author publications
Search author on:PubMed Google Scholar
Dhruti Patwardhan
View author publications
Search author on:PubMed Google Scholar
Marcelo Montorzi
View author publications
Search author on:PubMed Google Scholar
A. J. Venkatakrishnan
View author publications
Search author on:PubMed Google Scholar
Venky Soundararajan
View author publications
Search author on:PubMed Google Scholar

Contributions

VS, AJV, and PJL were involved in conceptualization, supervision, and drafting the final version of the manuscript. PA, PJL, UY, and DP were involved in data curation, formal analysis, validation, methodology, review/editing of the manuscript. MN and MM contributed to the study design and review/editing of the manuscript. All authors have approved the final version of the manuscript.

Corresponding authors

Correspondence to A. J. Venkatakrishnan or Venky Soundararajan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Anand, P., Lenehan, P.J., Niesen, M. et al. Genetic alteration of human MYH6 is mimicked by SARS-CoV-2 polyprotein: mapping viral variants of cardiac interest. Cell Death Discov. 8, 124 (2022). https://doi.org/10.1038/s41420-022-00914-9

Download citation

Received: 06 December 2021
Revised: 07 February 2022
Accepted: 24 February 2022
Published: 21 March 2022
DOI: https://doi.org/10.1038/s41420-022-00914-9

This article is cited by

Longitudinal efficacy and toxicity of SARS-CoV-2 vaccination in cancer patients treated with immunotherapy
- Pavlina Spiliopoulou
- Helena J. Janse van Rensburg
- Lillian L. Siu
Cell Death & Disease (2023)