Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus was first identified in China in December 20191 and rapidly spread across the globe, causing one of the most significant pandemics in the history of humanity2. Following the declaration of a global public health emergency by WHO in March 2020, the management of the pandemic, designated coronavirus disease 2019 (COVID-19), immediately proved challenging due to the wide range of clinical manifestations, the ability of the virus to spread by aerosol, and its ability to evolve into “variants” capable of evading the immune system even in previously infected or vaccinated hosts3. The virus is primarily adsorbed through the cellular receptor angiotensin 2 (ACE), which is widely expressed in the lung epithelium4. Consequently, in the severe form of COVID-19, pulmonary damage occurs in a significant proportion of subjects5.

The pathogenesis of the COVID-19 appears to exhibit a multifactorial nature. While individuals with compromised immunity, advanced age, and concomitant diseases are most susceptible to severe complications6, a subset of healthy younger individuals may also exhibit severe disease manifestations7,8. Furthermore, the incidence of disease9 and the distribution of viral strains exhibit considerable variation globally10,11, suggesting a specific host-virus interaction and then a role of the human leukocyte antigen (HLA) system12.

SARS-CoV-2 is a single-stranded, positive-sense RNA virus belonging to the Beta-coronavirus genus within the Coronaviridae family13,14. The genome of SARS-CoV-2 is approximately 30 kilobases in length and is flanked by 5ʹ and 3ʹ untranslated regions. It contains two large open reading frames (ORF1a and ORF1b), which encode 16 non-structural proteins (NSP-1–16), four structural proteins, namely spike (S), envelope (E), membrane (M), and nucleocapsid (N), and at least 9 accessory proteins15. The non-structural proteins comprise the viral replication-transcription complex16,17, whereas the structural proteins are responsible for the formation of the virion particle18. A crucial function in the virus’s capability to infect host cells is carried out by the S proteins, which specifically bind to the ACE2 receptor on the surface of human cells19. Given its pivotal role in viral entry, the S protein20 represents a primary target for vaccine development and therapeutic interventions against COVID-1921. While extensive studies have been conducted on S and N proteins22,23, comparatively less is known about the M protein. It is, however, the most abundant structural protein and could play an important role in the host immunological response24.

In this study, the immunological profile of subjects with a previous SARS-CoV-2 infection was analysed through the detection of antibodies IgG against S, N and M proteins. The results revealed a significant statistical association between IgG anti-M antibodies and pulmonary damage. Furthermore, an increasing amount of the antibodies was associated with a greater likelihood of developing lung injury.

Results

Production of recombinant proteins

Supplementary Figure S1, Panel A, depicts the three recombinant proteins purified using Ni–NTA affinity chromatography, separated by polyacrylamide gel electrophoresis. Panel B shows a Western blot analysis of the same proteins, employing a monoclonal antibody against the hexa-histidine (His) epitope.

Sample investigated

The study population consisted of 63 adults (50 male and 13 female) ranging from 18 to 81 years of age (mean = 47; median = 48). The interval between the date of molecular test positivity and serum collection exhibited a considerable range, spanning from a minimum of 21 days to a maximum of 328 days, with an average of 105 days. Image analysis revealed that 30 of 63 subjects had evidence of lung injury, of which 9 required oxygen therapy. A summary of the data is presented in Supplementary Table S1.

ELISA results and statistical analysis

Recombinant S and N proteins were found to detect IgG antibodies in 48 of 63 samples tested. Of the 15 samples that yielded negative results, 9 samples exhibited concordant results in both tests. The recombinant M protein was able to detect the presence of IgG antibodies in 40 out of 64 samples.

The χ2 association test was performed on the ELISA results of the three recombinant proteins stratified by lung injury. No association was found between the anti-S and anti-N IgG results and lung injury, conversely, a significant association with the anti-M IgG result was found (χ2 = 4.29; df = 1; p < 0.05) (Table 1).

Table 1 Correlation between IgG anti-M and lung injury.

A T-test analysis (Mann–Whitney U) was employed to conduct a comparison between the antibody levels of individuals with and without lung injury. For all recombinant proteins, higher levels of antibodies were found in subjects reporting lung injury, with a statistically significant difference. The largest effect size was observed for the IgG anti-M. The data are shown in Table 2.

Table 2 Antibody titer comparison.

A binomial regression logistic analysis was performed to assess the relationship between the antibody titers of IgG anti-S, anti-N, and anti-M and lung injury. The optical absorption, obtained in ELISA with the three antigens, was used as a continuous variable, while the presence or absence of lung injury served as the binomial variable.

The three models demonstrated statistical significance (Table 3).

Table 3 Relationship between antibody titer and lung injury.

The data indicated, however, that IgG anti-M antibodies were the higher risk factor for the development of lung injury, with an odds ratio of 6.8 (95% CI, lower 2.05; upper 22.5) (Table 3, panel A). In contrast, the odds ratio for IgG anti-S and anti-N were 1.8 and 1.67, respectively (Table 3, panel B, C). The results are presented in graphical form in Fig. 1, which plots the optical density (OD) values resulting from the ELISA analysis of the three recombinant proteins against the estimated probability of developing lung injury. The IgG anti-M protein exhibited optimal logistic behaviour, reaching the plateau (probability = 1) at approximately 4 OD. Conversely, the IgG anti-S and anti-N proteins exhibited linear-like behaviour without reaching the maximum risk probability. The potential interactions between the antibodies were also investigated. Nevertheless, the various statistical mixed logit models that were evaluated yielded unsatisfactory outcomes, resulting in a reduction in the odds ratio of IgG anti-M antibodies and the loss of statistical significance for IgG anti-S and anti-N antibodies (Supplementary Tables S2, S3). Figure 2 depicts the scatterplot resulting from the logit analysis of the three antibodies above described, with the analysed samples labelled according to the presence or absence of lung injury. Upon observation of the scatterplot in relation to IgG anti-M (Fig. 2, panel A), it can be reasonably deduced that the cutoff value utilized in the ELISA test (cutoff = 1.1), which serves to designate the test result as positive, was not appropriate for differentiating between individuals at low or elevated risk of developing lung injury. A new ELISA cutoff was established by employing the optimal Youden index (OD = 1.5)25 and the resulting data were then subjected to a novel χ2 association test. The results (Table 4) indicated a statistically significant association (p < 0.00001) and a larger effect size (Cramer’s V = 0.568) than in the first contingency table. The ELISA with the new cutoff exhibited a specificity of 91% and a sensitivity of 63% in the detection of lung injury.

Fig. 1
Fig. 1
Full size image

Graphical representation of the three logit functions. The panels illustrate the functions resulting from the estimation of the likelihood of lung injury (ordinate) plotted against the optical density (OD) resulting from the ELISA. As shown in panel A, only the membrane protein resembles a similar “S” shape and achieves the maximum probability (p = 1), while the other two proteins (spike, panel B; nucleocapsid, panel C) display a linear-like behavior and do not reach the maximum probability.

Fig. 2
Fig. 2
Full size image

Assessment of the optimal ELISA cutoff in lung injury detection. The figure depicts the graphical representation of the logit function obtained for the M, S, and N proteins, with samples labeled according to the presence or absence of lung injury (blue = absence; yellow = presence). The OD value (cutoff) for the ELISA to differentiate individuals at high risk of lung injury (OD = 1.5), as determined by Youden’s index, is shown.

Table 4 A new contingency table using optimized ELISA data.

Discussion

Despite global effort, the multifactorial nature of COVID-19 has thus far prevented elucidation of the pathogenetic mechanism leading to progressive lung damage26,27. The immunological system appears to play an important role, given the unbalanced immune response often observed in critical patients28,29. The role of the humoral response remains a matter of contention, with several studies reporting conflicting results regarding the correlation between virus-specific antibody titers and the progression to severe disease. The interpretation of the data is challenging because of the differing kinetics of the antibody response observed in patients following infection. A number of studies agree on a correlation between higher IgG antibody titers and more severe disease, with antibody levels sustained during the first 6 months, followed by a decline between six and twelve months. Conversely, in patients with mild or asymptomatic infection, IgG titers are lower, and antibody decay is faster in the first months after infection30,31,32,33,34. Noteworthy, these studies have primarily focused on antibodies against the spike and nucleocapsid proteins. The contribution of the two other structural proteins, the membrane and the envelope, remains substantially unknown.

The objective of the present study was to examine the immunoglobulin response of subjects with a previous SARS-CoV-2 infection and lung injury. The immunological profile of the sample under investigation was evaluated by the detection of antibodies using an in-house ELISA which employed specific portions of the spike, nucleocapsid, and membrane proteins.

A total of 63 subjects were analysed, of whom 30 exhibited evidence of lung injury (Supplementary Table 1). The preliminary analyses (Table 1) indicated a significant association (p < 0.05) between positivity to IgG anti-M and lung injury. However, the strength of this association was low (V = 0.275). The data were subsequently subjected to a binary logistic regression analysis. The results of the logistic regression analysis were consistent with those of the previous analysis and demonstrated a strong correlation between the risk of developing pulmonary lesions and the growing amount of IgG anti-M, with an odds ratio of 6.8 (Table 3A).

Similarly, the logistic regression analysis of the IgG anti-S and anti-N antibodies demonstrated a correlation with lung injury (Table 3B,C), with an odds ratio of 1.83 and 1.67, respectively. Analysis of the potential interactions of these antibodies with IgG anti-M did not, however, indicate any synergistic effects on the pulmonary damage. The results of the logit analysis of IgG anti-M in conjunction with the other two proteins and pulmonary damage produced a loss of statistical significance for IgG anti-S and anti-N. Contextually, the odds ratio for the IgG anti-M decreased from 6.8 to lower values (Supplementary Tables 2, 3). These results may suggest that the elevated levels of antibodies against S and N observed in subjects with lung injury (Table 2) are the consequence of a physiological, protective immune response masked by a concomitant underlying factor, rather than an indicator of pathological processes. Furthermore, as illustrated in Fig. 2 (panels B and C), there is no discernible pattern in the distribution of samples exhibiting or lacking lung injury within a broad range of optical density with respect to the S and N proteins. Conversely, the M protein (Fig. 2, panel A) showed a cluster of samples, primarily from subjects without lung lesions, within a narrow range of optical density (OD) values. The ELISA results with the M protein elaborated through the choice of a more appropriate ELISA cutoff point (OD = 1.5) in relation to lung injury (Table 4), revealed marked statistical significance (p < 0.00001) and an increased effect size (V = 0.568). The assay demonstrated a 91% specificity and a 63% sensitivity, indicating a very low risk of developing lung disease in the absence of antibodies IgG anti-M protein.

The data are also consistent with disease severity. As shown in Fig. 3, by stratifying the subjects on an ordinal scale in relation to lung injury as revealed by image analysis (low: grade 1; moderate: grade 2; high: grade 3), logistic analysis identified 6 out of 9 subjects who required oxygen therapy as having a high predicted probability of developing lung injury: specifically, 4 subjects in grade 3 (p = 0.9–1) and 2 subjects in grade 2 (p = 0.7–0.9).

Fig. 3
Fig. 3
Full size image

Relationship between disease likelihood and oxygen therapy. The figure shows the logit function obtained for M protein, with samples labelled according to the degree of lung injury as revealed by image analysis. The grading system employed was as follows: G0 (red) = absence of injury; G1 (blue) = low-grade injury; G2 (green) = medium-grade injury; G3 (violet) = high-grade injury. The asterisk indicates subjects who required oxygen therapy.

Regarding the low sensitivity of the ELISA employing the M antigen, it is important to note that the assay was developed using a portion of the protein, which was produced in a prokaryotic system, purified under denaturing conditions and then used with an in-house assay.

Furthermore, given the multifactorial nature of the disease, it is very likely that IgG anti-M is not the only factor involved12,35,36.

A risk factor for disease progression has been identified in the AB0 system. Despite the lack of complete agreement on which blood groups are protective and which are not, the majority of studies have revealed a protective role for the 0 group and a risk factor for the A group37.

The present study included 44 samples with available data on the AB0 system. The results of the association study with lung injury showed a significant relationship between the AB0 system and lung injury, but in contrast to the findings of previous studies. The A group was identified as a protective factor, while the 0 group was identified as a risk factor (χ2 = 10.7; df = 3; Fisher’s exact test, p = 0.0113; Cramer’s V = 0.494) (Supplementary Table 5). No synergistic effects with the ELISA results were observed.

Finally, subjects below 30 years old did not show lung injury (χ2 = 6.93; df = 1; Fisher’ exact test, p = 0.012; Cramer’s V = 0.334) (Supplementary Table 6).

This study has some limitations. As far as the authors know, this is the first description of a relationship between IgG anti-M and the risk of lung injury in subjects infected with SARS-CoV-2. Further studies are required to confirm these findings.

It is well established that levels of anti-S and anti-N antibodies decline over time. In this study, the sample collection period ranged from 15 to 328 days, suggesting the potential for underestimating antibody levels in response to the S and N proteins. Conversely, antibodies against the M protein have been reported to remain stable for at least 1 year38.

The samples were collected during the initial phase of the epidemic, and the recombinant proteins were produced using the circulating strain at the time (Wuhan-Hu-1, Spike_D614G). The subsequent viral variants’ behaviour remains unconfirmed, despite the fact that the viral M gene is well conserved among the different strains24,39.

The potential utility of IgG anti-M as a prognostic marker is contingent upon the temporal dynamics of antibody appearance. As illustrated in Table 1, the interval between molecular test positivity and serum collection ranges from 21 to 328 days, with only five serum samples collected within three weeks following the molecular diagnosis of infection.

It is also not possible to ascertain whether the risk of damage is due to the cross reactivity of IgG anti-M or to the antigen itself. The M protein represents the most abundant structural protein of SARS-CoV-2. It has been demonstrated that the antigen modulates the host immune response by reducing interferon β and λ production. Furthermore, it has been shown that the M protein promotes apoptosis and enhances the permeability of lung cells39,40,41,42.

It has recently been reported, however, that IgG anti-M can induce antibody-dependent cell-mediated cytotoxicity (ADCC) in effector cells of the immune system through the interaction of the Fc region with the Fcγ receptor. In addition, it was found that IgG anti-M antibodies from patients with severe COVID-19 were more potent stimulators of cytotoxicity than anti-M antibodies from patients with mild COVID-1943.

Overall, the data indicate a clear correlation between IgG anti-M antibodies and lung injuries, providing compelling evidence that this is an area worthy of further investigation.

Material and methods

Study and samples

The study adopted an observational approach. Samples were collected during the initial phase of the pandemic, from April 2020 to April 2021. They included 63 subjects diagnosed as positive for SARS-CoV-2 infection by molecular analysis of an oro-pharyngeal swab. None of the individuals had been vaccinated. Lung injury was evaluated by imaging techniques, including computed tomography (CT) scans and X-rays. All image analyses were conducted on subjects with respiratory distress or dyspnoea after the positive result of the molecular test. A specialized radiologist made the diagnosis of lung injury. Serum samples were collected and stored at a temperature of − 80 °C. A quantitative immunological profile was established for each participant through the detection of antibodies directed against specific portions of the viral proteins S, N and M, using enzyme-linked immunosorbent assay (ELISA).

Sera from healthy donors collected prior to the onset of the SARS-CoV-2 pandemic were used to set up the ELISA and served as negative controls.

Ethics declarations

The study was conducted in accordance with the Declaration of Helsinki.

All the data in this study were anonymized by deleting all sensitive information. All study participants provided informed consent, as required by Article 40 of Decree-Law No. 23 dated April 8, 2020, in Italy.

Ethical Committee of National Institute of Infectious Diseases (INMI) Lazzaro Spallanzani—IRCCS approved the study (approval number 2/2020).

Production of recombinant proteins

The recombinant viral proteins produced were as follows: Spike S1 (RBD), AA 292–573 (YP_009724390); Nucleocapsid, AA 1–209 (YP_009724397); Membrane: AA 109–222 (YP_009724393). The PCR fragments encoding the three recombinant proteins were retrotranscribed and amplified using the strain SARS-CoV-2 isolated in Italy during the initial epidemic wave (GSAID code: Italy/CDG1/2020|EPI ISL 412973|2020-02-20) as a template. The three RT-PCRs were assembled using the same protocol and amplification program. The reactions were set up in a final volume of 50 μl using the SuperScript™ III One-Step RT-PCR System kit (Thermo Fisher Scientific) with 0.3 μM of each primer and 1 μl of genomic RNA. The amplification programme consisted of a reverse transcription at 50 °C for 30 min, followed by 95 °C for 2 min and then 12 cycles of: 95 °C × 10 s, 58 °C × 30 s (ΔT − 0.5 °C each cycle), 68 °C × 40 s. A further 30 cycles were carried out with the following temperature profile: 95 °C × 10 s, 60 °C × 30 s and 68 °C × 1 min. The primers used are listed in Supplementary Table 4. All oligonucleotides were synthesised by Eurofins Genomics (https://eurofinsgenomics.eu/).

The three fragments were purified and digested with the restriction endonucleases listed in Supplementary Table 4, cloned into the previously opened pRSETC vector by the T4 DNA ligase, and subsequently propagated into the One Shot™ TOP10 chemically competent E. coli cells (Thermofisher). Purified vectors underwent sequencing analysis using the automated DNA sequencer CEQ 8000 (Beckman, Brea, CA, USA) prior to transformation with BL21-AI™ One Shot® Chemically Competent E. coli. Transformed cells were induced for 3 h and His-tagged recombinant proteins were purified by Ni–NTA affinity chromatography kit according to manufacturer’s instructions (QIAGEN, Hilden, Germany). The recombinant proteins were analysed and quantified by polyacrylamide gel electrophoresis (PAGE) and 2100 Bioanalyzer Instrument (Agilent, www.agilent.com). The specificity of the three recombinant proteins was also evaluated by western blot analysis using a monoclonal antibody targeting the poly-His tag.

Enzyme-linked immunosorbent assay (ELISA)

ELISAs were carried out using Costar 96-well plates (Corning, USA) coated with 100 ng/well recombinant protein in carbonate-bicarbonate buffer pH 9.6 for 18 h at 4 °C. The plates were blocked using 300 µl of 5% low-fat milk (www.genespin.com) in wash buffer (WB, 1× PBS buffer, 0.1% Tween20) at room temperature (RT) for a minimum of three hours. The sera samples were diluted 1/100 in 2% milk WB and incubated for 1 h at RT. All samples were run in duplicate. Four wash cycles were performed with 300 µl of WB. Subsequently, the samples were incubated with 100 µl of secondary antibody (peroxidase-conjugated anti-human IgG) diluted at 1:10,000 in WB for 1 h at room temperature. Following four cycles of washing, 100 µl of the substrate (OPD, SIGMA) was added, incubated, and the reaction stopped according to the manufacturer’s instructions. The absorbance was quantified at 490 nm using the VICTOR X4 instrument (PerkinElmer 2023 Multilabel Reader). The cut-off value was calculated by summing the mean of three or more negative sera and adding three times the standard deviation.

Statistical methods

The ELISA results were subjected to a t-test for independent samples (Mann–Whitney U-test), χ2-tests for associations, and binary logistic regressions (logit) using the Jamovi software (https://www.jamovi.org).