Introduction

Invasive pulmonary aspergillosis (IPA) is the most severe form of Aspergillus-related pulmonary infection and has a poor prognosis and a high mortality rate1. IPA primarily affects immunocompromised individuals, such as those with hematologic diseases, recipients of organ transplants, patients with cancer undergoing chemotherapy or radiation, and those with neutropenia2,3. It also occurs in patients with chronic obstructive pulmonary disease (COPD), bronchiectasis, diabetes, pulmonary tuberculosis, and coronavirus disease 2019 (COVID-19)4,5,6. Its clinical presentation is nonspecific, which complicates the diagnostic process. This issue is particularly pronounced in non-neutropenic IPA, which tends to progress rapidly and has a worse prognosis7,8. Hence, early and accurate diagnosis of IPA is important.

Although the gold standard for diagnosis of IPA is the detection of Aspergillus via histopathological examination and culture of clinical specimens, the sensitivity of fungal smears and the delayed results from fungal cultures hamper its early detection. Furthermore, patients with suspected IPA frequently have underlying pulmonary structural issues, severe thrombocytopenia, and other critical conditions, which increase the risk of histopathological examination by computed tomography (CT)-guided percutaneous or transbronchoscopic biopsy. Testing for galactomannan (GM) in serum and bronchoalveolar lavage fluid (BALF) is commonly used to diagnose aspergillosis. However, it is susceptible to interference from antifungal treatments and neutrophil counts, which can lead to false negatives. Furthermore, there is no agreement on an appropriate threshold, so manufacturers’ analytical thresholds are used2,9,10. Consequently, there is a pressing need to identify noninvasive or minimally invasive markers that enable the rapid and accurate diagnosis of IPA.

Metagenomic next-generation sequencing (mNGS) is a non-targeted, high-throughput, and sensitive nucleic acid-based technique used to detect pathogens causing infections of the lower respiratory tract11. It has shown promise for diagnosing infectious diseases but its use for IPA is supported mainly by case reports and small studies12,13,14. Further studies are needed to validate these results. Here, we investigated the value of mNGS and GM testing for the early diagnosis of IPA compared to conventional microbiological tests (CMTs).

Materials and methods

Study design and participants

The protocols were approved by the Medical Ethics Committee of Henan Provincial People’s Hospital (approval no. 2023151) and were performed in accordance with its regulations and guidelines. This retrospective study involved 120 patients clinically diagnosed with IPA at Henan Provincial People’s Hospital between June 2021 and February 2024. The control group included 117 patients with non-IPA pulmonary infections. All of the enrolled patients had undergone mNGS testing. According to the criteria of the European Organization for Research and Treatment of Cancer and the Mycoses Study Group Education and Research Consortium (EORTC/MSGERC)15, the diagnosis of IPA is based on the following: host risk factors for Aspergillus infection; clinical features, including persistent fever unresponsive to antibiotics for 96 h, and typical imaging characteristics such as patchy infiltrates with or without a halo sign, air-crescent sign, cavitation, multiple nodules, and wedge-shaped and segmental or lobar consolidation; mycological evidence, including microscopical detection of fungal elements in valid respiratory tract specimens, positive Aspergillus polymerase chain reaction (PCR), and any single serum GM ≥ 1.0 or BALF-GM ≥ 1.0 or single serum GM ≥ 0.7 and BALF-GM ≥ 0.8. According to these criteria, IPA is categorized as proven IPA (based on histopathological examination and/or culture positivity for Aspergillus), probable IPA (based on clinical diagnosis, host factors, clinical features, and at least one mycological criterion), and possible IPA (based on host factors and clinical features but without mycological evidence). Of the 120 patients with IPA enrolled in this study, which we termed clinically diagnosed IPA, 62 had proven IPA and 58 had probable IPA.

Collection of clinical data

Clinical data were collected from the electronic medical records system, including variables such as sex, age, hospitalization duration, underlying diseases, clinical symptoms, laboratory-related parameters, imaging features, histopathological examination results, treatments, and outcomes.

Conventional microbiological tests

All participants underwent bronchoscopy conducted by an experienced endoscopist according to the 2019 Guideline for Diagnostic Flexible Bronchoscopy in Adults16. Based on CT of the chest, the most affected segmental or subsegmental bronchi were selected for bronchoalveolar lavage and bronchial brushing to obtain BALF and bronchial brush specimens. This method can reduce specimen contamination by directly obtaining specimens from the lower respiratory tract. Extracted BALF was placed in a sterile container and immediately transported to the microbiology laboratory for mNGS testing and CMTs. Blood samples and suitable sputum samples were also collected.

Respiratory specimens (sputum, BALF, and bronchial brushings) were subjected to fluorescence microscopy and fungal fluorescence staining (Jiangsu Lifetime Biological Technology Co., Ltd., China) to visualize filamentous fungi. A sample was considered positive if septate hyaline hyphae branching at 45 degrees, suggesting Aspergillus species, were observed.

Sputum and BALF were cultured using Sabouraud dextrose agar (SDA) at 25 °C for 5 days. Fluffy colonies were transferred to potato dextrose agar (PDA) and incubated at 25℃ for 3–7 days for evaluation of colony characteristics and conidiophore morphology. Aspergillus species were identified via matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (Bruker, Germany) and/or morphology.

BALF supernatant samples and serum were analyzed for GM using a Dynamiker Aspergillus Galactomannan Assay (Dynamiker Biotechnology [Tianjin] Co., Ltd., China), which is based on a sandwich enzyme-linked immunosorbent assay (ELISA). Samples were evaluated using the DNK-A600 Automated ELISA Reader (Dynamiker Biotechnology) according to the manufacturer’s recommendations. A serum GM > 0.5 ng/mL and BALF-GM > 0.8 ng/mL were considered positive according to the threshold stipulated in the manufacturer’s instructions.

mNGS procedures

Host depletion, DNA extraction, library construction and sequencing

Lower respiratory tract infections (LRTIs) were diagnosed using a DNA-based mNGS standard operating procedure. Briefly, pathogens and human cells were separated from 1 mL BALF samples via centrifugation at 12,000 g for 5 min. Host nucleic acid was removed from the precipitate using 1 U Benzonase (Sigma) and 0.5% Tween 20 (Sigma) at 37 °C for 5 min. Terminal buffer (400 µL) was added to halt the reaction. The mixture (600 µL) was transferred to a new tube containing 500 µL ceramic beads and homogenized using Minilys Personal TGrinder H24 Homogenizer (Tiangen, China). Nucleic acids were extracted and eluted from 400 µL pretreated samples using a QIAamp UCP Pathogen Mini Kit in 60 µL elution buffer (Qiagen, Germany). DNA was quantified using a Qubit dsDNA HS Assay Kit (Invitrogen, CA)17,18. Next, cDNA was produced from 10 µL purified RNA. The KAPA Low-Throughput Library Construction Kit (KAPA Biosystems) was used to create a DNA/cDNA library in accordance with the manufacturer’s instructions. Library quality was assessed using the Qubit dsDNA HS Assay Kit and High-Sensitivity DNA Kit (Agilent) on an Agilent 2100 Bioanalyzer. Library pools were loaded onto an Illumina Nextseq CN500 sequencer for 75 cycles of single-end sequencing, generating approximately 20 million reads per library.

Bioinformatics analysis

Trimmomatic software was used to eliminate low-quality reads, duplicate reads, adapter contamination, and reads shorter than 70 bp19. Low-complexity reads were removed using Kcomplexity with the default settings. Human sequence data were located and eliminated using SNAP v. 1.0beta.18 to match the human sequence data to the hg38 reference genome20. The Kraken 2 criteria for selecting representative assemblies for microorganisms (bacteria, viruses, fungi, protozoa, and other multicellular eukaryotic pathogens) from the NCBI Assembly and Genome databases (https://benlangmead.github.io/aws-indexes/k2) were used to create the microbial genome database. Microbial reads were aligned to the database using Burrows-Wheeler Aligner software21. Reads with 90% identity to the reference were defined as mapped reads. In addition, reads with multiple locus alignments within the same genus were excluded from the secondary analysis. Only reads mapped to the genome of the same species were considered. The clinical reportable ranges (CRRs) for pathogens were established according to the Johns Hopkins ABX Guide22 (https://www.hopkinsguides.com/hopkins/index/Johns_Hopkins_ABX_Guide/Pathogens), The Manual of Clinical Microbiology23, and clinical case reports or research articles published in peer-reviewed journals.

Criteria for a positive mNGS result

The specifically mapped read number (SMRN) of each microbial taxon was normalized to SMRN per 20 million (M) of total sequencing reads (standardized SMRN, SDRN). The criteria for reporting mNGS results were as follows24,25: SDRN ≥ 3 (mycobacteria excluded) for bacteria, SDRN ≥ 3 for fungi/DNA viruses, SDRN ≥ 1 for RNA viruses, SDRN ≥ 100 for parasites, SDRN ≥ 3 for Mycoplasma/Chlamydia species, SDRN ≥ 1 for the Mycobacterium tuberculosis (MTB) complex, and positive mNGS for Nocardia species.

Quality control

To monitor sources of potential contamination, NC and sterile deionized water as non-template controls were prepared in parallel in each batch24. We used sterile cotton swabs dipped in sterile deionized water to wipe the surfaces of the centrifuge and biosafety cabinet to generate a list of the background microorganisms present in our laboratory.

Statistical analysis

Statistical analyses were performed using R (v. 4.3.3). pROC (v. 1.18.5) was used to calculate the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Youden Index, and area under the receiver operating characteristic curve (AUC). ggplot2 (v. 3.5.1) was used to draw ROC curves. Normally distributed continuous data are expressed as means ± standard deviations, and between-group comparisons were performed using independent ttests. Non-normally distributed continuous data are expressed as medians (interquartile ranges) and were compared using Mann–Whitney Utests. Categorical data are described as frequencies and percentages and were compared using χ2 or Fisher’s exact tests. Clinical diagnosis of IPA was used as the gold standard to calculate sensitivity, specificity, PPV, and NPV. The diagnostic performances of serum GM and BALF-GM were assessed using three reported cutoff values1,7,9, and ROC curves were plotted. The optimal cutoff value for BALF-GM was determined using the Youden Index, and the use of mNGS and BALF-GM testing in combination for IPA was evaluated via serial and parallel analyses. The serial analysis was defined as both mNGS and BALF-GM being positive for Aspergillus, while the parallel analysis was defined as either mNGS or BALF-GM being positive for Aspergillus. Statistical significance was set as a two-sided Pvalue < 0.05.

Results

Clinical characteristics

Among the 237 patients, there were 120 IPA and 117 non-IPA patients. There were no statistically significant differences in sex or age between the groups (P > 0.05). The proportion of patients with underlying diseases was significantly higher in the IPA group than the non-IPA group. The CT imaging findings differed between the groups (Table 1).

Table 1 Clinical characteristics between IPA and Non-IPA groups.

Diagnostic performance of mNGS and conventional microbiological methods

Using clinical diagnosis of IPA as the reference standard, mNGS identified 111 Aspergillus-specific sequences in 120 patients with IPA and 7 in 117 patients who did not have IPA. The mNGS and CMTs showed high consistency (89 of 120, 74.2%) for diagnosing IPA, whereas mNGS identified the only mycological evidence in 31 IPA cases that were missed by CMTs. In comparison, nine patients with IPA missed by mNGS were diagnosed by CMTs. The sensitivity, specificity, PPV, and NPV of mNGS were 92.5%, 94.02%, 94.07%, and 92.44%, respectively. The sensitivity of a BALF GM ≥ 1 for diagnosing IPA was 48.33% with a specificity of 94.02%, which was second only to mNGS. Other conventional microbiological methods showed notably lower sensitivity than mNGS (Table 2).

Table 2 Diagnostic performance of mNGS and conventional microbiological methods in IPA patients.

Diagnostic performance of different cutoff values for serum GM and BALF-GM

For serum GM tests, at cutoff values of 0.5, 0.7, and 1, sensitivity decreased to 35%, 27.5%, and 18.33%, respectively, and specificity increased to 93.16%, 96.58%, and 97.44% (Table 3). For BALF-GM tests, sensitivity decreased to 67.5%, 53.33%, and 48.33% with cutoff values of 0.5, 0.8, and 1, respectively, and specificity increased to 79.49%, 88. 89%, and 94.02% (Table 4). The AUC for serum GM detection was 0.61. The optimal cutoff value for serum GM, determined using the Youden Index, was 0.295, corresponding to a sensitivity of 46.7% and specificity of 85.5%. The AUC for BALF GM detection was 0.8. The optimal cutoff value for BALF-GM, determined using the Youden Index, was 0.546, corresponding to a sensitivity of 66.7% and specificity of 82.1% (Fig. 1).

Table 3 Diagnostic efficacy of Different Cutoff Values for Serum GM in IPA patients.
Table 4 Diagnostic efficacy of different cutoff values for BALF-GM in IPA patients.
Fig. 1
figure 1

ROC curve of serum GM and BALF-GM testing for diagnosing IPA. IPA, invasive pulmonary aspergillosis; ROC, receiver operating characteristic; GM, galactomannan antigen; BALF, bronchoalveolar lavage fluid; AUC, area under curve.

Diagnostic performance of mNGS and BALF-GM testing in combination for IPA

Using a BALF-GM cutoff value of 0.546, the diagnostic performance of mNGS combined with BALF-GM for IPA was calculated using serial and parallel analyses; the results are given in Table 5; Fig. 2.

Table 5 Diagnostic efficacy for mNGS combination with BALF-GM in IPA patients.
Fig. 2
figure 2

ROC curve of BALF-GM or/and mNGS testing for diagnosing IPA. IPA, invasive pulmonary aspergillosis; ROC, receiver operating characteristic; GM, galactomannan antigen; BALF, bronchoalveolar lavage fluid; AUC, area under curve (AUC).

Discussion

As a group of pulmonary mycotic diseases, IPA often affects immunocompromised patients and can take an angio-invasive or airway-invasive form1,26. The incidence of IPA has increased in recent years because of the aging population; the widespread use of broad-spectrum antibiotics, antineoplastic drugs, and immunosuppressants; the COVID-19 pandemic; and the high frequency of invasive procedures and organ transplantation27,28,29,30. Given the rapid progression of IPA, early diagnosis is crucial to improve outcomes. However, its diagnosis is hampered by a lack of specific clinical symptoms and the difficulty obtaining tissue, enhancing the importance of microbiological evidence31,32,33.

Fungal culture of respiratory specimens is the most used diagnostic method34,35 but is hampered by contamination and airway colonization, and it is time-consuming with low diagnostic efficiency36,37. Direct smear microscopy, although more rapid than culture, depends on the microbial load in the sample and the skill of the microbiologist, resulting in low sensitivity and potential bias10,38. In this study, sputum smear, bronchial brushings, and BALF cultures had sensitivities < 30%, and sputum culture had the lowest sensitivity (13.33%). These limitations make these methods unsuitable for the early, rapid, and accurate diagnosis of IPA.

GM testing is an immunological method for the diagnosis of IPA. Serum GM testing has low sensitivity and is prone to false negatives, and it is useful primarily for diagnosing IPA in neutropenic patients39. In non-neutropenic patients, the sensitivity of BALF-GM testing is superior to serum GM testing15,40. However, BALF-GM testing can be affected by, for example, β-lactam antibiotics, intravenous drugs containing galactomannan, antifungal medications, mild disease, and low antigen concentrations, leading to false-positive and false-negative results. Moreover, the optimum cutoff values are unclear9,10. In this study, BALF-GM testing had a sensitivity of 48.33% for diagnosing IPA, significantly higher than serum GM (18.33%) and conventional respiratory-specimen methods, suggesting that BALF-GM has potential for the early diagnosis of IPA. The optimum cutoff for BALFGM is unclear, and the sensitivity of serum GM and BALF-GM is 24.3–46.9% and 54.5–80.0%, respectively, according to the cutoff value used41,42,43 In this study, we compared the diagnostic utility of several cutoff values for GM testing; sensitivity decreased as the cutoff value increased. For accurate diagnosis of IPA, the optimal cutoff value for BALF-GM, determined using an ROC curve and the maximum Youden Index, was 0.546, which had a sensitivity of 66.7% and specificity of 82.1%. A variety of factors influence GM results, so this BALF-GM cutoff value may be suitable at our institution but possibly not at others.

mNGS testing involves the extraction of microbial nucleic acids from clinical specimens followed by high-throughput sequencing and bioinformatics to identify microbes, thereby enabling the detection of the pathogens causing infectious diseases44. However, its diagnostic utility in IPA is unclear. Some studies have suggested that it is more rapid, sensitive, and precise than conventional microbiological methods, although those studies had small sample sizes and were retrospective45,46,47,48. We evaluated its diagnostic utility in a large number of IPA cases, comparing it sputum smear microscopy, sputum culture, BALF culture, and bronchial brushings, as well as serum and BALF-GM methods. We also explored the diagnostic utility of mNGS and BALF-GM testing in combination. mNGS alone had a sensitivity of 92.5% for diagnosing IPA, which increased to 96.67% when combined with BALF-GM testing, with a specificity of 78.63%. This performance significantly surpasses conventional microbiological methods. Given the rapid disease progression and poor prognosis of IPA, particularly in high-risk populations, there is a need for highly sensitive diagnostic methods to facilitate early antifungal treatment. When interpreting reports on the use of mNGS for IPA, the focus ought to be not only on the number of specific sequences and genome coverage of the identified pathogenic fungi but also the clinical manifestations, imaging findings, patient-specific factors, the results of other microbiological and pathological tests, and other indirect indicators of infection. The ability of mNGS to identify diverse pathogens is an important advantage. In patients with IPA, who have a high rate of mixed infections, mNGS facilitates the detection and differentiation of co-pathogens, thereby enabling selection of anti-infective treatments.

The use of mNGS in routine clinical practice has several issues. The need for costly and time-consuming (24–48 h) data analysis is a barrier to its widespread use, particularly in developing countries. The interpretation of mNGS results lacks universal consensus, hampering the determination of whether the microorganisms detected are pathogens or colonizers. Clinicians must consider the characteristics, laboratory tests, and imaging findings of suspected cases. Therefore, mNGS cannot replace conventional microbiological tests.

This study had several limitations. First, as a single-center, retrospective study, selection bias was inevitable. Second, Aspergillus PCR was not used to validate the diagnosis because this test is not routinely performed at our hospital. Finally, the clinical diagnostic standard used for case inclusion acknowledges that even experienced physicians cannot avoid clinical misjudgments, which can lead to discrepancies in diagnostic efficacy among methods. Therefore, the utility of mNGS for the diagnosis of IPA requires further clinical validation.

Conclusion

In this study, mNGS had good diagnostic performance for IPA, which was enhanced by its combination with BALF-GM testing. It enabled the identification of pathogens in mixed infections, underscoring its utility for the early diagnosis and targeted treatment of IPA.