Abstract
Respiratory pathogens are a significant source of global morbidity, mortality, and economic burden, with the COVID-19 pandemic driving increased interest in and funding for respiratory disease surveillance. Syndromic panel multiplex nucleic acid amplification tests (NAATs) such as the BioFire Respiratory Panel (RP) are designed to identify the most common etiologic agents of respiratory illness. Untargeted metagenomic sequencing is a powerful tool for pathogen-agnostic detection, enabling the recovery of complete genomes for genomic epidemiology and variant tracking. In this study, we performed untargeted metagenomic sequencing of 305 samples previously negative by BioFire RP and SARS-CoV-2 testing and 26 samples that were previously positive by either of the diagnostic tests. A subset of 78 samples underwent probe-capture enrichment sequencing targeting human viruses. Using these methods, we identified human respiratory viruses in 16 of the 305 previously negative samples (5%). The most common viruses identified were Influenza C virus, Human Bocavirus, Rhinovirus A and C, and SARS-CoV-2. Consensus genomes were recovered for 14 viruses with > 90% coverage breadth, revealing closely related Bocavirus strains from neighboring counties and distinct Rhinovirus strains across samples. We also identified 21 samples with a single predominant bacterial or fungal species in the previous negative cohort. These findings underscore the challenges of identifying causal agents from multiplex NAAT-negative cases and highlight the utility of metagenomics for expanding the scope of pathogen surveillance.
Similar content being viewed by others
Data availability
The de-identified untargeted metagenomic and TWIST-enriched Illumina sequencing datasets are publicly available in the NCBI Sequencing Read Archive under BioProject PRJNA1296491.
References
Sirota, S. B. et al. Global, regional, and national burden of upper respiratory infections and otitis media, 1990–2021: a systematic analysis from the Global Burden of Disease Study 2021. Lancet Infect. Dis. 25, 36–51 (2025).
Eales, O. et al. Key challenges for respiratory virus surveillance while transitioning out of acute phase of COVID-19 pandemic. Emerg. Infect. Disease J. 30, 1 (2024).
Clark, E. C. et al. Changes to public health surveillance methods due to the COVID-19 pandemic: scoping review. JMIR Public. Health Surveill. 10, e49185 (2024).
WHO. WHO Policy Brief: COVID-19 Surveillance (World Health Organization, 2024).
Tisza, M. et al. Wastewater sequencing reveals community and variant dynamics of the collective human Virome. Nat. Commun. 14, 6878 (2023).
Wyler, E. et al. Pathogen dynamics and discovery of novel viruses and enzymes by deep nucleic acid sequencing of wastewater. Environ. Int. 190, 108875 (2024).
Smith Matthew, F. et al. Seasonality of respiratory, enteric, and urinary viruses revealed by wastewater genomic surveillance. mSphere 9, e00105-00124 (2024).
Rajagopala, S. V. et al. Metatranscriptomics to characterize respiratory virome, microbiome, and host response directly from clinical samples. Cell. Rep. Methods. 1, 100091 (2021).
Cheemarla, N. R. et al. Nasal host response-based screening for undiagnosed respiratory viruses: a pathogen surveillance and detection study. Lancet Microbe. 4, e38–e46 (2023).
Madi, N., Al-Nakib, W., Mustafa, A. S. & Habibi, N. Metagenomic analysis of viral diversity in respiratory samples from patients with respiratory tract infections in Kuwait. J. Med. Virol. 90, 412–420 (2018).
Gauthier, N. P. G. et al. Validation of an Automated, End-to-End metagenomic sequencing assay for agnostic detection of respiratory viruses. J. Infect. Dis. 230, e1245–e1253 (2024).
Tan, J. K. et al. Laboratory validation of a clinical metagenomic next-generation sequencing assay for respiratory virus detection and discovery. Nat. Commun. 15, 9016 (2024).
Ceballos-Garzon, A., Comtet-Marre, S. & Peyret, P. Applying targeted gene hybridization capture to viruses with a focus to SARS-CoV-2. Virus Res. 340, 199293 (2024).
Mourik, K. et al. Comparison of the performance of two targeted metagenomic virus capture probe-based methods using reference control materials and clinical samples. J. Clin. Microbiol. 62, e00345–e00324 (2024).
Wani, A. K. et al. Metagenomics in the fight against zoonotic viral infections: A focus on SARS-CoV-2 analogues. J. Virol. Methods. 323, 114837 (2024).
Cooksey, G. L. S. et al. Severe acute respiratory syndrome coronavirus 2 and respiratory virus Sentinel Surveillance, California, USA, May 10, 2020-June 12, 2021. Emerg. Infect. Dis. 28, 9–19 (2022).
Creager, H. M. et al. Clinical evaluation of the BioFire® respiratory panel 2.1 and detection of SARS-CoV-2. J. Clin. Virol. 129, 104538 (2020).
Chen, S. F. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. Imeta https://doi.org/10.1002/imt2.107 (2023).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Danecek, P. et al. Twelve years of samtools and BCFtools. Gigascience. 10, giab008 (2021).
Li, H. et al. The sequence Alignment/Map format and samtools. Bioinformatics 25, 2078–2079 (2009).
Katz, K. et al. The sequence read archive: a decade more of explosive growth. Nucleic Acids Res. 50, D387–D390 (2022).
Stop neglecting fungi. Nat. Microbiol. 2, 17120 (2017).
NCBI. Human Read Removal Tool, (2023). https://ncbiinsights.ncbi.nlm.nih.gov/2023/02/02/scrubbing-human-sequences-sra-submissions/
Kim, D., Song, L., Breitwieser, F. P. & Salzberg, S. L. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 26, 1721–1729 (2016).
Sayers, E. W. et al. Database resources of the National center for biotechnology information. Nucleic Acids Res. 50, D20–D26 (2022).
Martí Jose, M. et al. Addressing the dynamic nature of reference data: a new nucleotide database for robust metagenomic classification. mSystems 10, e01239-01224 (2025).
Marti, J. M. & Recentrifuge Robust comparative analysis and contamination removal for metagenomics. PLoS Comput. Biol. 15, e1006967 (2019).
McMurdie, P. J. & Holmes, S. Phyloseq: an R package for reproducible interactive analysis and graphics of Microbiome census data. PLoS One. 8, e61217 (2013).
Steinegger, M. & Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 35, 1026–1028 (2017).
Lu, X. et al. US CDC Real-Time reverse transcription PCR panel for detection of severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 26, 1654–1665 (2020).
Sederdahl, B. K. & Williams, J. V. Epidemiology and clinical characteristics of influenza C virus. Viruses. 12, 89–89 (2020).
Rayamajhi Thapa, R., Nascimento-Carvalho, C., Allander, T., Jartti, T. & Söderlund-Venermo, M. A Diagnostic approach to separate acute human bocavirus 1 respiratory tract infection from long-lasting virus shedding. J. Infect. Dis. jiaf130 (2025).
Uršič, T. et al. Acute human bocavirus 1 infection in children hospitalized for acute bronchiolitis: a 2-Year prospective study. Pediatr. Infect. Dis. J. 44, 195–199 (2025).
Goldstein, E. J. C., Murphy, T. F. & Parameswaran, G. I. Moraxella catarrhalis, a human respiratory tract pathogen. Clin. Infect. Dis. 49, 124–131 (2009).
Yi, H., Yong, D., Lee, K., Cho, Y. J. & Chun, J. Profiling bacterial community in upper respiratory tracts. BMC Infect. Dis. 14, 583 (2014).
Liu, Y. et al. Outcomes of respiratory viral-bacterial co-infection in adult hospitalized patients. eClinicalMedicine 37 (2021).
Meskill, S. D. & O’Bryant, S. C. Respiratory virus Co-infection in acute respiratory infections in children. Curr. Infect. Dis. Rep. 22, 3 (2020).
Hanada, S., Pirzadeh, M., Carver, K. Y. & Deng, J. C. Respiratory viral Infection-Induced Microbiome alterations and secondary bacterial pneumonia. Front. Immunol. 9 (2018).
Acknowledgements
This work was supported by the Lawrence Livermore National Laboratory’s Laboratory Derived Research & Development program. We extend gratitude to Dr. Seema Jain, Dr. Jake Pry, Gail Sondermeyer Cooksey, Samuel Schildhauer, Lauren Linde, and the rest of the CDPH and local health department CalSRVSS Team. We are grateful to the CDPH VRDL Team for their work processing, managing, and testing CalSRVSS samples at CDPH including Cynthia Bernas, Brandon Brown, Alice Chen, Nick D’Angelo, Ambar Espinoza, Bianca Gonzaga, Ydelita Gonzales, Hugo Guevara, April Hatada, Blanca Molinar, Lisa Moua, Nichole Osugi, Tasha Padilla, Chao-Yang Pan, Clarence Reyes, Estela Saguar, Maria Salas, Ioana Seritan, Hilary Tamnanchit, Maria Uribe Fuentes, Chelsea Wright. The residual samples used in this study were obtained from the original community surveillance project CalSRVSS which was funded in part and by the Centers for Disease Control and Prevention Epidemiology and Laboratory Capacity for Community Surveillance (grant 6 Nu50CK000539). Disclaimer The findings and conclusions in this article are those of the authors and do not necessarily represent the views or opinions of the California Department of Public Health, the California Health and Human Services Agency or Lawrence Livermore National Laboratory. This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Funding
The residual samples used in this study were obtained from the original community surveillance project CalSRVSS which was funded in part and by the Centers for Disease Control and Prevention Epidemiology and Laboratory Capacity for Community Surveillance (grant 6 Nu50CK000539).
Author information
Authors and Affiliations
Contributions
RSK, ACM, CRK and CJ contributed to the writing of the of the draft manuscript. RSK, CRK, ACM and JBT analyzed and interpreted the data. CJ, DAW, CM and SM acquired the samples. JBT, ACM, MB, CJ, DAW, CM and SM designed the study and experiments. ACM, JBT, DAW, SM and CM performed experiments. CJ and DAW contributed to project management. All authors contributed to reviewing and editing the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Mascarenhas, A.C., Kantor, R.S., Thissen, J. et al. Metagenomic sequencing identifies potential respiratory pathogens in PCR-negative subset of surveillance samples. Sci Rep (2026). https://doi.org/10.1038/s41598-025-33917-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-33917-4


