Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Communications
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. nature communications
  3. articles
  4. article
XA-Novo: high-throughput mass spectrometry-based de novo sequencing technology for monoclonal antibodies and antibody mixtures
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 12 March 2026

XA-Novo: high-throughput mass spectrometry-based de novo sequencing technology for monoclonal antibodies and antibody mixtures

  • Yueting Xiong  ORCID: orcid.org/0009-0008-8471-98001 na1,
  • Wenbin Jiang  ORCID: orcid.org/0000-0003-2812-85191,2 na1,
  • Jin Xiao1 na1,
  • Qingfang Bu1,
  • Jingyi Wang1,
  • Zhenjian Jiang3,
  • Ling Luo1,
  • Xiaoqing Chen1,
  • Yijie Qiu1,
  • Yangtao Wu  ORCID: orcid.org/0000-0002-9429-266X1,
  • Fan Liu1,
  • Rongshan Yu  ORCID: orcid.org/0000-0003-2179-173X1,2,4,
  • Ningshao Xia  ORCID: orcid.org/0000-0003-0179-52661 &
  • …
  • Quan Yuan  ORCID: orcid.org/0000-0001-5487-561X1 

Nature Communications , Article number:  (2026) Cite this article

  • 4564 Accesses

  • 2 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Mass spectrometry
  • Protein sequencing
  • Proteomics

Abstract

Elucidating antibody sequences by mass spectrometry-based de novo sequencing is essential but remains technically challenging. Here we present XA-Novo, an accurate and high-throughput de novo sequencing solution that integrates a single-pot multi-enzymatic gradient digestion method with a beam search-based assembler (Fusion) to reconstruct full-length antibody sequences directly from bottom-up mass spectrometry data. Benchmarking across well-characterized antibodies from multiple species demonstrates that XA-Novo outperforms commercial solutions in identification sensitivity, sequence completeness, and reconstruction accuracy. Furthermore, XA-Novo successfully reconstructs six immunotherapeutic antibodies with unknown sequences, and in vitro/vivo assays validate that these generated antibodies exhibit functionality equivalent to their commercial counterparts. Moreover, XA-Novo achieves over 99.54% accurate sequence coverage in distinguishing mixed COVID-19 neutralizing antibodies, exceeding the performance of current assemblers reported for single-antibody sequencing. Overall, XA-Novo establishes a reliable, scalable, and broadly applicable workflow for routine antibody sequencing, thereby accelerating both fundamental antibody research and therapeutic antibody development.

Similar content being viewed by others

De novo protein sequencing of antibodies for identification of neutralizing antibodies in human plasma post SARS-CoV-2 vaccination

Article Open access 10 October 2024

Single-chain dimers from de novo immunoglobulins as robust scaffolds for multiple binding loops

Article Open access 23 September 2023

Unveiling inverted D genes and D-D fusions in human antibody repertoires unlocks novel antibody diversity

Article Open access 28 January 2025

Data availability

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the partner repository iProX under the dataset identifier PXD060500. The weights of the Casanovo model utilized in XA-Novo and the split datasets for training Casanovo are both available on Zenodo [https://zenodo.org/records/17266057] and [https://zenodo.org/records/18627093], respectively. Additionally, the revised nine-species benchmark dataset by Wen et al. is available on Zenodo [https://zenodo.org/records/13653420]. Unless otherwise stated, all data supporting the results of this study can be found in the article, supplementary, and source data files. Source data are provided with this paper.

Code availability

Result files and code to reproduce the results in this study are available on GitHub [https://github.com/biocc/SP-MEGD_Fusion]. An executable version of the code and the computational environment used in this study are also available as a Code Ocean capsule [https://codeocean.com/capsule/4653442/tree].

References

  1. Lu, L. L., Suscovich, T. J., Fortune, S. M. & Alter, G. Beyond binding: antibody effector functions in infectious diseases. Nat. Rev. Immunol. 18, 46–61 (2018).

    Google Scholar 

  2. Oostindie, S. C., Lazar, G. A., Schuurman, J. & Parren, P. Avidity in antibody effector functions and biotherapeutic drug design. Nat. Rev. Drug Discov. 21, 715–735 (2022).

    Google Scholar 

  3. Watson, C. T., Glanville, J. & Marasco, W. A. The Individual and Population Genetics of Antibody Immunity. Trends Immunol. 38, 459–470 (2017).

    Google Scholar 

  4. Ejazi, S. A., Ghosh, S. & Ali, N. Antibody detection assays for COVID-19 diagnosis: an early overview. Immunol. Cell Biol. 99, 21–33 (2021).

    Google Scholar 

  5. Ning, L., Abagna, H. B., Jiang, Q., Liu, S. & Huang, J. Development and application of therapeutic antibodies against COVID-19. Int J. Biol. Sci. 17, 1486–1496 (2021).

    Google Scholar 

  6. Lu, R. M. et al. Development of therapeutic antibodies for the treatment of diseases. J. Biomed. Sci. 27, (2020).

  7. Mason, D. M. et al. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600–612 (2021).

    Google Scholar 

  8. Mattsson, J. et al. Sequence enrichment profiles enable target-agnostic antibody generation for a broad range of antigens. Cell Rep. Methods 3, 100475 (2023).

    Google Scholar 

  9. Parray, H. A. et al. Hybridoma technology a versatile method for isolation of monoclonal antibodies, its applicability across species, limitations, advancement and future perspectives. Int Immunopharmacol. 85, 106639 (2020).

    Google Scholar 

  10. Tomita, M. & Tsumoto, K. Hybridoma technologies for antibody production. Immunotherapy 3, 371–380 (2011).

    Google Scholar 

  11. Subas et al. NAb-seq: an accurate, rapid, and cost-effective method for antibody long-read sequencing in hybridoma cell lines and single B cells. MAbs 14, 2106621 (2022).

    Google Scholar 

  12. Chen, Y. et al. Barcoded sequencing workflow for high throughput digitization of hybridoma antibody variable domain sequences. J. Immunol. Methods 455, 88–94 (2018).

    Google Scholar 

  13. Schardt, J. S., Sivaneri, N. S. & Tessier, P. M. Monoclonal antibody generation using single B-cell screening for treating infectious diseases. BioDrugs 38, 477–486 (2024).

    Google Scholar 

  14. de Graaf, S. C., Hoek, M., Tamara, S. & Heck, A. J. R. A perspective toward mass spectrometry-based de novo sequencing of endogenous antibodies. MAbs 14, 2079449 (2022).

    Google Scholar 

  15. Schulte, D., Peng, W. & Snijder, J. Template-based assembly of proteomic short reads for de novo antibody sequencing and repertoire profiling. Anal. Chem. 94, 10391–10399 (2022).

    Google Scholar 

  16. Le Bihan, T. et al. De novo protein sequencing of antibodies for identification of neutralizing antibodies in human plasma post SARS-CoV-2 vaccination. Nat. Commun. 15, 8790 (2024).

    Google Scholar 

  17. Sen, K. I. et al. Automated Antibody De Novo sequencing and its utility in biopharmaceutical discovery. J. AM Soc. Mass Spectr. 28, 803–810 (2017).

    Google Scholar 

  18. Gadush, M. V. et al. Template-Assisted De Novo Sequencing of SARS-CoV-2 and Influenza Monoclonal Antibodies By Mass Spectrometry. J. Proteome Res. 21, 1616–1627 (2022).

    Google Scholar 

  19. He, M.-T. et al. Do-It-Yourself De Novo Antibody Sequencing Workflow That Achieves Complete Accuracy Of The Variable Regions. J. Proteome Res. 24, 3062–3073 (2025).

    Google Scholar 

  20. Pinto, D. et al. Broad betacoronavirus neutralization by a stem helix-specific human antibody. Science 373, 1109–1116 (2021).

    Google Scholar 

  21. Ye, X. et al. Integrated proteomics sample preparation and fractionation: Method development and applications. Trends Anal. Chem. 120, 115667 (2019).

    Google Scholar 

  22. Tsiatsiani, L. & Heck, A. J. R. Proteomics beyond trypsin. FEBS J. 282, 2612–2626 (2015).

    Google Scholar 

  23. Morsa, D. et al. Multi-enzymatic limited digestion: the next-generation sequencing for proteomics? J. Proteome Res. 18, 2501–2513 (2019).

    Google Scholar 

  24. Yilmaz M., Fondrie W., Bittremieux W., Oh S., Noble W. S. De novo mass spectrometry peptide sequencing with a transformer model. Int. Conf. Mach. Learn. 162, 25514-25522 (2022).

  25. Yilmaz, M. et al. Sequence-to-sequence translation from mass spectra to peptides with a transformer model. Nat. Commun. 15, 6427 (2024).

    Google Scholar 

  26. Cao, Y. et al. BA.2.12.1, BA.4 and BA.5 escape antibodies elicited by Omicron infection. Nature 608, 593–602 (2022).

    Google Scholar 

  27. Wu, Y. et al. Lineage-mosaic and mutation-patched spike proteins for broad-spectrum COVID-19 vaccine. Cell Host Microbe 30, 1732–1744.e1737 (2022).

    Google Scholar 

  28. Tran, N. H. et al. Complete De Novo assembly of monoclonal antibody sequences. Sci. Rep. 6, 31730 (2016).

    Google Scholar 

  29. Schulte, D. & Snijder, J. A Handle on Mass Coincidence Errors in De Novo sequencing of antibodies by bottom-up proteomics. J. Proteome Res. 23, 3552–3559 (2024).

    Google Scholar 

  30. Guthals, A. et al. De Novo MS/MS sequencing of native human antibodies. J. Proteome Res. 16, 45–54 (2017).

    Google Scholar 

  31. Guthals, A., Clauser, K. R., Frank, A. M. & Bandeira, N. Sequencing-Grade De novo Analysis of MS/MS Triplets (CID/HCD/ETD) From Overlapping Peptides. J. Proteome Res. 12, 2846–2857 (2013).

    Google Scholar 

  32. Peng, W. et al. Reverse-engineering the anti-MUC1 antibody 139H2 by mass spectrometry-based de novo sequencing. Life Sci. Alliance 7, e202302366 (2024).

    Google Scholar 

  33. Chen, D. S. & Mellman, I. Oncology meets immunology: the cancer-immunity cycle. Immunity 39, 1–10 (2013).

    Google Scholar 

  34. Zhao, Y. L. et al. Comparison of the characteristics of macrophages derived from murine spleen, peritoneal cavity, and bone marrow. J. Zhejiang Univ. Sci. B 18, 1055–1063 (2017).

    Google Scholar 

  35. Bronte, V. & Pittet, M. ikaelJ. The spleen in local and systemic regulation of immunity. Immunity 39, 806–818 (2013).

    Google Scholar 

  36. Khoury, D. S. et al. Neutralizing antibody levels are highly predictive of immune protection from symptomatic SARS-CoV-2 infection. Nat. Med. 27, 1205–1211 (2021).

    Google Scholar 

  37. Cameroni, E. et al. Broadly neutralizing antibodies overcome SARS-CoV-2 Omicron antigenic shift. Nature 602, 664–670 (2022).

    Google Scholar 

  38. Tortorici, M. A. et al. Broad sarbecovirus neutralization by a human monoclonal antibody. Nature 597, 103–108 (2021).

    Google Scholar 

  39. Copin, R. et al. The monoclonal antibody combination REGEN-COV protects against SARS-CoV-2 mutational escape in preclinical and human studies. Cell 184, 3949–3961.e3911 (2021).

    Google Scholar 

  40. Cao, Y. et al. Rational identification of potent and broad sarbecovirus-neutralizing antibody cocktails from SARS convalescents. Cell Rep. 41, 111845 (2022).

    Google Scholar 

  41. Wang, M. et al. Assembling the Community-Scale Discoverable Human Proteome. Cell Syst. 7, 412–421.e415 (2018).

    Google Scholar 

  42. Beslic, D., Tscheuschner, G., Renard, B. Y., Weller, M. G. & Muth, T. Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly. Brief. Bioinform. 24, bbac542 (2022).

    Google Scholar 

  43. Wen, B. & Noble, W. S. A multi-species benchmark for training and validating mass spectrometry proteomics machine learning models. Sci. Data 11, 1207 (2024).

    Google Scholar 

  44. Tran, N. H., Zhang, X., Xin, L., Shan, B. & Li, M. De novo peptide sequencing by deep learning. Proc. Natl. Acad. Sci. USA 114, 8247–8252 (2017).

    Google Scholar 

  45. Kessner, D., Chambers, M., Burke, R., Agus, D. & Mallick, P. ProteoWizard: open source software for rapid proteomics tools development. Bioinformatics 24, 2534–2536 (2008).

    Google Scholar 

  46. Needleman, S. B. & Wunsch, C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48, 443–453 (1970).

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Science and Technology Major Project for Innovative Drug Research and Development (2025ZD1803703 to Q.Y.); the National Natural Science Foundation of China (32401237 to Y.X. and 92369110 to Q.Y.); the Fujian Provincial Natural Science Foundation of China (2024J08358 to Y.X.); the Natural Science Foundation of Xiamen, China (3502Z202371039 to Y.X.); the State Key Laboratory of Vaccines for Infectious Diseases, Xiang An Biomedicine Laboratory (2025XAKJ0200001 to R.Y.); and the Scientific Research Foundation of the State Key Laboratory of Vaccines for Infectious Diseases (2024SKLVDzy06 to Y.X.).

Author information

Author notes
  1. These authors contributed equally: Yueting Xiong, Wenbin Jiang, Jin Xiao.

Authors and Affiliations

  1. State Key Laboratory of Vaccines for Infectious Diseases, Xiang An Biomedicine Laboratory, National Institute for Data Science in Health and Medicine, School of Public Health, Xiamen University, Fujian, China

    Yueting Xiong, Wenbin Jiang, Jin Xiao, Qingfang Bu, Jingyi Wang, Ling Luo, Xiaoqing Chen, Yijie Qiu, Yangtao Wu, Fan Liu, Rongshan Yu, Ningshao Xia & Quan Yuan

  2. School of Informatics, Xiamen University, Fujian, China

    Wenbin Jiang & Rongshan Yu

  3. Department of Pathology, Zhongshan Hospital, Fudan University (Xiamen Branch), Fujian, China

    Zhenjian Jiang

  4. Aginome Scientific, Xiamen, Fujian, China

    Rongshan Yu

Authors
  1. Yueting Xiong
    View author publications

    Search author on:PubMed Google Scholar

  2. Wenbin Jiang
    View author publications

    Search author on:PubMed Google Scholar

  3. Jin Xiao
    View author publications

    Search author on:PubMed Google Scholar

  4. Qingfang Bu
    View author publications

    Search author on:PubMed Google Scholar

  5. Jingyi Wang
    View author publications

    Search author on:PubMed Google Scholar

  6. Zhenjian Jiang
    View author publications

    Search author on:PubMed Google Scholar

  7. Ling Luo
    View author publications

    Search author on:PubMed Google Scholar

  8. Xiaoqing Chen
    View author publications

    Search author on:PubMed Google Scholar

  9. Yijie Qiu
    View author publications

    Search author on:PubMed Google Scholar

  10. Yangtao Wu
    View author publications

    Search author on:PubMed Google Scholar

  11. Fan Liu
    View author publications

    Search author on:PubMed Google Scholar

  12. Rongshan Yu
    View author publications

    Search author on:PubMed Google Scholar

  13. Ningshao Xia
    View author publications

    Search author on:PubMed Google Scholar

  14. Quan Yuan
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Y.X., W.J. and J.X. contributed equally to this work. W.J., Y.X., R.Y., N.X. and Q.Y. conceived and designed the study. J.X. and J.W. conducted mass spectrometry experiments. Q.B., X.C. and Y.W. prepared recombinant antibodies and performed animal experiments. W.J. developed the algorithms. Z.J., L.L., Y.Q. and F.L. performed data analysis. Y.X. and W.J. wrote the draft of the manuscript. R.Y., N.X. and Q.Y. revised and edited the manuscript. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Yueting Xiong, Rongshan Yu, Ningshao Xia or Quan Yuan.

Ethics declarations

Competing interests

R.Y. is a shareholder of Aginome Scientific. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Albert Heck, Samantha Sarrett, and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Data 1 (download XLSX )

Summary (download PDF )

Transparent Peer Review file (download PDF )

Source data

Source Data 1 (download XLSX )

Source Data 2 (download XLSX )

Source Data 3 (download XLSX )

Source Data 4 (download XLSX )

Source Data 5 (download XLSX )

Source Data 6 (download XLSX )

Source Data 7 (download XLSX )

Source Data 8 (download XLSX )

Source Data 9 (download XLSX )

Source Data 10 (download XLSX )

Source Data 11 (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiong, Y., Jiang, W., Xiao, J. et al. XA-Novo: high-throughput mass spectrometry-based de novo sequencing technology for monoclonal antibodies and antibody mixtures. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70496-y

Download citation

  • Received: 16 January 2025

  • Accepted: 26 February 2026

  • Published: 12 March 2026

  • DOI: https://doi.org/10.1038/s41467-026-70496-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Videos
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Editors
  • Journal Information
  • Open Access Fees and Funding
  • Calls for Papers
  • Editorial Values Statement
  • Journal Metrics
  • Editors' Highlights
  • Contact
  • Editorial policies
  • Top Articles

Publish with us

  • For authors
  • For Reviewers
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Communications (Nat Commun)

ISSN 2041-1723 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research