Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
Complete genome sequence of Sphingomonas sp. gentR, a high-level gentamicin-resistant bacterium
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 13 March 2026

Complete genome sequence of Sphingomonas sp. gentR, a high-level gentamicin-resistant bacterium

  • Yi Liu1,
  • Lijing Jiang1,
  • Jinhua Zhang1,
  • Qiufen Li1,2 &
  • …
  • Baosheng Liu  ORCID: orcid.org/0009-0002-9310-71591,2 

Scientific Data , Article number:  (2026) Cite this article

  • 1075 Accesses

  • 3 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Antimicrobial resistance
  • Bacterial genomics
  • DNA sequencing

Abstract

We present the complete genome sequence of Sphingomonas sp. gentR, a strain exhibiting high-level resistance to gentamicin (MIC = 40 mg/mL). The genome was assembled from hybrid Illumina and Nanopore sequencing data into a gap-free sequence of 4.0 Mbp, comprising one chromosome and two plasmids. A total of 3,692 coding sequences were predicted, with comprehensive functional annotation revealing genes associated with antibiotic resistance, stress adaptation, and metabolic diversity. Three confirmed resistance genes—ANT(2″)-Ia, ANT(3″)-IIa, and Sul1—were co-localized within a genomic island on plasmid B. This dataset provides insight into the genetic basis of high-level aminoglycoside resistance in Sphingomonas and serves as a valuable resource for studying horizontal gene transfer, environmental adaptation, and bioremediation potential. The genome sequence is publicly available under GenBank accessions CP144670–CP144672 and China National Genomics Data Center (accession number GWHDOHA00000000).

Similar content being viewed by others

Genome-wide identification of fitness-genes in aminoglycoside-resistant Escherichia coli during antibiotic stress

Article Open access 20 February 2024

A whole-genome sequencing dataset of nanopore raw signals for bacterial genotyping and methylation analysis

Article Open access 02 December 2025

Expanding the genome information on Bacillales for biosynthetic gene cluster discovery

Article Open access 21 November 2024

Data availability

The sequencing data and assembled genome sequence generated in this study have been deposited in publicly accessible repositories. The details are as follows:

Raw sequencing reads

The datasets have been deposited in the NCBI Sequence Read Archive (SRA) under BioProject accession number PRJNA1072271 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1072271). The data are organized under BioSample SAMN39740430 and SRA experiment accession SRR3618163746. The repository contains the following files:

• JNDY-SC.fq.gz – Compressed FASTQ file containing Nanopore long reads.

• JNDY-SC_1.clean.fq.gz – Compressed FASTQ file containing paired-end Illumina short reads (read 1).

• JNDY-SC_2.clean.fq.gz – Compressed FASTQ file containing paired-end Illumina short reads (read 2).

Assembled Genome Sequence

The complete, annotated genome assembly of Sphingomonas sp. gentR has been deposited in the NCBI GenBank database under the same BioProject PRJNA1072271 and BioSample SAMN39740430. The assembly is available as a FASTA file (assembly.fna) and is organized into three records:

• CP144670: Chromosome sequence.

• CP144671: Plasmid A sequence.

• CP144672: Plasmid B sequence.

Genome Warehouse Access

The complete genome sequence is also available in the Genome Warehouse of the China National Genomics Data Center under accession number GWHDOHA00000000 (https://ngdc.cncb.ac.cn/gwh).

These datasets are freely accessible and can be used to explore the genomic basis of high-level gentamicin resistance and other functional traits in Sphingomonas sp. gentR.

Code availability

All software tools used in this study have been properly cited or accompanied by relevant website links. Unless otherwise specified in the manuscript, all data analyses were performed using default parameters as described in the respective software manuals. The running codes for genome assembly and annotation using Unicycler and prokka, respectively, have been uploaded as a supplementary information (SI1) to Figshare.

References

  1. Chen, H. et al. Reclassification and emended description of Caulobacter leidyi as Sphingomonas leidyi comb. nov., and emendation of the genus Sphingomonas. Int. J. Syst. Evol. Microbiol. 62, 2835–2843 (2012).

    Google Scholar 

  2. Yabuuchi, E. et al. Proposals of Sphingomonas paucimobilis gen. nov. and comb. nov., Sphingomonas parapaucimobilis sp. Nov., Sphingomonas yanoikuyae sp. nov., Sphingomonas adhaesiva sp. nov., Sphingomonas capsulata comb. nov., and two genospecies of the genus Sphingomonas. Microbiol. Immunol. 34, 99–119 (1990).

    Google Scholar 

  3. White, D. C., Sutton, S. D. & Ringelberg, D. B. The genus sphingomonas: physiology and ecology. Curr. Opin. Biotechnol. 7, 301–306 (1996).

    Google Scholar 

  4. Li, Y. et al. Sphingomonas yabuuchiae sp. nov. and Brevundimonas nasdae sp. nov., isolated from the Russian space laboratory Mir. Int. J. Syst. Evol. Microbiol. 54, 819–825 (2004).

    Google Scholar 

  5. Kampfer, P. et al. Flavobacterium plantiphilum sp. nov., Flavobacterium rhizophilum sp. nov., Flavobacterium rhizosphaerae sp. nov., Chryseobacterium terrae sp. nov., and Sphingomonas plantiphila sp. nov. isolated from salty soil showing plant growth promoting potential. Syst. Appl. Microbiol. 48, 126588 (2025).

    Google Scholar 

  6. Liu, H. et al. Fed-batch fermentation strategy for efficient welan gum production by Sphingomonas sp. FM01. J. Sci. Food Agric. 105, 926–936 (2025).

    Google Scholar 

  7. Sanchez-Arroyo, A., Plaza-Vinuesa, L., de Las Rivas, B., Mancheno, J. M. & Munoz, R. Analysis of the subtype I amidohydrolase responsible for Ochratoxin A degradation in the Sphingomonas genus. Int. J. Biol. Macromol. 306, 141720 (2025).

    Google Scholar 

  8. Asaf, S., Numan, M., Khan, A. L. & Al-Harrasi, A. Sphingomonas: from diversity and genomics to functional role in environmental remediation and plant growth. Crit. Rev. Biotechnol. 40, 138–152 (2020).

    Google Scholar 

  9. Jiang, L. et al. Identification of a high-level gentamicin-resistant Sphingomonas strain and its antimicrobial susceptibility test. Biol. Disaster Sci. 48, 181–190 (2025). (in Chinese).

    Google Scholar 

  10. Vanbroekhoven, K. et al. Streptomycin as a selective agent to facilitate recovery and isolation of introduced and indigenous Sphingomonas from environmental samples. Environ. Microbiol. 6, 1123–1136 (2004).

    Google Scholar 

  11. Park, H. K. et al. Sphingomonas aeria sp. Nov. from indoor air of a pharmaceutical environment. Antonie Van Leeuwenhoek 107, 47–53 (2015).

    Google Scholar 

  12. Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 13, e1005595 (2017).

    Google Scholar 

  13. Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9, e112963 (2014).

    Google Scholar 

  14. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 9, 357–359 (2012).

    Google Scholar 

  15. Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).

    Google Scholar 

  16. Syberg-Olsen, M. J., Garber, A. I., Keeling, P. J., McCutcheon, J. P. & Husnik, F. Pseudofinder: detection of pseudogenes in prokaryotic genomes. Mol. Biol. Evol. 39, msac153 (2022).

    Google Scholar 

  17. Grissa, I., Vergnaud, G. & Pourcel, C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic. Acids. Res. 35, W52–W57 (2007).

    Google Scholar 

  18. Bertelli, C. et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic. Acids. Res. 45, W30–W35 (2017).

    Google Scholar 

  19. Arndt, D. et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic. Acids. Res. 44, W16–W21 (2016).

    Google Scholar 

  20. Blin, K. et al. AntiSMASH 8.0: extended gene cluster detection capabilities and analyses of chemistry, enzymology, and regulation. Nucleic. Acids. Res. 53, W32–W38 (2025).

    Google Scholar 

  21. The UniProt Consortium. UniProt: the universal protein knowledgebase in 2025. Nucleic. Acids. Res. 53, D609–D617 (2025).

    Google Scholar 

  22. Mistry, J. et al. Pfam: the protein families database in 2021. Nucleic. Acids. Res. 49, D412–D419 (2021).

    Google Scholar 

  23. Goldfarb, T. et al. NCBI RefSeq: reference sequence standards through 25 years of curation and annotation. Nucleic. Acids. Res. 53, D243–D257 (2025).

    Google Scholar 

  24. Haft, D. H. et al. TIGRFAMs and genome properties in 2013. Nucleic. Acids. Res. 41, D387–D395 (2012).

    Google Scholar 

  25. Aleksander, S. A. et al. The gene ontology knowledgebase in 2023. Genetics 224, iyad031 (2023).

    Google Scholar 

  26. Kanehisa, M., Furumichi, M., Sato, Y., Matsuura, Y. & Ishiguro-Watanabe, M. KEGG: biological systems database as a model of the real world. Nucleic. Acids. Res. 53, D672–D677 (2025).

    Google Scholar 

  27. Galperin, M. Y. et al. COG database update 2024. Nucleic. Acids. Res. 53, D356–D363 (2025).

    Google Scholar 

  28. Grant, J. R. et al. Proksee: in-depth characterization and visualization of bacterial genomes. Nucleic. Acids. Res. 51, W484–W492 (2023).

    Google Scholar 

  29. Chen, M. et al. Genome warehouse: a public repository housing genome-scale data. Genom. Proteomics Bioinformatics 19, 584–589 (2021).

    Google Scholar 

  30. Bai, X. et al. Database resources of the national genomics data center, China national center for bioinformation in 2024. Nucleic. Acids. Res. 52, D18–D32 (2024).

    Google Scholar 

  31. NGDC https://ngdc.cncb.ac.cn/bioproject/browse/PRJCA016435 (2024).

  32. NCBI GenBank http://identifiers.org/insdc.gca:GCA_036596345.1 (2024).

  33. Tatusova, T. et al. NCBI prokaryotic genome annotation pipeline. Nucleic. Acids. Res. 44, 6614–6624 (2016).

    Google Scholar 

  34. Li, W. et al. RefSeq: expanding the prokaryotic genome annotation pipeline reach with protein family model curation. Nucleic. Acids. Res. 49, D1020–D1028 (2021).

    Google Scholar 

  35. Liu, B. & Pop, M. ARDB–antibiotic resistance genes database. Nucleic. Acids. Res. 37, D443–D447 (2009).

    Google Scholar 

  36. Alcock, B. P. et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic. Acids. Res., gkz935 (2019).

  37. Alvarez, A. F. & Georgellis, D. Environmental adaptation and diversification of bacterial two-component systems. Curr. Opin. Microbiol. 76, 102399 (2023).

    Google Scholar 

  38. Wang, X. et al. SRNA molecules participate in hyperosmotic stress response regulation in sphingomonas melonis TY. Appl. Environ. Microbiol. 90, e0215823 (2024).

    Google Scholar 

  39. Yoon, S., Ha, S., Lim, J., Kwon, S. & Chun, J. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie. Van. Leeuwenhoek. 110, 1281–1286 (2017).

    Google Scholar 

  40. Meier-Kolthoff, J. P., Auch, A. F., Klenk, H. & Göker, M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics 14 (2013).

  41. Madhaiyan, M., Saravanan, V. S., Wirth, J. S. & Whitman, W. B. Reclassification of Sphingomonas aeria as a later heterotypic synonym of Sphingomonas carotinifaciens based on whole-genome sequence analysis. Int. J. Syst. Evol. Microbiol. 70, 2355–2358 (2020).

    Google Scholar 

  42. Ali, A. et al. Biotransformation of benzoin by Sphingomonas sp. LK11 and ameliorative effects on growth of Cucumis sativus. Arch. Microbiol. 201, 591–601 (2019).

    Google Scholar 

  43. Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7, 1–6 (2018).

    Google Scholar 

  44. Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).

    Google Scholar 

  45. Li, H. & Durbin, R. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    Google Scholar 

  46. NCBI Sequence Read Archive http://identifiers.org/dbest:SRP648589 (2025).

Download references

Acknowledgements

This work was financially supported by the National Natural Science Foundation of China (Grant No. 32360880, 31360553).

Author information

Authors and Affiliations

  1. Jiangxi Provincial Key Laboratory for Animal Health, College of Animal Science and Technology, Jiangxi Agricultural University, Nanchang, 330045, Jiangxi, China

    Yi Liu, Lijing Jiang, Jinhua Zhang, Qiufen Li & Baosheng Liu

  2. Institute of Veterinary Drug, Jiangxi Agricultural University, Nanchang, 330045, Jiangxi, China

    Qiufen Li & Baosheng Liu

Authors
  1. Yi Liu
    View author publications

    Search author on:PubMed Google Scholar

  2. Lijing Jiang
    View author publications

    Search author on:PubMed Google Scholar

  3. Jinhua Zhang
    View author publications

    Search author on:PubMed Google Scholar

  4. Qiufen Li
    View author publications

    Search author on:PubMed Google Scholar

  5. Baosheng Liu
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Jinhua Zhang and Baosheng Liu conceived the project. Yi Liu, Lijing Jiang and Qiufen Li collected the samples, performed the genome assembly, gene annotation and other bioinformatics analysis. Yi Liu and Lijing Jiang wrote the manuscript. Qiufen Li and Baosheng Liu revised the manuscript. Yi Liu and Lijing Jiang contributed equally to this work.

Corresponding authors

Correspondence to Qiufen Li or Baosheng Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Table S1 (download XLSX )

SI1 Codes running for assembly and annotation (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Jiang, L., Zhang, J. et al. Complete genome sequence of Sphingomonas sp. gentR, a high-level gentamicin-resistant bacterium. Sci Data (2026). https://doi.org/10.1038/s41597-026-06723-4

Download citation

  • Received: 22 September 2025

  • Accepted: 26 January 2026

  • Published: 13 March 2026

  • DOI: https://doi.org/10.1038/s41597-026-06723-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing