Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
The chromosomal-level genome assembly and annotation of Phyllospadix iwatensis (Surfgrass)
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 12 March 2026

The chromosomal-level genome assembly and annotation of Phyllospadix iwatensis (Surfgrass)

  • Junyi Wang1,
  • Dawei Wang2,
  • Ke Zhao2,
  • Zhining Liu2 na1 &
  • …
  • Quansheng Zhang1 na1 

Scientific Data , Article number:  (2026) Cite this article

  • 769 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Plant evolution
  • Plant molecular biology

Abstract

Phyllospadix iwatensis is a unique seagrass species adapted to rocky substrate anchorage and dioecy and belongs to marine submerged flowering plants with a distinctive evolutionary history. The chromosomal-scale genome was constructed by integrating Illumina, PacBio HiFi, and high-throughput chromosome conformation capture (Hi-C) sequencing techniques. A total of 340.56 Mb of sequences were anchored to 10 chromosomes with an anchoring rate of 96.44%. The contig and scaffold N50 values reached 30.64 Mb and 33.59 Mb, respectively. Precisely 94.64% of the 23,198 predicted protein-coding genes received functional annotation. In the meantime, 180.19 Mb of repetitive sequences were found, representing 52.91% of the assembled genome. The chromosomal-level genome data of P. iwatensis will reveal its special process of differentiation and enrich the understanding of the multiple adaptations of seagrass populations to marine habitats.

Similar content being viewed by others

Chromosome-level genome assembly for the ecologically and economically important alga Saccharina japonica

Article Open access 18 February 2025

Chromosome-level genome assembly and annotation of the Patagonian toothfish Dissostichus eleginoides

Article Open access 16 November 2024

Chromosome-scale genome assembly and annotation of Cotoneaster glaucophyllus

Article Open access 22 April 2024

Data availability

The complete dataset of P. iwatensis, including raw sequencing data (Illumina, PacBio, Hi-C, and RNA sequencing reads) and the assembled genome, is publicly available via the following repositories:

NCBI SRA:

https://identifiers.org/ncbi/insdc.sra:SRR34629676

https://identifiers.org/ncbi/insdc.sra:SRR34629675

https://identifiers.org/ncbi/insdc.sra:SRR34629674

https://identifiers.org/ncbi/insdc.sra:SRR34629673

NCBI GenBank: https://identifiers.org/ncbi/insdc:JBTXFO000000000.1

Figshare: https://doi.org/10.6084/m9.figshare.29652089

Code availability

All bioinformatics analyses in this study were performed in strict accordance with the guidelines of the respective tools. No custom scripts were developed; all operations adhered to the standard protocols of the employed software. These tools are publicly accessible, with detailed information on their versions and parameter settings provided in the Methods section.

References

  1. Short, F. T. et al. Extinction risk assessment of the world’s seagrass species. Biol. Conserv. 144, 1961–1971 (2011).

    Google Scholar 

  2. Unsworth, R. K. F., Cullen-Unsworth, L. C., Jones, B. L. H. & Lilley, R. J. The planetary role of seagrass conservation. Science. 377, 609–613 (2022).

    Google Scholar 

  3. McKenzie, L. J. et al. The global distribution of seagrass meadows. Environ. Res. Lett. 15, 74041 (2020).

    Google Scholar 

  4. Duffy, J. E. et al. Toward a Coordinated Global Observing System for Seagrasses and Marine Macroalgae. Front. Mar. Sci. 6, 317 (2019).

    Google Scholar 

  5. Gallagher, A. J. et al. Tiger sharks support the characterization of the world’s largest seagrass ecosystem. Nat. Commun. 13, 6328 (2022).

    Google Scholar 

  6. Olsen, J. L. et al. The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea. Nature 530, 331–335 (2016).

    Google Scholar 

  7. Ma, X. et al. Seagrass genomes reveal ancient polyploidy and adaptations to the marine environment. Nat. Plants 10, 240–255 (2024).

    Google Scholar 

  8. Ma, X. et al. Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass). F1000Research 10, 289 (2021).

    Google Scholar 

  9. Lee, H. et al. Genomic comparison of two independent seagrass lineages reveals habitat-driven convergent evolution. J. Exp. Bot. 69, 3689–3702 (2018).

    Google Scholar 

  10. Lee, H. et al. The genome of a Southern Hemisphere seagrass species (Zostera muelleri). Plant Physiol. 172, 272–283 (2016).

    Google Scholar 

  11. Van De Peer, Y., Mizrachi, E. & Marchal, K. The evolutionary significance of polyploidy. Nat. Rev. Genet. 18, 411–424 (2017).

    Google Scholar 

  12. Böse, M., Lüthgens, C., Lee, J. R. & Rose, J. Quaternary glaciations of northern Europe. Quat. Sci. Rev. 44, 1–25 (2012).

    Google Scholar 

  13. Sullivan, B. K. & Short, F. T. Taxonomic revisions in Zosteraceae (Zostera, Nanozostera, Heterozostera and Phyllospadix). Aquat. Bot. 187, 103636 (2023).

    Google Scholar 

  14. Cao, Z., Zhang, W. & Zhao, H. Morphology and anatomy of Phyllospadix iwatensis Makino and their adaptation to marine environment. Oceanol. Limnol. Sin. 46, 1326–1332 (2015).

    Google Scholar 

  15. Han, T. et al. The seed morphology and internal characteristics of seagrass, surfgrass Phyllospadix iwatensis. J. Trop. Oceanogr. 41, 105–113 (2022).

    Google Scholar 

  16. Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).

    Google Scholar 

  17. Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).

    Google Scholar 

  18. Ranallo-Benavidez, T. R., Jaron, K. S. & Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 11, 1432 (2020).

    Google Scholar 

  19. The Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).

    Google Scholar 

  20. Su, X. et al. A high-continuity and annotated tomato reference genome. BMC Genomics 22, 898 (2021).

    Google Scholar 

  21. Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).

    Google Scholar 

  22. Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).

    Google Scholar 

  23. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    Google Scholar 

  24. Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. 117, 9451–9457 (2020).

    Google Scholar 

  25. Bao, Z. & Eddy, S. R. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 12, 1269–1276 (2002).

    Google Scholar 

  26. Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005).

    Google Scholar 

  27. Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).

    Google Scholar 

  28. Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics 9, 18 (2008).

    Google Scholar 

  29. Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).

    Google Scholar 

  30. Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinforma. 4, 4.10.1–4.10.14 (2009).

    Google Scholar 

  31. Beier, S., Thiel, T., Münch, T., Scholz, U. & Mascher, M. MISA-web: a web server for microsatellite prediction. Bioinformatics 33, 2583–2585 (2017).

    Google Scholar 

  32. Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).

    Google Scholar 

  33. Stanke, M., Diekhans, M., Baertsch, R. & Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 24, 637–644 (2008).

    Google Scholar 

  34. Korf, I. Gene finding in novel genomes. BMC Bioinformatics 5, 59 (2004).

    Google Scholar 

  35. Keilwagen, J. et al. Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 44, e89–e89 (2016).

    Google Scholar 

  36. Hou, X., Wang, D., Cheng, Z., Wang, Y. & Jiao, Y. A near-complete assembly of an Arabidopsis thaliana genome. Mol. Plant 15, 1247–1250 (2022).

    Google Scholar 

  37. An, D. et al. Plant evolution and environmental adaptation unveiled by long-read whole-genome sequencing of Spirodela. Proc. Natl. Acad. Sci. 116, 18893–18899 (2019).

    Google Scholar 

  38. Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).

    Google Scholar 

  39. Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).

    Google Scholar 

  40. Tang, S., Lomsadze, A. & Borodovsky, M. Identification of protein-coding regions in RNA transcripts. Nucleic Acids Res. 43, e78–e78 (2015).

    Google Scholar 

  41. Nip, K. M. et al. RNA-Bloom enables reference-free and reference-guided sequence assembly for single-cell transcriptomes. Genome Res. 30, 1191–1200 (2020).

    Google Scholar 

  42. Haas, B. J. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).

    Google Scholar 

  43. Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).

    Google Scholar 

  44. Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).

    Google Scholar 

  45. Finn, R. D. Pfam: clans, web tools and services. Nucleic Acids Res. 34, D247–D251 (2006).

    Google Scholar 

  46. Boeckmann, B. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).

    Google Scholar 

  47. Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 47, D309–D314 (2019).

    Google Scholar 

  48. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR34629676 (2026).

  49. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR34629675 (2026).

  50. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR34629674 (2026).

  51. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR34629673 (2026).

  52. NCBI GenBank https://identifiers.org/ncbi/insdc:JBTXFO000000000.1 (2026).

  53. Wang, J. The chromosomal-level genome assembly and annotation ofPhyllospadix iwatensis(Surfgrass), Figshare, https://doi.org/10.6084/m9.figshare.29652089 (2026).

  54. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).

    Google Scholar 

  55. Parra, G., Bradnam, K. & Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067 (2007).

    Google Scholar 

  56. Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).

    Google Scholar 

Download references

Acknowledgements

This reaserch was supported by the National Natural Science Foundation of China (NO. 42476112) and the Shandong Provincial Bureau of Geology and Mineral Resources project (NO. HJ202510).

Author information

Author notes
  1. These authors jointly supervised this work: Zhining Liu, Quansheng Zhang.

Authors and Affiliations

  1. Ocean School, Yantai University, Yantai, 264005, China

    Junyi Wang & Quansheng Zhang

  2. No. 6 Geological Team, Shandong Provincial Bureau of Geology and Mineral Resources, Weihai, 264209, China

    Dawei Wang, Ke Zhao & Zhining Liu

Authors
  1. Junyi Wang
    View author publications

    Search author on:PubMed Google Scholar

  2. Dawei Wang
    View author publications

    Search author on:PubMed Google Scholar

  3. Ke Zhao
    View author publications

    Search author on:PubMed Google Scholar

  4. Zhining Liu
    View author publications

    Search author on:PubMed Google Scholar

  5. Quansheng Zhang
    View author publications

    Search author on:PubMed Google Scholar

Contributions

L.Z.N. and Z.Q.S. conceived and designed the study, secured funding, and participated in manuscript writing, review, and editing. W.D.W. and Z.K. conducted the experiments and analyzed the data. W.J.Y. analyzed the data and drafted the initial manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding authors

Correspondence to Zhining Liu or Quansheng Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, J., Wang, D., Zhao, K. et al. The chromosomal-level genome assembly and annotation of Phyllospadix iwatensis (Surfgrass). Sci Data (2026). https://doi.org/10.1038/s41597-026-06911-2

Download citation

  • Received: 28 July 2025

  • Accepted: 17 February 2026

  • Published: 12 March 2026

  • DOI: https://doi.org/10.1038/s41597-026-06911-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing