Abstract
Paracondylactis sinensis is a burrowing sea anemone inhabiting soft sediments along the Chinese coast, representing an ecologically and economically important actiniarian species. Despite its unique adaptations to hypoxia and sediment-associated stressors, genomic resources for burrowing sea anemones have been lacking. Here, we report a high-quality, chromosome-level genome assembly of P. sinensis. With PacBio HiFi long reads (39.77 × coverage), Illumina short reads, and Hi-C data, a 210.63 Mb genome with a contig N50 of 8.70 Mb and a scaffold N50 of 9.41 Mb was generated. A total of 93.44% of the assembly was anchored to 19 pseudo-chromosomes. BUSCO analysis indicated 95.91% completeness, confirming high assembly quality. Comprehensive annotation identified 19,420 protein-coding genes, of which 91.35% were functionally annotated. Repetitive elements accounted for 26.43% of the genome, with transposable elements representing 20.47%. This genome provides a crucial reference for understanding the genetic basis of environmental adaptation in P. sinensis and supports future efforts in its conservation, aquaculture, and bioactive compound exploration.
Similar content being viewed by others
Background &Summary
Although the majority of sea anemones are sessile, certain species have evolved a burrowing character by embedding themselves in soft sediments1,2. These burrowing sea anemones play crucial roles in benthic ecosystems. Their movements create microhabitats for small organisms and facilitate biogeochemical cycling and material exchange between water and sediment layers. Unlike other sessile species that are surrounded solely by seawater, burrowing sea anemones inhabit a heterogeneous microenvironment composed of water mixed with sand or mud. To survive under such conditions, they have evolved tolerance to stressors such as hypoxia, pathogens and strongly reducing conditions3,4. Despite their distinct adaptations and ecological significance, genomic resources of burrowing sea anemones are limited, hindering our understanding of their molecular adaptation mechanisms.
Paracondylactis sinensis is a typical burrowing sea anemone and also an economically important species in China. However, due to overfishing and other anthropogenic disturbances, its wild populations have declined5. Our previous SSR-based analysis revealed signs of inbreeding in both northern and southern populations, highlighting the urgent need for conservation and artificial breeding efforts6. In this study, we present a chromosome-level genome assembly of P. sinensis, which provides a valuable resource for investigating its adaptive strategies and supports future studies on population genetics and resource management.
Beyond their ecological roles and economic value, sea anemones are recognized as a natural source of diverse bioactive polypeptides7. These include antimicrobial peptides used to defend against external pathogens and venom-derived toxins employed for prey capture—both of which hold promise for pharmaceutical applications such as infection control and wound healing8,9. Compared to sessile species, burrowing sea anemones like P. sinensis are exposed to more complex environment with low oxygen contents and distinct microbial communities. These unique conditions may drive the evolution of lineage-specific active compounds with novel biological activities uncharacterized before. The chromosome-level genome generated in this study provides a foundational resource for identifying such compounds and advancing bioprospecting and gene-mining efforts in this ecologically and pharmacologically valuable species.
Methods
Sample collection and DNA extraction
Samples of the sea anemones Paracondylactis sinensis were dug by hand during ebb at the coast of Taizhou, China (121°39.53′E, 28°20.12′N). The sampling procedures complied with relevant regulations of the Institutional Animal Care and Use Committee of the Institute of Oceanology, Chinese Academy of Science. After collection, they were transported to the lab and cultured in an aerated tank with temperature at 20°C and salinity at 33‰ (Fig. 1). One of the P. sinensis with active and healthy status was selected and rinsed with sterile seawater for subsequent DNA extraction. The body column of the sample was dissected with sterile scissors and forceps and then flash frozen in liquid nitrogen. DNA for PacBio HiFi sequencing was extracted using the MagAttract® HMW DNA Kit (Qiagen, Germany) according to the manufacturer’s instructions. The quantity, integrity and purity of the DNA were measured using a Qubit 3.0 fluorometer, pulsed-field gel electrophoresis (5–80 kV, 17 h) and a NanoDrop® instrument respectively. The value of A260/280 of the extracted DNA was 1.82 and a dominant high-molecular-weight DNA band larger than 48 kb was observed (Fig. 2), indicating its suitability for long-read sequencing. The extracted DNA was sent to Novogene Bioinformatics Technology Co., Ltd. (TianJin, China) for library construction and sequencing.
Paracondylactis sinensis buried in the sandy substrate of a laboratory culture tank.
Pulsed-field gel electrophoresis (PFGE) assessment of the extracted high-molecular-weight genomic DNA. Lane M: DNA size marker (10–48 kb); lane 1: the extracted DNA.
Illumina sequencing and genome survey
A paired-end DNA library for genome survey analysis was constructed with an insert size of 350 bp using a Rapid Plus DNA Lib Prep Kit for Illumina (ABclonal, USA). The library was subsequently sequenced on an Illumina NovaSeq X Plus sequencer (2 × 150 bp paired-end reads) (Illumina, USA). A total of 21.92 Gb data was generated. Data cleaning for the Illumina short reads was performed by Trimmomatic 0.36.310 with the parameters as follows: “ILLUMINACLIP:TruSeq. 3-PE-2.fa:2:30:10 LEADING:15 TRAILING:15 SLIDINGWINDOW:4:15 MINLEN:40”, which was followed by FastUniq 1.1 to remove duplicates11. Finally, 18.48 Gb clean data was obtained (Table 1). GenomeScope 2.0 and Jellyfish 2.2.312 were used for genome survey. The k-mer distribution with 17 nt was constructed using clean Illumina short-read data (Fig. 3). According to the formula genome size = knum/kdepth, where knum is the number of k-mers and kdepth is the expected depth of k-mers, the genome size was estimated to be 209.71 Mb, with a heterozygosity rate of 0.72% and a repeat rate of 31.47%.
17-mer frequency distribution in the Paracondylactis sinensis genome.
PacBio HiFi sequencing and Hi-C sequencing
For HiFi-read sequencing, a 15 kb SMRTbell library was constructed with a molecular weight of genomic DNA larger than 15 kb, following the instructions of the SMRTbell Express Template Prep Kit 3.0 (PacBio, USA). Fragment size distribution of the library detected by Agilent 5200 (Agilent, USA) revealed a dominant insertion peak at approximately 16 kb, with the majority of the fragments distributed between 10 and 40 kb (Fig. 4). The library was subsequently sequenced on a PacBio Revio system with a Revio SPRQ polymerase kit (PacBio, USA). A total of 8.34 Gb high-quality HiFi reads (39.77× genome coverage) were finally produced (Table 1). The length distribution of the HiFi reads showed a major peak at approximately 11 kb (Fig. 5), with a mean read length of 11,344 bp and an N50 at the length of 13,891 bp.
Quality check of the SMRTbell library by Agilent 5200.
Length distribution diagram of HiFi-reads.
For a haploid assembly, the sample used for Hi-C scaffolding was the same individual as one used for PacBio HiFi sequencing and Illumina sequencing. Dissected body column tissue was cross-linked (4% formaldehyde, 5 min, RT), quenched (0.20 M glycine, 1 min, RT), chilled ( > 15 min, ice), lysed (10 mM NaCl, 0.20% IGEPAL CA-630, 10 mM Tris–HCl, 1 × protease inhibitors), digested with DpnII, biotin-labeled, and end-repaired13. Library with insert size 350 bp was prepared with the Rapid Plus DNA Library Prep Kit for Illumina (ABclonal, USA) and sequenced on an Illumina NovaSeq X Plus sequencer (2 × 150 bp paired-end reads) (Illumina, USA). After quality control performed by Trimmomatic 0.36.310 and FastUniq 1.111, a total of 15.92 Gb Hi-C reads with 88.12× genome coverage was generated (Table 1).
Chromosome-level genome assembly
The genome was preliminarily assembled using hifiasm 0.20.014 with the generated PacBio HiFi reads under the parameters “-l 3 -s 0.5”. Sequencing contamination was screened by FCS-GX 0.5.515, and no sequences were detected to originate from foreign organisms. The contig-level genome assembly was 210.63 Mb in length, closely matching the estimated genome size (209.71 Mb) by genome survey analysis. The contig number of this initial assembly was 181, with contig N50 as long as 8.70 Mb (Table 2). The GC content of the assembly was 39.02%, which was close to that of the model sea anemone species Nematostella vectensis (40.72%; GCA_93252622516), suggesting a comparable base composition to published actiniarian genome.
Hi-C reads were used to scaffold the draft assembly to the chromosome level. Paired-end Hi-C reads were aligned to the contig-level genome using Chromap 0.2.317 with the “--preset hic” and “--remove-pcr-duplicates” options enabled. The resulting SAM alignments were sorted and converted to BAM format using SAMtools 1.1618. Contig clustering and scaffolding were performed with YaHS 1.2.219 under default parameters. The scaffolded assembly and Hi-C contact maps were further processed using the pre and post modules of Juicer Tools 1.22.0120 to generate.hic files and finalize chromosome-scale scaffolds through manual review. Finally a total of 196.80 Mb (93.44%) of the genome was anchored to 19 pseudo-chromosomes (Fig. 6, Table 3), with the sizes ranging from 5.40 Mb to 25.01 Mb. The scaffold N50 of the chromosome-level genome was 9.41 Mb (Table 2). The total length of the unanchored scaffolds was 13.83 Mb, comprising 147 sequences, of which 86 scaffolds (58.50%) were distributed between 30–60 kb (Fig. 7). These unplaced scaffolds lacked sufficient Hi-C linkage evidence for confident anchoring with contact map largely blank (Fig. 8), which may be attributable to repeat-associated multi-mapping and/or conflicting placements across pseudochromosomes that precluded reliable assignment to a unique chromosomal locus.
Circos plot of 19 pseudochromosomal linkage groups of Paracondylactis sinensis genome showing the marker distribution in 1 Mb sliding windows (from the outer to inner circle: GC content, gene density and repeat density; lines in the middle depict intrachromosomal synteny).
Length distribution of the unanchored scaffolds.
Unanchored scaffolds showed in the Hi-C contact map.
Annotation of the repetitive sequences
Repetitive sequences were annotated using a combination of homology-based and de novo approaches. A species-specific repeat library was built from the assembled genome using RepeatModeler 2.021 with the -LTRStruct option enabled. Homology-based annotation was performed with RepeatMasker 4.122 against the Actiniaria repeat database using the parameters of -e rmblast, -nolow, -s, -a, and -gff. De novo repeat annotation was then conducted by masking the genome with the species-specific library generated by RepeatModeler 2.0. Results from both homology-based and de novo runs were combined using the combineRMFiles.pl utility supplied with RepeatMasker 4.1 to produce a comprehensive repeat annotation. Finally, 26.43% of the genome at the length of 55.68 Mb was annotated as repeats. Among the repetitive sequences, 43.11 Mb were transposable elements (TEs), accounting for 20.47% of the genome (Table 4). The proportion of repeats in the genome of P. sinensis was similar to that of the model actiniarian species Exaiptasia diaphana (24.24%; GCA_02432226523) predicted in our previous study24.
Gene prediction and functional annotation
Gene structure prediction was performed by integrating evidence from transcriptome data, protein homology, and ab initio prediction. RNA-Seq data from different tissues of P. sinensis were downloaded (NCBI SRA accessions: SRR28341359, SRR28327126, SRR28327127, and SRR28327128)6 and aligned to the soft-masked genome using HISAT2 2.2.125 with default parameters. The alignments were sorted and indexed using SAMtools 1.1618. Gene prediction was then conducted with BRAKER3 3.0.326, which integrates GeneMark-ETP 1.00, AUGUSTUS 3.4.0, and extrinsic evidence. RNA-Seq alignments were incorporated via the --bam option, while protein homology information was provided using the --prot_seq option. Protein sequences from seven closely related sea anemone species—Actinostola sp.27, Alvinactis sp.28, Exaiptasia diaphana23, Nematostella vectensis29, Scolanthus callimorphus29, Actinia tenebrosa30, and Actinoscyphia liui24—were downloaded from NCBI and Figshare databases, which were used to guide evidence-based gene structure prediction. To further improve the structural annotation by incorporating untranslated regions (UTRs), PASA 2.5.331 was employed for transcript-based refinement. Transcripts assembled by trinity 2.15.132 were aligned to the genome using minimap2 2.3033, and integrated into the annotation framework in two rounds. The first round generated transcript alignments and preliminary structural models, while the second round updated the BRAKER3 gene models by adding UTR annotations and refining exon-intron boundaries based on transcript evidence. The representative transcript for each gene was then selected using AGAT 1.4.134. This integrative workflow produced a total of 19,420 genes (Table 5), with a gene length of 123,281,089 bp, exon length of 43,511,086 bp, CDS length of 30,009,737 bp, and intron length of 79,770,003 bp. After blasted in public databases (NR, SwissProt, eggNOG, KEGG, Pfam and GO), a total of 17,740 genes (91.35%) were functionally annotated (Table 6). The total number of predicted genes in P. sinensis (19,420) is comparable to that reported for other sea anemones, such as Nematostella vectensis with 19,231 predicted genes under accession number GCA_93252622516) and Anthopleura sola (GCA_02334942535,36) with 19,899 genes estimated in our previous study29, although this information was not reported in the original genome publication33.
Data Records
The sequencing data generated in this study have been deposited in the National Center for Biotechnology Information (NCBI) under BioProject number PRJNA1334318. Specifically, the Illumina, PacBio HiFi, and Hi-C datasets are available in the NCBI Sequence Read Archive under accession numbers SRR35627867, SRR35627868, and SRR3562786637, respectively. The chromosome-level assembly of Paracondylactis sinensis genome was deposited in the NCBI genome database under accession number GCA_05449177538 and the corresponding BioSample ID is SAMN51854516. The genome annotation results are accessible from the Figshare database (https://doi.org/10.6084/m9.figshare.30209509)39.
Technical Validation
The completeness of the chromosome-level genome was evaluated by benchmarking universal single-copy orthologue (BUSCO 5.4.7) analysis40, Core Eukaryotic Genes Mapping Approach (CEGMA 2.5)41 and a k-mer based approach implemented in Merqury 1.342. BUSCO analysis with the metazoan (odb10) gene set revealed that the genome of P. sinensis and its predicted genes covered 95.91% and 97.69% of the “core” metazoan genes, respectively (Table 2). CEGMA analysis indicated the genome assembly covered 96.37% of highly conserved proteins in eukaryotic genomes. K-mer based evaluation was performed using a 21-mer database constructed from Illumina short reads. The result showed a high consensus accuracy (QV = 47.45, i.e., an error rate of 1.80 × 10−5), but a relatively lower completeness of 87.82% for the assembly. The spectra-cn plot indicated that most missing k-mers (the “read-only” part) were concentrated around the heterozygous peak (Fig. 9a), potentially representing heterozygous regions in the haplotype assembly. To confirm this, the completeness of a diploid representation which included both haplotype-resolved assemblies generated by hifiasm 0.20.014 was evaluated. The k-mer completeness of the diploid increased to 99.31% and the “read-only” k-mers were not aggregated around primary or heterozygous peaks (Fig. 9b). This result indicated that the lower completeness of the primary assembly mainly reflects unrepresented allele-specific k-mers rather than widespread assembly errors.
Merqury copy number spectrum (spectra-cn) plots for haplotype (A) and diploid (B) assemblies of a Paracondylactis sinensis genome. Colors represent the different copy numbers of a certain k-mer found in the assembly. “read-only” means the k-mer found only in the Illumina reads.
The trimmed Illumina short reads and PacBio long reads were mapped against the assembled genome using BWA 0.7.1943. The mapping rates were 99.70% and 99.93%, respectively (Table 2), also indicating high accuracy of the assembly.
Data availability
The raw sequencing data are available in the NCBI Sequence Read Archive (SRA) database (https://identifiers.org/ncbi/insdc.sra:SRP627585), the genome assembly is available in NCBI GenBank (https://identifiers.org/ncbi/insdc.gca:GCA_054491775.1), and the genome and annotation files are available via Figshare (https://doi.org/10.6084/m9.figshare.30209509.v1).
Code availability
Genome survey:
(1) Trimmomatic: parameter: ILLUMINACLIP:TruSeq. 3-PE-2.fa:2:30:10 LEADING:15 TRAILING:15 SLIDINGWINDOW:4:15 MINLEN:40.
(2) FastUniq: parameter: -t q -c 1.
(3) Jellyfish:
count: parameter: -G 2 -m 17 -C.
histo: all parameters were set as default.
stats: all parameters were set as default.
(4) GenomeScope: parameter: -p 2 -k 17 -m 10000.
Contamination screen:
(1) FCS-GX: parameter: --tax-id 1 --div None --split-fasta true --action-report true.
Genome assembly:
(1) Hifiasm: parameter: -l 3.
(2) Chromap: parameter: --preset hic --remove-pcr-duplicates.
(3) Yahs: all parameters were set as default.
(4) Juicer: all parameters were set as default.
(5) Samtools: all parameters were set as default.
Genome annotation:
(1) RepeatModeler: parameter: BuildDatabase -name, -LTRStruct.
(2) RepeatMasker: parameter: -e rmblast -gff -s -a -nolow.
(3) Hisat2: all parameters were set as default.
(4) Trinity: parameter: --genome_guided_bam --genome_guided_max_intron 20000.
(5) BRAKER3: all parameters were set as default.
(6) PASA: parameter:
Launch_PASA_pipeline.pl -C -R --ALIGNERS minimap2.
Launch_PASA_pipeline.pl –A.
(7) AGAT: all parameters were set as default.
Completeness evaluation
(1) BUSCO (for genome assembly): parameter: -m genome -l metazoa_odb10.
(2) BUSCO (for translated protein sequence): parameter: -m prot -l metazoa_odb10.
(3) CEGMA: all parameters were set as default.
(4) Merqury: parameter: meryl count k = 21.
Read mapping:
(1) BWA:
short reads: all parameters were set as default.
PacBio HiFi reads: parameter: -x pacbio.
(2) Picard: all parameters were set as default.
(3) Samtools: all parameters were set as default.
References
Mangum, D. C. Burrowing behavior of the sea anemone Phyllactis. Biol. Bull. 138, 316–325 (1970).
Ivanova, N. Y. Classification and evolution of the burrowing sea anemones (Anthozoa: Actiniaria: Athenaria): a review of the past and current views. Zoosyst. Rossica 29, 213–237 (2020).
Wang, Y. et al. Comparison of the levels of bacterial diversity in freshwater, intertidal wetland, and marine sediments by using millions of illumina tags. Appl. Environ. Microbiol. 78, 8264–8271 (2012).
Sassaman, C. & Mangum, C. P. Adaptations to environmental oxygen levels in infaunal and epifaunal sea anemones. Biol. Bull. 143, 657–678 (1972).
Li, J. et al. The complete mitochondrial genome of an economic sea anemone (Paracondylactis sinensis) in the East China Sea. Mitochondrial DNA Part B. Resources 8, 977–980 (2023).
Li, J. et al. Development of microsatellite markers and evaluation of the genetic diversity of the edible sea anemone Paracondylactis sinensis (Cnidaria, Anthozoa) in China. Biodivers. Data J. 12, e134363 (2024).
Ramírez-Carreto, S., Miranda-Zaragoza, B., Simões, N., González-Muñoz, R. & Rodríguez-Almazán, C. Marine Bioprospecting: enzymes and stress proteins from the sea anemones Anthopleura dowii and Lebrunia neglecta. Mar. Drugs 22, 12 (2024).
Mizuno, M., Ito, Y. & Morgan, B. P. Exploiting the nephrotoxic effects of venom from the sea anemone, Phyllodiscus semoni, to create a hemolytic uremic syndrome model in the rat. Mar. Drugs 10, 1582–1604 (2012).
Grafskaia, E. N. et al. Discovery of novel antimicrobial peptides: A transcriptomic study of the sea anemone Cnidopus japonicus. J. Bioinf. Comput. Biol. 16, 1840006 (2018).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Xu, H. et al. FastUniq: A fast de novo duplicates removal tool for paired short reads. PLOS ONE 7, e52249 (2012).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics, 14 (2017).
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).
Astashyn, A. et al. Rapid and sensitive detection of genome contamination at scale with FCS-GX. Genome Biology 25, 60 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_932526225 (2022).
Zhang, H., Song, L. & Wang, X. Fast alignment and preprocessing of chromatin profiles with Chromap. Nat. Commun. 12, 6566 (2021).
Li, H. et al. The sequence alignment/map format and sAMtools. Bioinformatics 25, 2078–2079 (2009).
Zhou, C., McCarthy, S. A. & Durbin, R. YaHS: yet another Hi-C scaffolding tool. Bioinformatics 39, btac808 (2023).
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA. 117, 9451–9457 (2020).
Chen, N. Using Repeatmasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinf. 5, 4.10.11–14.10.14 (2004).
Shum, C. W. et al. Genome of the sea anemone Exaiptasia pallida and transcriptome profiles during tentacle regeneration. Front. Cell Dev. Biol. 10, 900321 (2022).
Li, J. et al. Chromosome-level genome assembly of a deep-sea Venus flytrap sea anemone sheds light upon adaptations to an extremely oligotrophic environment. Mol. Ecol. 33, e17504 (2024).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
Gabriel, L. et al. BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA. Genome Res. 34, 769–777 (2024).
Liu, C. et al. Whole genome sequencing of a novel sea anemone (Actinostola sp.) from a deep-sea hydrothermal vent. Sci. Data 11, 102 (2024).
Zhou, Y. et al. Genetic adaptations of sea anemone to hydrothermal environment. Sci. Adv. 9, eadh0474 (2023).
Putnam, N. H. et al. Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization. Science 317, 86–94 (2007).
Surm, J. M., Stewart, Z. K. & Papanicolaou, A. The draft genome of Actinia tenebrosa reveals insights into toxin evolution. Ecol. Evol. 9, 11314–11328 (2019).
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic. Acids Res. 31, 5654–5666 (2003).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Dainat, J. et al. AGAT: Another GFF analysis toolkit to handle annotations in any GTF, GFF format. Zenodo (2020).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_023349425 (2022).
Li, J. Y. Annotation files of five sea anemones (Actinoscyphia liui, Diadumene lineata, Anthopleura sola, Telmatactis stephensoni and Actinia tenebrosa). figshare https://doi.org/10.6084/m9.figshare.25966645.v4 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP627585 (2026).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_054491775.1 (2026).
Li, J. Y. The genome and annotation file of sea anemone Paracondylactis sinensis. figshare https://doi.org/10.6084/m9.figshare.30209509.v1 (2026).
Simão, F., Waterhouse, R. M., Panagiotis, I., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics, 3210–3212 (2015).
Parra, G., Bradnam, K. & Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067 (2007).
Rhie, A. et al. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Acknowledgements
This work was supported by Shandong Provincial Natural Science Foundation (No. ZR2025MS656), the Taizhou Science and Technology Plan Project (No. 25nyb03), and National Natural Science Foundation of China (No. 32570619). Thanks are given to the Oceanographic Data Center, IOCAS, for providing computing power.
Author information
Authors and Affiliations
Contributions
Junyuan Li: Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization and Writing. Rongye Tang: Conceptualization, Investigation, Project administration, Resources, Supervision, and Validation. Juan Feng: Investigation, Resources and Validation. Tinghui Xie: Resources and Validation. Sitong Liu: Software. Yang Li: Conceptualization, Validation,Funding acquisition, Project administration, Resources, Supervision,Writing -review& editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Li, J., Tang, R., Feng, J. et al. Chromosome-scale genome of the burrowing sea anemone Paracondylactis sinensis. Sci Data 13, 457 (2026). https://doi.org/10.1038/s41597-026-06838-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-026-06838-8










