The high-quality genome sequence of Agroathelia rolfsii (Syn. Sclerotium rolfsii), the causal agent of southern blight in Coptis chinensis

You, Jingmao; You, Lei; Tang, Tao; Wang, Fanfan; Huang, Houyun; Duan, Yuanyuan; Zheng, Lanlan; Wang, Xiaoyue; Zhou, Wuxian; Zhang, Yonghong

doi:10.1038/s41597-025-06137-8

Download PDF

Data Descriptor
Open access
Published: 19 November 2025

The high-quality genome sequence of Agroathelia rolfsii (Syn. Sclerotium rolfsii), the causal agent of southern blight in Coptis chinensis

Jingmao You^1,2^na1,
Lei You⁴^na1,
Tao Tang¹,
Fanfan Wang¹,
Houyun Huang³,
Yuanyuan Duan¹,
Lanlan Zheng⁴,
Xiaoyue Wang¹,
Wuxian Zhou¹ &
…
Yonghong Zhang⁴

Scientific Data volume 12, Article number: 1833 (2025) Cite this article

2126 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Agroathelia rolfsii (A. rolfsii) is a globally distributed soilborne fungal pathogen that infects a wide range of crops, causing basal stem rot and yield losses. In this study, we report a high-quality genome assembly of A. rolfsii strain LC1 (43.3 Mb, N50 = 3.3 Mb, GC content = 46.3%) using integrated PacBio long-read and Illumina short-read sequencing technologies. Genome annotation identified 8,826 protein-coding genes, with 30.93% comprising repetitive sequences. Benchmarking Universal Single-Copy Ortholog (BUSCO) analysis demonstrated 97.7% genome completeness, supported by a high QV value of 44, with functional annotations covering 91.1% of the genes. A total of 1,260 carbohydrate-active enzyme (CAZyme) genes, predominantly glycoside hydrolases (37.7%), and 22 secondary metabolite biosynthetic gene clusters were identified, indicating strong host tissue degradation capability and diverse virulence factors. Phylogenetic analysis supported genetic diversity within the Agroathelia genus. This study provides key genomic resources for understanding the pathogenic mechanisms, host interactions, and resistance gene discovery of A. rolfsii, laying the foundation for the development of virulence-targeted control strategies.

A haplotype-resolved genome assembly of Anoectochilus roxburghii

Article Open access 03 December 2025

Hybrid de novo genome-reassembly reveals new insights on pathways and pathogenicity determinants in rice blast pathogen Magnaporthe oryzae RMg_Dl

Article Open access 25 November 2021

Modeling global habitat suitability of Agroathelia rolfsii causing Sclerotium wilt disease of tomato with emphasis on Ghana

Article Open access 30 September 2025

Background & Summary

The fungal pathogen Agroathelia rolfsii poses a significant threat to global agriculture, particularly in tropical and subtropical regions¹. It affects over 500 plant species, including economically important crops such as peanut, tomato, and cotton^2,3,4. Its asexual stage, Sclerotium rolfsii, is characterized by the production of highly resilient sclerotia, which can survive under adverse environmental conditions⁵. Once infected, plants exhibit basal stem and root rot, leading to wilting and, in severe cases, total crop failure. Yield losses can reach to 30% ~50%, making A. rolfsii a critical target for plant disease management research^6,7.

While chemical fungicides, such as tebuconazole and flutolanil, have been used to control A. rolfsii, their prolonged application has raised concerns over environmental pollution and the development of fungicide resistance^8,9. Biological control strategies, including the use of antagonistic microorganisms like Trichoderma spp., have shown potential but may lack consistency and universal applicability in field conditions¹⁰. Furthermore, the pathogen’s broad host range and strong ecological adaptability complicate efforts to develop effective and sustainable control measures.

In recent years, the rapid advancement of high-throughput sequencing technologies has provided powerful tools for studying fungal pathogens. Genomic studies have revealed virulence-related gene families in many plant-pathogenic fungi, including glycoside hydrolases, peptidoglycanases, and secondary metabolite biosynthetic gene clusters, which play key roles in host infection and tissue colonization^11,12. Comparative genomic analyses have also provided insights into the genetic basis of pathogenicity, host adaptation, and environmental resilience, helping to identify key genes involved in fungal survival and infection strategies¹³. The emergence of genome-editing technologies, such as CRISPR-Cas9, has opened new possibilities for enhancing plant resistance by modifying key susceptibility genes^14,15. Additionally, genomic information has facilitated the identification of novel targets for disease control, including genes essential for fungal virulence and survival¹⁶.

Studies on closely related pathogens, such as Sclerotium delphinii, which causes southern blight in Dendrobium officinale, provide additional context¹⁷. Genomic analyses of S. delphinii have revealed a significantly higher abundance of glycoside hydrolases (GHs) compared to A. rolfsii, which may explain its enhanced virulence. Furthermore, the identification of secondary metabolite gene clusters and pathogen-host interaction (PHI) factors in S. delphinii highlights the diverse strategies employed by fungal pathogens during infection. These findings emphasize the importance of comparative studies in understanding the molecular basis of fungal pathogenicity.

Here, we assembled a high-quality genome of A. rolfsii using an integrated approach that combined PacBio Single Molecule Real-Time (SMRT) long-read sequencing and Illumina short-read sequencing technologies. The quality of the genome assembly and annotation was evaluated based on the contiguity of assembled sequences (N50) and the completeness of conserved protein-coding genes. Protein functional annotations were performed by aligning the sequences against multiple databases, including Nr, SwissProt, KEGG, and KOG, ensuring comprehensive coverage. Furthermore, evolutionary insights were gained through comparative analyses with genomes of other Agroathelia species. This study provides a valuable genomic resource for advancing research on the pathogenicity, genetic diversity, and evolutionary dynamics of the Agroathelia genus.

Methods

Fungal strains

The A. rolfsii strain LC1 was isolated from the leaf of Coptis chinensis, and is stored at the Institute of Chinese Herbal Medicines, Hubei Academy of Agricultural Sciences, China. For fugal growth, the strain was incubated on solid potato dextrose agar (PDA) at 28 °C in the dark (Fig. 1).

DNA extraction and genome sequencing

Total genomic DNA was extracted using the GP1 method (Novogene, Beijing, China) following the manufacturer’s instructions. The purity of the DNA was assessed using a NanoPhotometer® spectrophotometer (IMPLEN, CA, USA), and DNA concentration was quantified using the Qubit® DNA Assay Kit with a Qubit® 2.0 Fluorometer (Life Technologies, CA, USA). For Illumina short-read sequencing, a DNA library with a 350 bp insert size was prepared using the TruSeq Nano DNA HT Sample Preparation Kit (Illumina, USA) in accordance with the manufacturer’s recommendations. Sequencing was performed on the Illumina HiSeq X Ten platform with a paired-end read length of 2 × 150 bp, generating 2.3 Gb of Illumina data, corresponding to approximately 53 × genome coverage. For PacBio long-read sequencing, a sequencing library was constructed and sequenced on the Sequel II System platform. High-quality long reads were obtained using the Guppy basecalling software with default parameters, yielding 8.06 Gb of PacBio sequencing data, which represents approximately 186 × genome coverage.

Genome assembly

The genome size of A. rolfsii was first inferred from a 15-mer distribution using jellyfish software (v2.3.0) based on Illumina paired-end reads¹⁸, and the genome size was estimated to be 41.18 Mb (Fig. 2a; Table 1). For de novo genome assembly, PacBio long reads corrected with Falcon (v1.8.7)¹⁹ were assembled to generate an initial assembly by Wtdbg2 with the following parameters: -x sq and -g 44 m. The PacBio sequencing data provides significant advantages for genome assembly due to its long-read lengths, which simplify the complexity of assembly algorithms by enabling overlap-based assembly. To further enhance the accuracy and completeness of the assembly, Pilon (v1.23)²⁰ was utilized to perform three iterative rounds of base correction using high-quality Illumina short reads with default settings. To the end, the assembled genome of A. rolfsii was found to be 43.3 Mb in length, with a Contig N50 of 3.3 Mb and a GC content of 46.3% (Fig. 2b; Table 1). The assembled genome consists of 50 Contigs (Table 1).

Table 1 Statistics of the Agroathelia rolfsii genome assembly.

Full size table

Genome annotation

Using RepeatModeler v1.0.8 (http://www.repeatmasker.org/RepeatModeler/), a de novo repeat library was generated and subsequently combined with the Repbase library v23.09 (http://www.girinst.org/repbase/). The merged library was then used for repeat identification with RepeatMasker v4.1.2-p1. This analysis identified 13.38 Mb of repetitive sequences, which accounted for 30.93% of the genome. Among these, long terminal repeats (LTRs; 27.56%) and DNA elements (1.6%) were the most abundant repetitive components, with unclassified repeats also making a significant contribution.

The repeat-masked A. rolfsii genome was annotated using Funannotate (http://github.com/nextgenusfs/funannotate), an automated pipeline designed for fungi but also applicable to other eukaryotes. Protein evidence was incorporated from curated UniProtKB/SwissProt datasets included in Funannotate. Gene prediction employed multiple tools integrated within the pipeline, including including Augustus v3.3.3²¹, GeneMark-ES v4.32²², SNAP v2006-07-28²³, and GlimmerHMM v3.0.4²⁴ (all ab initio predictors), as well as CodingQuarry v2.0²⁵, which utilizes transcriptome data for gene model inference. The combined results yielded a consensus gene model comprising 8,826 protein-coding genes, with an average length of 2,504 bp and a maximum length of 15,609 bp. A total of 7,907 (89.6%) of which were successfully annotated in at least one database (Table 2). Similarly, we utilized several complementary software tools to predict non-coding RNAs in the A. rolfsii genome, including RNAmmer for rRNA prediction, tRNAscan-SE for identifying tRNA regions and secondary structures, and cmscan (with default parameters) for detecting sRNAs and miRNAs by aligning sequences to the Rfam database. A total of 209 ncRNAs were identified in the A. rolfsii genome, including 160 tRNAs, 41 rRNAs (13 5S, 12 28S, and 16 18S), 1 sRNA, and 7 miRNAs.

Table 2 Functional annotation of Agroathelia rolfsii proteins.

Full size table

Identification of CAZymes and biosynthetic gene cluster

The putative carbohydrate-active enzymes (CAZymes) in A. rolfsii were annotated using the dbCAN server (dbCAN HMMs 5.0)²⁶, applying thresholds of an e-value lower than 1e-5 and at least 70% sequence coverage. CAZymes were categorized based on the modules outlined in the CAZyme database classification (http://www.cazy.org). The A. rolfsii genome encodes a total of 1,260 genes for putative CAZymes, which are distributed across six functional categories. Among these, glycoside hydrolases (475 genes, 37.7%) were the most prevalent, followed by glycosyltransferases (331 genes, 26.3%) (Table 3). Polysaccharide lyases were the least represented, with only one gene identified (Table 3). To explore secondary metabolism, the antiSMASH 4.0 tool was employed for identifying gene clusters within the A. rolfsii genome. A total of 22 clusters associated with secondary metabolites were predicted, including four terpene clusters, three type I polyketide synthase clusters, and one non-ribosomal peptide synthase cluster (Fig. 3a).

Table 3 Identification of CAZymes in the Agroathelia rolfsii genome.

Full size table

Data Record

All sequencing data, including the raw PacBio long reads and Illumina short reads, have been deposited in the Genome Sequence Archive (GSA²⁷) in National Genomics Data Center²⁸ under BioProject PRJCA042508. The PacBio raw data are available under accession CRA027430²⁹, and the Illumina raw data under CRA027417³⁰. The assembled genome has been made available in GenBank with JBQWDA000000000³¹. Additionally, the annotation data has been deposited at the Figshare repository³².

Data Overview

Protein sequences of 9 genomes were used for comparative genomics and phylogenomic analyses: A. rolfsii (this study), A. delphinii SD strain (GCA_041475395.1), A. rolfsii GP3 isolate (GCA_018343915.1), A. rolfsii Sr-RCBB strain (GCA_002940785.1), A. rolfsii CCF1 strain (GCA_036172465.1), A. rolfsii ZY isolate (GCA_018343895.1), A. rolfsii MR10 strain (GCA_000961905.2), Fusarium oxysporum Fo5176 strain (GCA_030345115.2), Fusarium solani F421 strain (GCA_040436825.1). The OrthoFinder (v2.5.1)³³ with default settings was used to identify orthologous and paralogous genes. All protein sequences were aligned with MAFFT (v7.123b)³⁴ and trimmed by trimAl (v1.4.rev22)³⁵. The sequences were concatenated into a gene matrix and used for phylogenetic analysis (Fig. 3b). A phylogenetic tree was constructed by maximum likelihood (ML) method using IQ-TREE (v2.1.2)³⁶, with 1,000 ultrafast bootstrap replicates. The best model (JTT + F + R6) was determined by ModelFinder³⁷ and two Fusarium species were used as outgroups. A total of 758 single-copy orthogroups were identified and extracted for phylogenetic analysis (Fig. 3b). Among the analyzed strains, F. oxysporum and F. solani exhibit the highest gene counts, with 11,419 and 10,495 genes, respectively, forming an outgroup relative to the Agroathelia species. The Agroathelia rolfsii strain analyzed in this study contains 6,921 genes and clusters closely with A. delphinii (SD strain, 9,877 genes). Within A. rolfsii, notable gene count variations are observed, ranging from 4,051 genes in the GP3 isolate to 10,816 genes in the ZY isolate.These results highlight substantial genomic diversity within A. rolfsii, which may be attributed to differences in ecological adaptation, host specificity, or pathogenicity potential.

Technical Validation

Quality control of PacBio long reads was performed using Chopperv1.5.0 (http://gitcode.com/gh_mirrors/cho/chopper), resulting in 606,486 reads with an N50 length of 26 kb. Illumina short reads were processed using Fastp³⁸ to remove low-quality reads, generating a total of 15,321,002 high-quality reads for subsequent analysis. The completeness of the genome was assessed using BUSCO (Benchmarking Universal Single-Copy Orthologs) v5.5.0 with the basidiomycota_odb10 database (genome mode)³⁹. The evaluation yielded the following BUSCO statistics: 97.7% complete, 96.8% complete single-copy, 0.3% duplicated, 0.6% fragmented, and 2.3% missing (Table 4). The annotated and integrated proteins were further evaluated using BUSCO v5.5.0 with the basidiomycota_odb10 database (protein mode), which showed a complete BUSCO score of 88.6% (Table 5). Additionally, 89.6% of the genes were functionally annotated across four major databases (Nr, SwissProt, KEGG, and KOG) (Table 2), and future improvements will involve refining the genome annotation with full-length transcriptome data using the cDNA_Cupcake toolkit (https://github.com/Magdoll/cDNA_Cupcake). The genome assembly quality value (QV value) was assessed using Merqury (http://github.com/marbl/merqury), a k-mer-based tool designed for evaluating genome assembly accuracy and completeness without a reference genome. The process involved comparing the k-mers (specifically 21-mers) from short-read sequencing data with those from the assembled genome. The assessment yielded a high QV value of 44, reflecting the high base accuracy of the assembly.

Table 4 Busco of the Agroathelia rolfsii genome.

Full size table

Table 5 Busco of the Agroathelia rolfsii protein-coding genes.

Full size table

Data availability

All sequencing data generated in this study have been deposited in public repositories. The raw PacBio long reads are available in the Genome Sequence Archive (GSA)²⁷ at the National Genomics Data Center²⁸ under accession number CRA027430 (BioProject PRJCA042508)²⁹. The raw Illumina short reads are available under accession number CRA027417³⁰. The assembled genome has been deposited in GenBank under accession number JBQWDA000000000³¹. The annotation data are accessible at the Figshare repository³².

Code availability

This study did not involve the use of custom scripts or customized command lines. All comparative analyses were conducted using publicly available software. Detailed information on the software versions and parameters can be found in the Methods section. For any software tools where specific parameters are not specified, default settings were applied.

References

Patra, G. K., Acharya, G. K., Panigrahi, J., Mukherjee, A. K. & Rout, G. R. The soil-borne fungal pathogen Athelia rolfsii: past, present, and future concern in legumes. Folia Microbiol (Praha) 68, 677–690, https://doi.org/10.1007/s12223-023-01086-4 (2023).
Article CAS PubMed Google Scholar
Shukla, V., Kumar, S., Tripathi, Y. N. & Upadhyay, R. S. Bacillus subtilis- and Pseudomonas fluorescens-Mediated Systemic Resistance in Tomato Against Sclerotium rolfsii and Study of Physio-Chemical Alterations. Front Fungal Biol 3, 851002, https://doi.org/10.3389/ffunb.2022.851002 (2022).
Article PubMed PubMed Central Google Scholar
Wei, X., Langston, D. B. Jr. & Mehl, H. L. Comparison of Current Peanut Fungicides Against Athelia rolfsii Through a Laboratory Bioassay of Detached Plant Tissues. Plant Dis 106, 2046–2052, https://doi.org/10.1094/pdis-12-21-2789-re (2022).
Article CAS PubMed Google Scholar
Yi, R. H., Liao, H. Z., Li, K. Y. & Feng, F. Fruit rot causing by Athelia rolfsii (Sclerotium rolfsii) on jackfruit (Artocarpus heterophyllus) in China. Plant Dis, https://doi.org/10.1094/pdis-09-22-2250-pdn (2023).
Yu, D. et al. Progress on pathogenicity differentiation in Sclerotium rolfsii isolates from peanut. Chinese Journal of Oil Crop Sciences 44, 930–936 (2022).
Google Scholar
Kumar, V. & Thirumalaisamy, P. Diseases of groundnut. Disease of field crops and their management. Indian Phytopathological Society, Today and Tomorrow’s Printers and Publishers, New Delhi, 459-487 (2016).
Chen, K. et al. Research progress on peanut southern stem rot caused by Sclerotium rolfsii. Chin J Oil Crop Sci 40, 302–308 (2018).
Google Scholar
Shim, M. Y., Starr, J. L., Keller, N. P., Woodard, K. E. & Lee, T. A. Jr. Distribution of Isolates of Sclerotium rolfsii Tolerant to Pentachloronitrobenzene in Texas Peanut Fields. Plant Dis 82, 103–106, https://doi.org/10.1094/pdis.1998.82.1.103 (1998).
Article PubMed Google Scholar
Franke, M. D., Brenneman, T. B., Stevenson, K. L. & Padgett, G. B. Sensitivity of Isolates of Sclerotium rolfsii from Peanut in Georgia to Selected Fungicides. Plant Dis 82, 578–583, https://doi.org/10.1094/pdis.1998.82.5.578 (1998).
Article CAS PubMed Google Scholar
Singh, S. et al. Harnessing Trichoderma Mycoparasitism as a Tool in the Management of Soil Dwelling Plant Pathogens. Microb Ecol 87, 158, https://doi.org/10.1007/s00248-024-02472-2 (2024).
Article ADS PubMed PubMed Central Google Scholar
Li, Y. et al. Fungal acetylome comparative analysis identifies an essential role of acetylation in human fungal pathogen virulence. Commun Biol 2, 154, https://doi.org/10.1038/s42003-019-0419-1 (2019).
Article PubMed PubMed Central Google Scholar
Deng, K., Zhang, Y., Lv, S., Zhang, C. & Xiao, L. Decoding Pecan’s Fungal Foe: A Genomic Insight into Colletotrichum plurivorum Isolate W-6. Journal of Fungi 11, 203 (2025).
Article CAS PubMed PubMed Central Google Scholar
Xia, C. et al. Current Status and Future Perspectives of Genomics Research in the Rust Fungi. Int J Mol Sci 23, https://doi.org/10.3390/ijms23179629 (2022).
Song, R. et al. CRISPR/Cas9 genome editing technology in filamentous fungi: progress and perspective. Appl Microbiol Biotechnol 103, 6919–6932, https://doi.org/10.1007/s00253-019-10007-w (2019).
Article CAS PubMed PubMed Central Google Scholar
Xie, Z. et al. Genome editing in the edible fungus Poria cocos using CRISPR-Cas9 system integrating genome-wide off-target prediction and detection. Front Microbiol 13, 966231, https://doi.org/10.3389/fmicb.2022.966231 (2022).
Article PubMed PubMed Central Google Scholar
Case, N. T. et al. Fungal impacts on Earth’s ecosystems. Nature 638, 49–57, https://doi.org/10.1038/s41586-024-08419-4 (2025).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, Y. J. et al. Whole-genome sequence of Sclerotium delphinii, a pathogenic fungus of Dendrobium officinale southern blight. Genomics 116, 110932, https://doi.org/10.1016/j.ygeno.2024.110932 (2024).
Article CAS PubMed Google Scholar
Marcais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770, https://doi.org/10.1093/bioinformatics/btr011 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chin, C. S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat Methods 13, 1050–1054, https://doi.org/10.1038/nmeth.4035 (2016).
Article CAS PubMed PubMed Central Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9, e112963, https://doi.org/10.1371/journal.pone.0112963 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Stanke, M., Diekhans, M., Baertsch, R. & Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 24, 637–644, https://doi.org/10.1093/bioinformatics/btn013 (2008).
Article CAS PubMed Google Scholar
Lomsadze, A., Burns, P. D. & Borodovsky, M. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm. Nucleic Acids Res 42, e119, https://doi.org/10.1093/nar/gku557 (2014).
Article CAS PubMed PubMed Central Google Scholar
Korf, I. Gene finding in novel genomes. BMC bioinformatics 5, 59 (2004).
Article PubMed PubMed Central Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879, https://doi.org/10.1093/bioinformatics/bth315 (2004).
Article CAS PubMed Google Scholar
Testa, A. C., Hane, J. K., Ellwood, S. R. & Oliver, R. P. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts. BMC Genomics 16, 170, https://doi.org/10.1186/s12864-015-1344-4 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zheng, J. et al. dbCAN3: automated carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res 51, W115–w121, https://doi.org/10.1093/nar/gkad328 (2023).
Article CAS PubMed PubMed Central Google Scholar
Chen, T. et al. The Genome Sequence Archive Family: Toward Explosive Data Growth and Diverse Data Types. Genomics Proteomics Bioinformatics 19, 578–583, https://doi.org/10.1016/j.gpb.2021.08.001 (2021).
Article PubMed PubMed Central Google Scholar
Members, C.-N. & Partners. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023. Nucleic Acids Res 51, D18–D28, https://doi.org/10.1093/nar/gkac1073 (2023).
Article CAS Google Scholar
National Genomics Data Center https://ngdc.cncb.ac.cn/gsa/browse/CRA027430 (2025).
National Genomics Data Center https://ngdc.cncb.ac.cn/gsa/browse/CRA027417 (2025).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_052928445.1 (2025).
You, J. et al. High-Quality Genome Assembly and Annotation of Athelia rolfsii LC-1. figshare https://doi.org/10.6084/m9.figshare.30018793.v1 (2025).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20, 238, https://doi.org/10.1186/s13059-019-1832-y (2019).
Article PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30, 772–780, https://doi.org/10.1093/molbev/mst010 (2013).
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973, https://doi.org/10.1093/bioinformatics/btp348 (2009).
Article CAS PubMed PubMed Central Google Scholar
Minh, B. Q. et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol 37, 1530–1534, https://doi.org/10.1093/molbev/msaa015 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14, 587–589, https://doi.org/10.1038/nmeth.4285 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890, https://doi.org/10.1093/bioinformatics/bty560 (2018).
Article CAS PubMed PubMed Central Google Scholar
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Mol Biol Evol 38, 4647–4654, https://doi.org/10.1093/molbev/msab199 (2021).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was supported by the Hubei Key Research and Development Program [2024BBB090]; the Science Research Program of Key Laboratory of Biology and Cultivation of Chinese Herbal Medicines, Ministry of Agriculture and Rural Affair [BZ2024001]; the Advantages Discipline Group (Biology and Medicine) Project in Higher Education of Hubei Province (2021–2025) [2022BMXKQT4] at Hubei University of Medicine; the National Nature Science Foundation of China [32200680]; the Science Research Program of Hubei Provincial Department of Education [T2023016]; the Joint supported by Hubei Provincial Natural Science Foundation and Shiyan-of China [2025AFD179, 2025AFD205, 2025AFD172] and the Young Top-notch Talent Cultivation Program of Hubei Province.

Author information

These authors contributed equally: Jingmao You, Lei You.

Authors and Affiliations

Institute of Chinese Herbal Medicines, Hubei Academy of Agricultural Sciences, Enshi, 445000, China
Jingmao You, Tao Tang, Fanfan Wang, Yuanyuan Duan, Xiaoyue Wang & Wuxian Zhou
Key Laboratory of Biology and Cultivation of Chinese Herbal Medicines, Ministry of Agriculture and Rural Affairs, Enshi, 445000, China
Jingmao You
The Plant Germplasm Resources Laboratory, School of Forestry and Horticulture, Hubei Minzu University, Enshi, 445000, China
Houyun Huang
Shiyan Key Laboratory of Medicinal Plants and Evolutionary Genetics, Hubei Key Laboratory of Wudang local Chinese Medicine Research, Academy of Bio-Medicine Research, School of Basic Medicine, Hubei University of Medicine, Shiyan, 442000, China
Lei You, Lanlan Zheng & Yonghong Zhang

Authors

Jingmao You
View author publications
Search author on:PubMed Google Scholar
Lei You
View author publications
Search author on:PubMed Google Scholar
Tao Tang
View author publications
Search author on:PubMed Google Scholar
Fanfan Wang
View author publications
Search author on:PubMed Google Scholar
Houyun Huang
View author publications
Search author on:PubMed Google Scholar
Yuanyuan Duan
View author publications
Search author on:PubMed Google Scholar
Lanlan Zheng
View author publications
Search author on:PubMed Google Scholar
Xiaoyue Wang
View author publications
Search author on:PubMed Google Scholar
Wuxian Zhou
View author publications
Search author on:PubMed Google Scholar
Yonghong Zhang
View author publications
Search author on:PubMed Google Scholar

Contributions

J.Y. and Y.Z. designed the study; L.Y., T.T., F.W. and H.H. assembled, annotated and analyzed the genome; J.Y., Y.D., X.W. and W.Z. drafted the manuscript; Y.Z. and L. Z. revised the manuscript.

Corresponding authors

Correspondence to Jingmao You or Yonghong Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

You, J., You, L., Tang, T. et al. The high-quality genome sequence of Agroathelia rolfsii (Syn. Sclerotium rolfsii), the causal agent of southern blight in Coptis chinensis. Sci Data 12, 1833 (2025). https://doi.org/10.1038/s41597-025-06137-8

Download citation

Received: 09 July 2025
Accepted: 13 October 2025
Published: 19 November 2025
Version of record: 19 November 2025
DOI: https://doi.org/10.1038/s41597-025-06137-8