Haplotype-resolved genomes of Trichophyton mentagrophytes and Trichophyton tonsurans

Feng, Sijie; Wang, Zhenhui; Lin, Kainan; Wang, Kun; Zheng, Shuting; Wang, Qianqian; Lin, Lianyu; Lu, Yunkun

doi:10.1038/s41597-025-04835-x

Download PDF

Data Descriptor
Open access
Published: 10 April 2025

Haplotype-resolved genomes of Trichophyton mentagrophytes and Trichophyton tonsurans

Sijie Feng^1,2^na1,
Zhenhui Wang¹^na1,
Kainan Lin²^na1,
Kun Wang¹,
Shuting Zheng¹,
Qianqian Wang²,
Lianyu Lin³ &
…
Yunkun Lu ORCID: orcid.org/0000-0001-7605-1259²

Scientific Data volume 12, Article number: 559 (2025) Cite this article

2166 Accesses
Metrics details

Subjects

Abstract

Dermatophytes have posed a significant health concern due to their ability to parasitize human and animal skin, hair, and nails, causing a spectrum of dermatological conditions. However, the absence of high-quality genomes hinders our understanding of the dermatophytes. In this study, we utilized the circular consensus sequencing (CCS) technology to generate haplotype-resolved, nearly-complete genomes for two representative dermatophytes, Trichophyton mentagrophytes and Trichophyton tonsurans. Total sizes of the genomes ranged from 23.8 Mb to 25.2 Mb, with the contig N50 lengths of 6.47 Mb and 12.65 Mb, respectively. Each genome assembly was gapless and possessed three pseudochromosomes, with two achieving telomere-to-telomere (T2T) level. BUSCO analysis of the assemblies revealed approximately 99% of genome completeness. More than 7500 protein-coding genes were identified, and over 99% of the genes were well annotated through multiple gene function databases. Approximately 10% of the genomes were covered by repeats, particularly retrotransposons. Our findings provided valuable genomic resources of dermatophytes, paving the way for developing more effective medical interventions and public health strategies against Trichophyton infections.

Chromosomal-level genome assembly of Trichogramma japonicum (Hymenoptera: Trichogrammatidae)

Article Open access 02 September 2025

Chromosome-level genome assembly for clubrush (Scirpus × mariqueter) endemic to China

Article Open access 22 May 2025

Two near-chromosomal-level genomes of globally-distributed Macroascomycete based on single-molecule fluorescence and Hi-C methods

Article Open access 04 September 2024

Background & Summary

Trichophyton is a genus of dermatophytes, a group of fungi that are parasitic on the skin, hair, and nails of humans and animals. These fungi are renowned for their keratolytic abilities, allowing them to break down keratin, a key structural protein found in skin, hair, and nails¹. They can cause a variety of clinical manifestations, including tinea capitis (scalp ringworm), tinea corporis (body ringworm), tinea pedis (athlete’s foot), and onychomycosis (nail infection). The infections are generally acquired through direct contact with an infected individual or animal, or indirectly through contaminated surfaces or objects².

Trichophyton belongs to the family Arthrodermataceae and is classified within the phylum Ascomycota. Some of the most well-known species in this genus include Trichophyton rubrum, Trichophyton mentagrophytes, Trichophyton tonsurans, and Trichophyton verrucosum^3,4. T. mentagrophytes and T. tonsurans are two species that have significant medical importance⁵. T. mentagrophytes is known to cause a wide range of dermatophytoses, including tinea corporis and tinea pedis. It is also a common cause of ringworm in small mammals, which can act as a reservoir for human infections^6,7. T. tonsurans is a leading cause of tinea capitis, particularly in children, and has become increasingly resistant to conventional antifungal treatments⁸.

To date, nearly all available Trichophyton genomes are fragmented into contigs or scaffolds, with none achieving chromosome-level resolution. For instance, prior assemblies of T. mentagrophytes and T. tonsurans exhibited contig N50 values below 400 kb and lacked telomere-to-telomere (T2T) continuity (Table 1), which hindered precise gene synteny analysis, repeat element characterization, and haplotype-specific investigations. These limitations impede the identification of virulence factors, antifungal resistance markers, and species-specific genomic features critical for diagnostics and therapeutics.

Table 1 Comparison of assembly quality with the existing genomes of T. mentagrophytes and T. tonsurans.

Full size table

To address this gap, we employed PacBio HiFi circular consensus sequencing (CCS) technology to generate long-read data (average read length ~20 kb, N50 > 19 kb) for T. mentagrophytes and T. tonsurans, implying the reliability of sequencing data. Using Hifiasm, we achieved gapless, haplotype-resolved assemblies for both species. The genome sizes of T. mentagrophytes and T. tonsurans span 24.48 Mb and 25.24 Mb, respectively, with contig N50 values of 6.47 Mb and 12.65 Mb. Each assembly comprises three pseudochromosomes (accounting for 95.58–96.67% genome coverage), two of which are fully T2T with canonical telomeric repeats (TAACCC/GGGTTA) at both ends. BUSCO analysis confirmed exceptional completeness (99% for fungi_odb10), while HiFi read remapping rates (>99.99%) validated assembly accuracy. Notably, our annotations revealed 7,577 (T. mentagrophytes) and 7,519 (T. tonsurans) protein-coding genes, with >99% functionally annotated through GO, KEGG, and Swiss-Prot databases. We further identified lineage-specific repeat expansions: T. tonsurans harbors abundant LINE retrotransposons (2.29% of the genome), absent in T. mentagrophytes, suggesting post-speciation transposable element activity. These chromosome-scale assemblies provide the first comprehensive view of Trichophyton genome architecture, enabling precise comparative genomics, and pathogenicity island mapping. This resource is pivotal for developing targeted antifungals, molecular diagnostics, and understanding mechanisms underlying drug resistance and host adaptation.

Methods

Sample collection and fungal cultivation

The clinical isolates of T. mentagrophytes and T. tonsurans were harvested from School of Medicine, Henan Polytechnic University, Jiaozuo city, Henan Province, P. R. China. For fungal cultivation, the isolates were cultured in Sabouraud Dextrose Broth (SDB) prepared using (40 g Dextrose (Glucose), 10 g Peptone, 1000 mL Distilled water and pH adjusted to 5.6) for 5–7 days of incubation at 28 °C in a shaking incubator with steady speed of 100 rpm. The mycelium was filtered out using a sterilized cheesecloth and rinse off culture media with sterilized distilled water.

DNA extraction and sequencing

High-quality genomic DNA of T. mentagrophytes and T. tonsurans was extracted from hyphae using the modified CTAB⁹. Specifically, about 0.2–0.5 g of harvested mycelium was weighed and frozen in liquid nitrogen. The frozen samples were transferred into pre-chilled mortar containing nitrogen and ground with a pestle into a fine powder. The ground samples were transferred into 1.5 mL Eppendorf tubes and 500 µL of CTAB extraction buffer (100 mM Tris-HCl (pH 8.0), 20 mM EDTA, 1.4 M, 1% (w/v) PVP, and 0.2% (v/v) β-mercaptoethanol) and homogenized by gently vertexing the tube for 5–10 minutes to allow proper mixing and phase separation. The content was centrifuged at 12,000 rpm for 10–15 minutes at room temperature. 300 μL of the upper aqueous phase supernatant (containing the DNA) was pipetted into a new sterilized EP tube and 0.6 volumes of cold isopropanol added mix gently and incubated in −20 °C for 30 minutes to precipitate the DNA.

The content was centrifuged at 12,000 rpm for 10 minutes at 4 °C to pellet the DNA. The supernatant was discarded and 500 µL of 70% ethanol was added to the DNA pellet, gently inverted to wash the DNA and then centrifuged again at 12,000 rpm for 5 minutes at 4 °C. The ethanol wash was discarded and the DNA pellet air dried for 10–15 minutes to remove residual ethanol. The extracted DNA was eluted with a pre-warmed 50 μL RNAse-free water (the RNAse-free water was prewarmed to 55–60 °C to improve the elution) and stored −80 °C.

The quality and quantity of the extracted DNA were examined using a NanoDrop 2000 spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA), Qubit dsDNA HS Assay Kit on a Qubit 3.0 Fluorometer (Life Technologies, Carlsbad, CA, USA) and electrophoresis on a 0.8% agarose gel, respectively. The SMRTbell library was constructed using the SMRTbell Express Template Prep kit 2.0 (Pacific Biosciences). Briefly, 15 μg of the genomic DNA was carried into the first enzymatic reaction to remove single-stranded overhangs followed by treatment with repair enzymes to repair any damage that may be present on the DNA backbone. After DNA damage repair, ends of the double-stranded fragments were polished and subsequently tailed with an A-overhang. Ligation with T-overhang SMRTbell adapters was performed at 20 °C for 60 minutes. Following ligation, the SMRTbell library was purified with 1X AMPure PB beads. The size distribution and concentration of the library were assessed using the FEMTO Pulse automated pulsed-field capillary electrophoresis instrument (Agilent Technologies, Wilmington, DE) and the Qubit 3.0 Fluorometer (Life Technologies, Carlsbad, CA, USA). Following library characterization, 3 μg was subjected to a size selection step using the BluePippin system (Sage Science, Beverly, MA) to remove SMRTbells ≤15 kb. After size selection, the library was purified with 1X AMPure PB beads. Library size and quantity were assessed using the FEMTO Pulse and the Qubit dsDNA HS reagents Assay kit. Sequencing primer and Sequel II DNA Polymerase were annealed and bound, respectively, to the final SMRTbell library. The library was loaded at an on-plate concentration of 120 pM using diffusion loading. SMRT sequencing was performed using a single 8 M SMRT Cell on the Sequel II System with Sequel II Sequencing Kit, 1800-minute movies by Frasergen. Finally, we obtained 5.51 Gb and 8.33 Gb high fidelity (HiFi) reads for T. mentagrophytes and T. tonsurans, with an average depth of 225X and 330X, respectively (Table 2). Both the average length and N50 length of the HiFi data were close to 20 kb.

Table 2 Statistics of HiFi reads of two Trichophyton species.

Full size table

De novo genome assembly

A total of 269 thousand and 450 thousand HiFi reads for T. mentagrophytes and T. tonsurans were assembled using the Hifiasm software (version 0.19.5-r587) with parameters of ‘–telo-m CCCTAAA’¹⁰. The genomes of T. mentagrophytes and T. tonsurans showed no gap, with the total sizes of 24.48 Mb and 25.24 Mb, respectively (Table 3). The contig numbers were 19 and 24, with the contig N50 values of 6.47 Mb and 12.65 Mb, respectively. Notably, we found three super-contigs that accounted for 95.58% and 96.67% of the T. mentagrophytes and T. tonsurans genomes, respectively. Moreover, two super-contigs of the two assemblies exhibited the canonical telomere repeat sequences in both ends (Fig. 1A,B, Table 4). Together, these results suggested that the three super-contigs should represent pseudo-chromosomes of the genomes and two of them had achieved telomere-to-telomere (T2T) level. Moreover, we used the previously published mitochondrion sequences of T. mentagrophytes¹¹ to search the mitochondrion genome in the draft genomes of two species. Finally, we obtained two mitochondrion genomes for T. mentagrophytes and T. tonsurans, with the total sizes of 24,299 bp and 24,306 bp, respectively. The genome assembly was visualized using Circos (version 0.69-6)¹². The genomic synteny was analyzed and visualized using JCVI (version 1.4.22) software with default parameters¹³.

Table 3 Genomic characteristics of two Trichophyton species.

Full size table

Table 4 Information of telomere in the genomes of two Trichophyton species.

Full size table

The completeness of the genome was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO, version 5.5.0) with fungi_odb10, ascomycota_odb10 databases¹⁴. The results showed that the two genomes yielded over 99% of genomic completeness (Fig. 2, Table 3). The HiFi reads were realigned to the assembly using Minimap2 (version 2.26-r1175, with parameters of ‘-ax map-hifi’) and Samtools (version 1.17) software^15,16, revealing that more than 99.99% of the HiFi reads can be mapped to the corresponding assembly. These results indicated the high quality, accuracy, continuity and completeness of the T. mentagrophytes and T. tonsurans genome assemblies.

Two haploid genomes of each species were assembled using Hifiasm (version 0.19.5-r587) with parameters of ‘–telo-m CCCTAAA’, and they showed similar genome sizes and the contig N90 values (Table 3). Crucially, all of the haploid genomes contained three pseudo-chromosomes that accounted for more than 95% of the genomes, and we also observed the canonical telomere repeats the ends of chromosomes (Table 4), highlighting the high quality and continuity of the haploid genomes. Moreover, BUSCO analysis for each haploid genome showed a high completeness (Table 4).

Genome annotation

Gene structure prediction was carried out using the GETA software (version 2.5.7), which is accessible at its GitHub repository (https://github.com/chenlianfu/GETA). GETA integrates three different gene prediction approaches: the de novo method, homology-based prediction, and transcriptome-based prediction. Throughout this process, a suite of tools was applied, including PASA¹⁷, genewise¹⁸, AUGUSTUS¹⁹, Hisat2²⁰, hmmsearch²¹, BGM2AT, exonerate, and others. In total, 7577 and 7519 genes were identified in the T. mentagrophytes and T. tonsurans genomes, with 1965 and 1826 genes containing alternative isoforms (Table 3). Additionally, we discovered 503 and 421 long-noncoding RNAs (lncRNAs) in the two genomes.

To infer gene functions, interpro-scan (version 5.65–97.0) and eggNOG-Mapper (version 2.1.12) were used, and a range of gene function databases were consulted for detailed functional annotations. These databases included Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Pfam, Clusters of Orthologous Groups (COG), and the Swiss-Prot protein database^{22,23,24,25,26,27,28}. Interestingly, nearly all the genes (7497/7577 in T. mentagrophytes and 7449/7519 in T. tonsurans) were well-annotated, implying the accuracy and completeness of the gene prediction results (Fig. 3A,B).

For repeat annotation in the two genomes, the de novo identification of repeat elements was conducted with RepeatModeler (version 2.0.3)²⁹. The repeat consensus sequences identified were then used as seeds for RepeatMasker (version 4.1.0)³⁰ to locate and analyze all repeat regions within the genome. To predict full-length LTR retrotransposons (LTR-RTs), a combination of tools was used, including LTR_finder (version 1.07, with parameters of ‘-D 15000 -d 1000 -L 7000 -l 100 -p 20 -C -M 0.8’)³¹, LTRharvest (version 1.6.2, with parameters of ‘-similar 80 -vic 10 -seed 20 -seqids yes -minlenltr 100 -maxlenltr 7000 -mintsd 4 -maxtsd 6’)³², and LTR_retriever (version 2.9.0, with default parameters)³³. The simple repeat with the repeat unit of either ‘TAACCC’ or ‘GGGTTA’ was considered as the telomeric repeat³⁴. A total of 8.89% and 9.99% of the T. mentagrophytes and T. tonsurans genomes were covered by repeat elements (Table 3). Among these repeats, LTR retrotransposon (LTR-RT) Ty3/Gypsy elements were predominant in both genomes (Table 5). Notably, we discovered a substantial amount of non-LTR-RT Long interspersed nuclear elements (LINEs) exclusively in the T. tonsurans genome, suggesting that these TEs were amplified after the divergence between the two species.

Table 5 Repeat classification and content in the two Trichophyton genomes.

Full size table

Data Records

The raw HiFi reads of T. tonsurans and T. mentagrophytes were deposited in NCBI under the accessions of SRP536180³⁵ and SRP536145³⁶, respectively. The genome assemblies have been deposited at GeneBank with accession numbers of GCA_047301375.1³⁷ (T. tonsurans) and GCA_047301425.1³⁸ (T. mentagrophytes) Additionally, we have deposited the genome assemblies, including haploid genomes, and gene annotation results in Figshare^39,40.

Technical Validation

Several methods were employed to assess the quality of the genomes. At first, BUSCO analysis with two databases, including fungi_odb10, ascomycota_odb10, was used to evaluate the completeness of genome. The results showed that the two assemblies exhibited over 99% of genome completeness according to fungi_odb10 dataset and 97.3% of genome completeness according to ascomycota_odb10. Then, the genomes of two species contained no gap, and the three pseudo-chromosomes covered over 95% of the genomes, and two of the three pseudo-chromosomes exhibited the telomere repeat sequences in the both ends. Moreover, over 99.99% of the HiFi reads can be realigned to the genome assemblies. In aggregate, these results indicated the high quality, accuracy, continuity and completeness of the T. mentagrophytes and T. tonsurans genome assemblies.

Code availability

The bioinformatic tools, including version and parameters, that were used in this study have been described in the Method section. No customized script was used.

References

Svarcova, M., Vetrovsky, T., Kolarik, M. & Hubka, V. Defining the relationship between phylogeny, clinical manifestation, and phenotype for Trichophyton mentagrophytes/interdigitale complex; a literature review and taxonomic recommendations. Medical mycology 61 https://doi.org/10.1093/mmy/myad042 (2023).
Havlickova, B., Czaika, V. A. & Friedrich, M. Epidemiological trends in skin mycoses worldwide. Mycoses 51(Suppl 4), 2–15, https://doi.org/10.1111/j.1439-0507.2008.01606.x (2008).
Article PubMed Google Scholar
Weitzman, I. & Summerbell, R. C. J. C. M. R. The dermatophytes. 8, 240–259 (1995).
Ajello, L. J. M. E. M. A. Natural history of the dermatophytes and related fungi. 53, 93–110 (1974).
Graser, Y., Kuijpers, A. F., Presber, W. & De Hoog, G. S. Molecular taxonomy of Trichophyton mentagrophytes and T. tonsurans. Medical mycology 37, 315–330, https://doi.org/10.1046/j.1365-280x.1999.00234.x (1999).
Article CAS PubMed Google Scholar
Frias-De-Leon, M. G. et al. Molecular identification of isolates of the Trichophyton mentagrophytes complex. Int J Med Sci 17, 45–52, https://doi.org/10.7150/ijms.35173 (2020).
Article CAS PubMed PubMed Central MATH Google Scholar
Ziolkowska, G. et al. Molecular identification and classification of Trichophyton mentagrophytes complex strains isolated from humans and selected animal species. Mycoses 58, 119–126, https://doi.org/10.1111/myc.12284 (2015).
Article PubMed Google Scholar
Sakata, Y., Ushigami, T., Anzawa, K. & Mochizuki, T. Molecular Epidemiology of Trichophyton tonsurans, the Causative Dermatophyte of the Tinea Gladiatorum Epidemic in Japan between 2011 and 2015. Jpn J Infect Dis 71, 140–144, https://doi.org/10.7883/yoken.JJID.2017.449 (2018).
Article CAS PubMed Google Scholar
Vilanova, S. et al. SILEX: a fast and inexpensive high-quality DNA extraction method suitable for multiple sequencing platforms and recalcitrant plant species. 16, 1–11 (2020).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nature methods 18, 170–175, https://doi.org/10.1038/s41592-020-01056-5 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, Y. et al. Recent dermatophyte divergence revealed by comparative and phylogenetic analysis of mitochondrial genomes. BMC genomics 10, 238, https://doi.org/10.1186/1471-2164-10-238 (2009).
Article CAS PubMed PubMed Central MATH Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome research 19, 1639–1645, https://doi.org/10.1101/gr.092759.109 (2009).
Article CAS PubMed PubMed Central MATH Google Scholar
Tang, H. et al. JCVI: A versatile toolkit for comparative genomics analysis. e211 (2024).
Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212, https://doi.org/10.1093/bioinformatics/btv351 (2015).
Article CAS PubMed Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100, https://doi.org/10.1093/bioinformatics/bty191 (2018).
Article CAS PubMed PubMed Central MATH Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079, https://doi.org/10.1093/bioinformatics/btp352 (2009).
Article CAS PubMed PubMed Central MATH Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biology 9, R7, https://doi.org/10.1186/gb-2008-9-1-r7 (2008).
Article CAS PubMed PubMed Central MATH Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and Genomewise. Genome Research 14, 988–995, https://doi.org/10.1101/gr.1865504 (2004).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research 34, W435–439, https://doi.org/10.1093/nar/gkl200 (2006).
Article CAS PubMed PubMed Central MATH Google Scholar
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nature biotechnology 37, 907–915, https://doi.org/10.1038/s41587-019-0201-4 (2019).
Article CAS PubMed PubMed Central Google Scholar
Johnson, L. S., Eddy, S. R. & Portugaly, E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC bioinformatics 11, 431, https://doi.org/10.1186/1471-2105-11-431 (2010).
Article CAS PubMed PubMed Central Google Scholar
Huerta-Cepas, J. et al. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Molecular Biology and Evolution 34, 2115–2122, https://doi.org/10.1093/molbev/msx148 (2017).
Article CAS PubMed PubMed Central MATH Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240, https://doi.org/10.1093/bioinformatics/btu031 (2014).
Article CAS PubMed PubMed Central MATH Google Scholar
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Research 28, 27–30, https://doi.org/10.1093/nar/28.1.27 (2000).
Article CAS PubMed PubMed Central MATH Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature genetics 25, 25–29, https://doi.org/10.1038/75556 (2000).
Article CAS PubMed PubMed Central MATH Google Scholar
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Research 31, 365–370, https://doi.org/10.1093/nar/gkg095 (2003).
Article CAS PubMed PubMed Central MATH Google Scholar
Punta, M. et al. The Pfam protein families database. Nucleic Acids Research 40, D290–301, https://doi.org/10.1093/nar/gkr1065 (2012).
Article CAS PubMed MATH Google Scholar
Tatusov, R. L., Galperin, M. Y., Natale, D. A. & Koonin, E. V. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Research 28, 33–36, https://doi.org/10.1093/nar/28.1.33 (2000).
Article CAS PubMed PubMed Central MATH Google Scholar
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proceedings of the National Academy of Sciences of the United States of America 117, 9451–9457, https://doi.org/10.1073/pnas.1921046117 (2020).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Current protocols in bioinformatics Chapter 4, Unit 4 10, https://doi.org/10.1002/0471250953.bi0410s05 (2004).
Article PubMed Google Scholar
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic acids research 35, W265–268, https://doi.org/10.1093/nar/gkm286 (2007).
Article PubMed PubMed Central Google Scholar
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC bioinformatics 9, 18, https://doi.org/10.1186/1471-2105-9-18 (2008).
Article CAS PubMed PubMed Central Google Scholar
Ou, S. & Jiang, N. LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons. Plant physiology 176, 1410–1422, https://doi.org/10.1104/pp.17.01310 (2018).
Article CAS PubMed MATH Google Scholar
Lycka, M. et al. TeloBase: a community-curated database of telomere sequences across the tree of life. Nucleic acids research 52, D311–D321, https://doi.org/10.1093/nar/gkad672 (2024).
Article CAS PubMed Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP536180 (2024).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP536145 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_047301375.1 (2025).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_047301425.1 (2025).
lin, lianyu. Genome assembly of Trichophyton mentagrophytes and Trichophyton tonsurans. figshare. https://doi.org/10.6084/m9.figshare.27247950.v1 (2024).
lin, lianyu. Genome annotation files of Trichophyton mentagrophytes and Trichophyton tonsurans. figshare. https://doi.org/10.6084/m9.figshare.27247986.v1 (2024).

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 82403083, 82403160), the Doctoral Fund of Henan Polytechnic University (No. B2020-51), the Fundamental Research Funds for the Universities of Henan Province (No. NSFRF240631), the Key Scientific and Technological Project of Henan Science and Technology Department (No. 242102310296).

Author information

These authors contributed equally: Sijie Feng, Zhenhui Wang, Kainan Lin.

Authors and Affiliations

School of Medicine, Henan Polytechnic University, 454000, Jiaozuo, China
Sijie Feng, Zhenhui Wang, Kun Wang & Shuting Zheng
School of Medicine, Zhejiang University, 310016, Hangzhou, China
Sijie Feng, Kainan Lin, Qianqian Wang & Yunkun Lu
State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Science, Fujian Agriculture and Forestry University, 350002, Fuzhou, China
Lianyu Lin

Authors

Sijie Feng
View author publications
Search author on:PubMed Google Scholar
Zhenhui Wang
View author publications
Search author on:PubMed Google Scholar
Kainan Lin
View author publications
Search author on:PubMed Google Scholar
Kun Wang
View author publications
Search author on:PubMed Google Scholar
Shuting Zheng
View author publications
Search author on:PubMed Google Scholar
Qianqian Wang
View author publications
Search author on:PubMed Google Scholar
Lianyu Lin
View author publications
Search author on:PubMed Google Scholar
Yunkun Lu
View author publications
Search author on:PubMed Google Scholar

Contributions

Y.L., S.F. and L.L. made the study conception. S.F., Z.W., K.W. and S.Z. prepared the fungal samples. Y.L., S.F. and L.L. wrote the manuscript. Y.L., L.L. and Q.W. supervised this project. Y.L. and L.L. made the bioinformatic analysis. S.F., K.L. and Q.W. prepared the experiments and library construction. K.W., S.Z. and Q.W. provided reagents and helped with experiments. Y.L., S.F., Z.W. and L.L. wrote the manuscript with comments and contributions from all authors.

Corresponding authors

Correspondence to Qianqian Wang, Lianyu Lin or Yunkun Lu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Feng, S., Wang, Z., Lin, K. et al. Haplotype-resolved genomes of Trichophyton mentagrophytes and Trichophyton tonsurans. Sci Data 12, 559 (2025). https://doi.org/10.1038/s41597-025-04835-x

Download citation

Received: 22 October 2024
Accepted: 14 March 2025
Published: 10 April 2025
Version of record: 10 April 2025
DOI: https://doi.org/10.1038/s41597-025-04835-x