Abstract
Chloroplast RNA splicing and ribosome maturation (CRM) domain proteins are central to chloroplast intron splicing, rRNA processing, and plant stress responses. While extensively studied in model plants such as Arabidopsis, rice, and maize, the CRM gene family has not been characterized in Pyrus, a major fruit crop of global commercial agricultural productivity . This study presents the first pangenome-wide analysis of CRM genes across 8 Pyrus genomes, aiming to unravel their evolutionary patterns, structural diversity, and potential functional roles in stress adaptation. In this study, we identified 25 PyCRM pan-genes, comprising 9 core, 15 variable, and 1 species-specific members, reflecting gene presence-absence variation within the family. Phylogenetic analysis grouped these proteins into four conserved subfamilies, with the CRS1 subfamily being the largest. Gene structure and motif profiling revealed subfamily-specific conservation alongside distinct motif variations, suggesting functional divergence. Ka/Ks calculation revealed that most of the PyCRM genes evolved under purifying selection, while a few members experienced positive selection, highlighting functional diversification within the CRM gene family. Promoter analysis identified multiple cis-regulatory elements linked to the light response, hormone signaling, stress responses, and developmental processes. Functional enrichment highlighted roles in RNA binding and mRNA processing. Members of the same family share similar motifs and conserved protein 3D structures. Furthermore, expression profiling under cold, drought, and disease revealed that most PyCRM genes were downregulated under stress conditions, while a few members exhibited stress-specific upregulation, suggesting functional diversification within the gene family. These findings lay the foundation for future investigations into CRM-mediated stress tolerance and crop improvement.
Similar content being viewed by others
Data availability
All data generated and analyzed are included in the main text and Supporting Information.
References
Rolland, N. et al. The biosynthetic capacities of the plastids and integration between cytoplasmic and chloroplast processes. Annu. Rev. Genet. 46 (1), 233–264. https://doi.org/10.1146/annurev-genet-110410-132544 (2012).
Ahmed, F., Benedito, V. A. & Zhao, P. X. Mining functional elements in messenger RNAs: Overview, challenges, and perspectives. Front. Plant Sci. 2, 84. https://doi.org/10.3389/fpls.2011.00084 (2011).
Meng, K., Qin, Y., Qu, J., Lee, S. Y., Lu, L., Zou, P., & Huang, G. (2025). Chloroplast genome characteristics and phylogenetic analysis of Mangifera indica L.‘Guiqi’and M. quadrifida Jack (anacardiaceae). Mitochondrial DNA Part B, 10(10), 996-1001.
Wang, X., Wang, J., Li, S., Lu, C. & Sui, N. An overview of RNA splicing and functioning of splicing factors in land plant chloroplasts. RNA Biol. 19 (1), 897–907. https://doi.org/10.1080/15476286.2022.2096801 (2022).
Jacobs, J. & Kück, U. Function of chloroplast RNA-binding proteins. Cell. Mol. Life Sci. 68 (5), 735–748. https://doi.org/10.1007/s00018-010-0523-3 (2011).
Zhang, Q. et al. Characterization of the CRM gene family and elucidating the function of OsCFM2 in rice. Biomolecules 10 (2), 327. https://doi.org/10.3390/biom10020327 (2020).
Asakura, Y. & Barkan, A. A CRM domain protein functions dually in group I and group II intron splicing in land plant chloroplasts. Plant. Cell. 19 (12), 3864–3875. https://doi.org/10.1105/tpc.107.055160 (2007).
Wang, H. C. et al. PPR14 interacts with PPR-SMR1 and CRM protein Zm-mCSF1 to facilitate mitochondrial intron splicing in maize. Front. Plant Sci. 11, 814. https://doi.org/10.3389/fpls.2020.00814 (2020).
Lee, K. & Kang, H. Roles of organellar RNA-binding proteins in plant growth, development, and abiotic stress responses. Int. J. Mol. Sci. 21 (12), 4548. https://doi.org/10.3390/ijms21124548 (2020).
Zhang, Q. et al. OsCRS2 encoding a peptidyl-tRNA hydrolase protein is essential for chloroplast development in rice. Plant. Growth Regul. 92 (3), 535–545. https://doi.org/10.1007/s10725-020-00655-8 (2020).
Lee, K., Park, S. J., Park, Y. I. & Kang, H. CFM9, a mitochondrial CRM protein, is crucial for mitochondrial intron splicing, mitochondria function and Arabidopsis growth and stress responses. Plant Cell Physiol. 60 (11), 2538–2548. https://doi.org/10.1093/pcp/pcz147 (2019).
Sawant, S. S., Choi, E. D., Song, J. & Seo, H. J. Pear production trends and characteristics of important pests in India. 한국국제농업개발학회지 33 (3), 265–269. https://doi.org/10.12719/KSIA.2021.33.3.265 (2021).
Quinet, M. & Wesel, J. P. Botany and taxonomy of pear. In The Pear Genome, 1–33 (Springer, 2019).
Hu, X. et al. Identification of the CDPK Pan-Genomic Family in Pear (Pyrus spp.) and Analysis of Its Response to Venturia nashicola. Horticulturae 11 (10), 1181. https://doi.org/10.3390/horticulturae11101181 (2025).
Chen, S. et al. The pear genomics database (PGDB): a comprehensive multi-omics research platform for Pyrus spp. BMC Plant Biol. 23 (1), 430. https://doi.org/10.1186/s12870-023-04406-5 (2023).
Zhang, H. et al. Building a foundation for gene family analysis in Rosaceae genomes with a novel workflow: a case study in Pyrus architecture genes. Front. Plant Sci. 13, 975942. https://doi.org/10.3389/fpls.2022.975942 (2022).
Linsmith, G. et al. Pseudo-chromosome–length genome assembly of a double haploid Bartlett pear (Pyrus communis L). Gigascience 8 (12), giz138. https://doi.org/10.1093/gigascience/giz138 (2019).
Wu, J. et al. The genome of the pear (Pyrus bretschneideri Rehd). Genome Res. 23 (2), 396–408. https://doi.org/10.1101/gr.144311.112 (2013).
Gao, Y. et al. High-quality genome assembly of ‘Cuiguan’ pear (Pyrus pyrifolia) as a reference genome for identifying regulatory genes and epigenetic modifications responsible for bud dormancy. Hortic. Res. 8 https://doi.org/10.1038/s41438-021-00632-w (2021).
Shirasawa, K., Itai, A. & Isobe, S. Chromosome-scale genome assembly of Japanese pear (Pyrus pyrifolia) variety ‘Nijisseiki’. DNA Res. 28 (2), dsab001. https://doi.org/10.1093/dnares/dsab001 (2021).
Sun, M. et al. Telomere-to-telomere pear (Pyrus pyrifolia) reference genome reveals segmental and whole genome duplication driving genome evolution. Hortic. Res. 10 (11), uhad201. https://doi.org/10.1093/hr/uhad201 (2023).
Dong, X. et al. De novo assembly of a wild pear (Pyrus betuleafolia) genome. Plant Biotechnol. J. 18 (2), 581–595. https://doi.org/10.1111/pbi.13226 (2020).
Ou, C. et al. A de novo genome assembly of the dwarfing pear rootstock Zhongai 1. Sci. data. 6 (1), 281. https://doi.org/10.6084/m9.figshare.10029539 (2019).
Garcia-Hernandez, M. et al. TAIR: A resource for integrated Arabidopsis data. Funct. Integr. Genom. 2 (6), 239–253. https://doi.org/10.1007/s10142-002-0077-z (2002).
Marchler-Bauer, A. et al. CDD: NCBI’s conserved domain database. Nucleic Acids Res. 43 (D1), D222–D226. https://doi.org/10.1093/nar/gku1221 (2015).
Quevillon, E. et al. InterProScan: Protein domains identifier. Nucleic Acids Res. 33 (suppl_2), W116–W120. https://doi.org/10.1093/nar/gki442 (2005).
Gu, Z. & Hübschmann, D. Make interactive complex heatmaps in R. Bioinformatics 38 (5), 1460–1462. https://doi.org/10.1093/bioinformatics/btab806 (2022).
Horton, P. et al. WoLF PSORT: Protein localization predictor. Nucleic Acids Res. 35 (suppl_2), W585–W587. https://doi.org/10.1093/nar/gkm259 (2007).
Tahir ul Qamar, M. et al. Comparative genomics profiling of Citrus species reveals the diversity and disease responsiveness of the GLP pan-genes family. BMC Plant Biol. 25 (1), 388. https://doi.org/10.1186/s12870-025-06397-x (2025).
Wang, Y. et al. Detection of colinear blocks and synteny and evolutionary analyses based on utilization of MCScanX. Nat. Protoc. 19 (7), 2206–2229. https://doi.org/10.1038/s41596-024-00968-2 (2024).
Ranwez, V., Douzery, E. J., Cambon, C., Chantret, N. & Delsuc, F. MACSE v2: Toolkit for the alignment of coding sequences accounting for frameshifts and stop codons. Mol. Biol. Evol. 35 (10), 2582–2584. https://doi.org/10.1093/molbev/msy159 (2018).
Zhang, Z. et al. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genom. Proteom. Bioinform. 4 (4), 259–263. https://doi.org/10.1016/S1672-0229(07)60007-2 (2006).
Bailey, T. L. et al. MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Res. 37 (suppl_2), W202–W208. https://doi.org/10.1093/nar/gkp335 (2009).
Tao, W. et al. Genomic insights into positive selection during barley domestication. BMC Plant Biol. 22 (1), 267. https://doi.org/10.1186/s12870-022-03655-0 (2022).
Lescot, M. et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 30 (1), 325–327. https://doi.org/10.1093/nar/30.1.325 (2002).
Wimalagunasekara, S. S., Weeraman, J. W., Tirimanne, S. & Fernando, P. C. Protein-protein interaction (PPI) network analysis reveals important hub proteins and sub-network modules for root development in rice (Oryza sativa). J. Genetic Eng. Biotechnol. 21 (1), 69. https://doi.org/10.1186/s43141-023-00515-8 (2023).
Doncheva, N. T., Morris, J. H., Gorodkin, J. & Jensen, L. J. Cytoscape StringApp: Network analysis and visualization of proteomics data. J. Proteome Res. 18 (2), 623–632. https://doi.org/10.1021/acs.jproteome.8b00702 (2018).
Törönen, P., Medlar, A. & Holm, L. PANNZER2: A rapid functional annotation web server. Nucleic Acids Res. 46 (W1), W84–W88. https://doi.org/10.1093/nar/gky350 (2018).
Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630 (8016), 493–500. https://doi.org/10.1038/s41586-024-07487-w (2024).
Chen, V. B. et al. MolProbity: All-atom structure validation for macromolecular crystallography. Biol. Crystallogr. 66 (1), 12–21. https://doi.org/10.1107/S0907444909042073 (2010).
Pettersen, E. F. et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 30 (1), 70–82. https://doi.org/10.1002/pro.3943 (2021).
Maurya, A., Szymanski, M. & Karlowski, W. M. ARA: A flexible pipeline for automated exploration of NCBI SRA datasets. Gigascience 12, giad067. https://doi.org/10.1093/gigascience/giad067 (2023).
Santos, T. P. F. D., Sánchez-Barrantes, E., Pereira, L. F. & Gatica-Arias, A. Transcriptomic data analysis using the galaxy platform: Coffee (Coffea arabica L.) flowers as example. In Plant Functional Genomics: Methods and Protocols, vol. 1, 225–243 (Springer, 2024).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33 (3), 290–295. https://doi.org/10.1038/nbt.3122 (2015).
Asakura, Y., Bayraktar, O. A. & Barkan, A. Two CRM protein subfamilies cooperate in the splicing of group IIB introns in chloroplasts. RNA 14 (11), 2319–2332. https://doi.org/10.1261/rna.1223708 (2008). http://www.rnajournal.org/cgi/doi/
Feiz, L. et al. CFM1, a member of the CRM-domain protein family, functions in chloroplast group II intron splicing in Setaria viridis. Plant J. 105 (3), 639–648. https://doi.org/10.1111/tpj.15060 (2021).
Barkan, A. et al. The CRM domain: An RNA binding module derived from an ancient ribosome-associated protein. Rna 13 (1), 55–64. https://doi.org/10.1261/rna.139607 (2007).
Lee, K. et al. A nuclear-encoded chloroplast protein harboring a single CRM domain plays an important role in the Arabidopsis growth and stress response. BMC Plant Biol. 14 (1), 98. https://doi.org/10.1186/1471-2229-14-98 (2014).
Stern, D. B., Hanson, M. R. & Barkan, A. Genetics and genomics of chloroplast biogenesis: maize as a model system. Trends Plant Sci. 9 (6), 293–301. https://doi.org/10.1016/j.tplants.2004.04.001 (2004).
Shen, L. et al. OsCAF2 contains two CRM domains and is necessary for chloroplast development in rice. BMC Plant Biol. 20 (1), 381. https://doi.org/10.1186/s12870-020-02593-z (2020).
Wang, M. et al. An exon skipping in CRS1 is associated with perturbed chloroplast development in maize. Int. J. Mol. Sci. 22 (19), 10668. https://doi.org/10.3390/ijms221910668 (2021).
De Longevialle, A. F., Small, I. D. & Lurin, C. Nuclearly encoded splicing factors implicated in RNA splicing in higher plant organelles. Mol. Plant. 3 (4), 691–705. https://doi.org/10.1093/mp/ssq025 (2010).
Yuanwen Teng, Y., Tanabe, K., Tamura, F. & Itai, A. Genetic relationships of pear cultivars in Xinjiang, China, as measured by RAPD markers. J. Hortic. Sci. Biotechnol. 76 (6), 771–779. https://doi.org/10.1080/14620316.2001.11511444 (2001).
Qanmber, G. et al. Genome-wide identification and characterization of the PERK gene family in Gossypium hirsutum reveals gene duplication and functional divergence. Int. J. Mol. Sci. 20 (7), 1750. https://doi.org/10.3390/ijms20071750 (2019).
Todd, A. E., Orengo, C. A. & Thornton, J. M. Evolution of function in protein superfamilies, from a structural perspective. J. Mol. Biol. 307 (4), 1113–1143. https://doi.org/10.1006/jmbi.2001.4513 (2001).
Wittkopp, P. J. & Kalay, G. Cis-regulatory elements: Molecular mechanisms and evolutionary processes underlying divergence. Nat. Rev. Genet. 13 (1), 59–69. https://doi.org/10.1038/nrg3095 (2012).
Wang, L. et al. Identification of the light-harvesting chlorophyll a/b binding protein gene family in peach (Prunus persica L.) and their expression under drought stress. Genes 14 (7), 1475. https://doi.org/10.3390/genes14071475 (2023).
Li, Z. W., You, Z. H., Chen, X., Gui, J. & Nie, R. Highly accurate prediction of protein-protein interactions via incorporating evolutionary information and physicochemical characteristics. Int. J. Mol. Sci. 17 (9), 1396. https://doi.org/10.3390/ijms17091396 (2016).
Guzman, D. & Dhingra, A. Challenges and opportunities in pear breeding. In Achieving sustainable cultivation temperate zone tree fruits berries, 129–156. https://doi.org/10.1007/978-3-031-46150-7_19 (2019).
Zhu, B. & Hu, X. Wang, X.-s., Luo, X., Bioinformatics analysis of CRM gene family in Brassica rapa. Hubei Agric. Sci. 60(9), 141 .
Lee, K. & Kang, H. Emerging roles of RNA-binding proteins in plant growth, development, and stress responses. Mol. Cells. 39 (3), 179–185. https://doi.org/10.14348/molcells.2016.2359 (2016).
Acknowledgements
Authors would like to acknowledge the support provided by the Ongoing Research Funding program, (ORF-2026-358), King Saud University, Riyadh, Saudi Arabia.
Author information
Authors and Affiliations
Contributions
A.K. and M.S. performed the experiments, data analysis and wrote first draft of manuscript. K.F. contributed to methodology and visualization of results . K.P. and J.B. contributed to the validation of results, provided funding for additional analyses during revision, and revised the manuscript. M.T.u.Q. and S.M. conceived and supervised the study, contributed to project administration, and finalized the manuscript. All authors read and approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Khalil, A., Sadaqat, M., Fatima, K. et al. Pangenome-wide identification of chloroplast RNA splicing and ribosome maturation (CRM) genes in eight Pyrus genomes indicated their involvement in multiple stresses. Sci Rep (2026). https://doi.org/10.1038/s41598-026-48265-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-48265-0


