replying to Y. Wang et al. Communications Biology https://doi.org/10.1038/s42003-025-09324-w (2025)
Rhaphidophoridae, commonly known as cave crickets, is a wingless and ancient lineage of Tettigoniidea. Kim et al.1 (= KIM24) studied the phylogeny and biogeography of Rhaphidophoridae using a multi-gene dataset with a partition model for their analysis. Wang et al. (= WEA) reanalyzed the dataset of KIM24 and raised questions regarding the reproducibility of the results, and showed that their tree topology based on the mixture model was significantly different compared to that of KIM24. However, we identified several issues in their study and demonstrate here that some arguments presented by WEA are based on weak grounds.
WEA, based on the reexamination of the partitioned maximum likelihood (ML) analysis using MrBayes dataset of KIM24 with IQ-TREE, argued that the reproducibility of the dataset from KIM24 is substantially low, as they observed that replicated runs often differed significantly in their tree topology. To check whether the tree topology of the KIM24’s study is reproducible, we generated 30 replicates of ML trees for both IQ-TREE and MrBayes datasets of KIM24 using the same partition and substitution model as in the original study using IQ-TREE v. 2.2.2.62 utilizing partition model3 and branch support evaluated by 2000 replicates of ultrafast bootstrap (UFBoot) trees4,5 and 1000 replicates of SH-aLRT tests6. In contrast to WEA, the subfamily-level topology of all 30 replicates from both datasets was identical, with only minimal intra-subfamily rearrangements (Fig. 1, refer to supplementary files).
The topology is derived from a maximum likelihood tree based on MrBayes dataset (112 taxa with 3 taxa for outgroup) from KIM24 under partitioning. The node support values of the ML tree for the IQ-TREE dataset and the MrBayes dataset are each represented by colors in squares. A. Anoplophilinae, Aemo. aemodogryllinae, Ceu. ceuthophilinae, G. gammarotettiginae, Macro. macropathinae, Rh. rhaphidophorinae, T. tropidischinae.
As in WEA, we observed low branch support values in some parts of the tree obtained from the MrBayes dataset (Fig. 1). However, in the IQ-TREE dataset, UFBoot support values for the major clade groupings were high (≥95 in case of the tree with the best log-likelihood), except for the (Dolichopodainae+Troglophilinae) and (Anoplophilinae+Gammarotettiginae) pairs, exactly matching the bootstrap support patterns of KIM24. Here, the lower support value for the first grouping is likely caused by the dense sampling of closely related species7,8, whereas the lower support for the second grouping deserves more attention, as it is likely driven by the low data coverage of Gammarotettix genitalis. However, despite the high proportion of missing data, the phylogenetic placements of Gammarotettix were highly consistent across analyses with moderate support. The major difference between the MrBayes and IQ-TREE datasets of KIM24 is the outgroup sampling scheme: the MrBayes dataset included only a single outgroup species, whereas the IQ-TREE dataset included three outgroup species that formed a sister clade to Rhaphidophoridae. However, as shown from a previous phylogenomic study9, the divergence time between these two clades is relatively old. In fact, when only a single distant outgroup is used, as in the current case, a long stem branch arising from an outgroup can potentially result in a random rooting of the phylogeny, yielding lower branch supports at ingroup nodes10,11,12. Such a bias can be mitigated by using multiple outgroup species that could break distant relationships13. This explains why the branch supports obtained by WEA and the current analysis based on the MrBayes dataset are considerably lower than the results from the IQ-TREE dataset.
WEA also suggested a different subfamily-level topology based on the “better fitting” CAT-GTR mixture model14. It is widely known that evolutionary history between genes and sites can be heterogeneous, and failing to account for this can result in a biased estimation of phylogenetic parameters, including the tree topology15,16,17,18. Both partition models and mixture models have been shown to account for the gene-wise heterogeneity relatively well across a wide range of datasets19,20. However, both models have their pros and cons19,21, and thus claims about the superiority of one model over the other must rely on sound statistical justification.
WEA justified the superiority of their CAT-GTR tree topology based on relative model fit tests based on the Bayesian leave-one-out cross-validation (LOO-CV) and the widely applicable information criterion (wAIC)22,23, and also with an absolute model fit test based on the posterior predictive checking24,25 of CAT-GTR and GTR substitution models. The result of WEA shows that the model fit, based on all the above criteria, is much superior for the CAT-GTR model. However, their series of model testing has limited statistical value as one of the hypotheses considers a heterogeneous substitution process (CAT-GTR) while the other (GTR) does not. Therefore, the results merely show the presence of the heterogeneous substitution process within the dataset, an issue which was also pointed out by Boudinot et al.21. To claim the superiority of the CAT-GTR model over the partition model, which was used in KIM24’s analyses, one must directly compare the relative fit of the mixture model over the partition model. However, a comparison of the relative fit of the mixture versus partition models based on traditional model selection approaches can be problematic, as they are biased to favor partition models over mixture models26,27. Despite the availability of some potentially useful model selection criteria28,29, there seems to be no readily applicable methodology that can accurately compare the relative fit of the mixture models and partition models27. Thus, it may be premature to make strong claims about the superiority of either model until further theoretical and methodological advances that allow for a more thorough comparison of model fit between partition and mixture models. We also note that complex infinite mixture models, such as CAT-GTR, might not be the best way to model substitution heterogeneity of datasets like ours, as their computational complexity can far exceed the marginal benefit unless the divergence is exceedingly old, or the evolutionary process was extremely heterogeneous30.
Another thing to note is that the CAT-GTR model has been shown to be susceptible to missing data, especially when they are concentrated in a small number of taxa30,31,32, where, on the other hand, partition models remain relatively robust to incomplete taxa33,34. In this regard, the first-diverging position of Gammarotettiginae in the CAT-GTR tree of WEA must be critically viewed, as Gammarotettix genitalis was presented with a relatively high proportion of missing data. WEA also obtained the first-diverging position of Gammarotettiginae in their unpartitioned GTR tree, but this cannot be used to justify the obtained topology as failing to account for substitution heterogeneity is known to severely affect topology inference17. Collectively, WEA’s tree topology based on CAT-GTR cannot be a superior hypothesis to KIM24’s topology. As can be seen from the past studies and also from our concerns regarding Gammarotettiginae, the CAT-GTR model may not perform well under certain conditions30,31,32. Therefore, we argue that CAT-GTR cannot always be a “magic bullet” that can solve all phylogenetic problems.
It is true that the dataset used in KIM24 is far from perfect. Indeed, the positions of several subfamilies remain uncertain, and genomic-scale data would be needed to resolve a more robust phylogeny of Rhaphidophoridae. Thus, we are currently conducting a follow-up study using a morphology-based cladistics approach, as well as a phylogenomic study based on genomic data generated using the Orthoptera-specific target enrichment (OR-TE) probes35, to reconstruct a robust genome-scale phylogenetic tree. Through this genome-scale analysis, we believe that we will be able to obtain a more robust phylogeny of Rhaphidophoridae to elucidate their enigmatic evolutionary history.
Data availability
The data underlying our analyses are described and accessible from Kim et al.1.
References
Kim, D. Y., Kim, S., Song, H. & Shin, S. Phylogeny and biogeography of the wingless orthopteran family Rhaphidophoridae. Commun. Biol. 7, 401 (2024).
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Chernomor, O., Von Haeseler, A. & Minh, B. Q. Terrace aware data structure for phylogenomic inference from supermatrices. Syst. Biol. 65, 997–1008 (2016).
Minh, B. Q., Nguyen, M. A. T. & Von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).
Hoang, D. T., Chernomor, O., Von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Soltis, P. S. & Soltis, D. E. Applying the bootstrap in phylogeny reconstruction. Stat. Sci. 18, 256–267 (2003).
Lemoine, F. et al. Renewing Felsenstein’s phylogenetic bootstrap in the era of big data. Nature 556, 452–456 (2018).
Song, H. et al. Phylogenomic analysis sheds light on the evolutionary pathways towards acoustic communication in Orthoptera. Nat. Commun. 11, 4939 (2020).
Wheeler, W. C. Nucleic acid sequence phylogeny and random outgroups. Cladistics 6, 363–367 (1990).
Gatesy, J., DeSalle, R. & Wahlberg, N. How many genes should a systematist sample? Conflicting insights from a phylogenomic matrix characterized by replicated incongruence. Syst. Biol. 56, 355–363 (2007).
Hirase, S., Takeshima, H., Nishida, M. & Iwasaki, W. Parallel mitogenome sequencing alleviates random rooting effect in phylogeography. Genome Biol. Evol. 8, 1267–1278 (2016).
Bergsten, J. A review of long-branch attraction. Cladistics 21, 163–193 (2005).
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Buckley, T. R., Simon, C. & Chambers, G. K. Exploring among-site rate variation models in a maximum likelihood framework using empirical data: effects of model assumptions on estimates of topology, branch lengths, and bootstrap support. Syst. Biol. 50, 67–86 (2001).
Pupko, T., Huchon, D., Cao, Y., Okada, N. & Hasegawa, M. Combining multiple data sets in a likelihood analysis: which models are the best?. Mol. Biol. Evol. 19, 2294–2307 (2002).
Kolaczkowski, B. & Thornton, J. W. Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. Nature 431, 980–984 (2004).
Štefankovič, D. & Vigoda, E. Pitfalls of heterogeneous processes for phylogenetic reconstruction. Syst. Biol. 56, 113–124 (2007).
Kainer, D. & Lanfear, R. The effects of partitioning on phylogenetic inference. Mol. Biol. Evol. 32, 1611–1627 (2015).
Bujaki, T., Van Looyen, K. & Rodrigue, N. Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches. Bioinf. Adv. 3, vbad091 (2023).
Boudinot, B. E. et al. Systematic bias and the phylogeny of Coleoptera—A response to Cai et al. (2022) following the responses to Cai et al. (2020). Syst. Entomol. 48, 223–232 (2023).
Watanabe, S. Algebraic Geometry and Statistical Learning Theory. Cambridge University Press, Cambridge (2009).
Lartillot, N. Identifying the best approximating model in Bayesian phylogenetics: Bayes factors, cross-validation or wAIC?. Syst. Biol. 72, 616–638 (2023).
Meng, X. L. Posterior predictive p-values. Ann. Stat. 22, 1142–1160 (1994).
Bollback, J. P. Bayesian model adequacy and choice in phylogenetics. Mol. Biol. Evol. 19, 1171–1180 (2002).
Crotty, S. M. & Holland, B. R. Comparing partitioned models to mixture models: Do information criteria apply?. Syst. Biol. 71, 1541–1548 (2022).
Liu, Q., Charleston, M. A., Richards, S. A. & Holland, B. R. Performance of Akaike information criterion and Bayesian information criterion in selecting partition models and mixture models. Syst. Biol. 72, 92–105 (2023).
Jhwueng, D., Huzurbazar, S., O’Meara, B. C. & Liu, L. Investigating the performance of AIC in selecting phylogenetic models. Stat. Appl. Genet. Mol. Biol. 13, 459–475 (2014).
Susko, E. & Roger, A. J. On the use of information criteria for model selection in phylogenetics. Mol. Biol. Evol. 37, 549–562 (2020).
Whelan, N. V. & Halanych, K. M. Who let the CAT out of the bag? Accurately dealing with substitutional heterogeneity in phylogenomic analyses. Syst. Biol. 66, 232–255 (2017).
Li, Y. et al. Phylogenomics of tubeworms (Siboglinidae, Annelida) and comparative performance of different reconstruction methods. Zool. Scr. 46, 200–213 (2017).
Al Jewari, C. & Baldauf, S. L. Conflict over the eukaryote root resides in strong outliers, mosaics and missing data sensitivity of site-specific (CAT) mixture models. Syst. Biol. 72, 1–16 (2023).
Wiens, J. J. Missing data, incomplete taxa, and phylogenetic accuracy. Syst. Biol. 52, 528–538 (2003).
Wiens, J. J. & Tiu, J. Highly incomplete taxa can rescue phylogenetic analyses from the negative impacts of limited taxon sampling. PLoS One 7, e42925 (2012).
Shin, S. et al. Orthoptera-specific target enrichment (OR-TE) probes resolve relationships over broad phylogenetic scales. Sci. Rep. 14, 21377 (2024).
Author information
Authors and Affiliations
Contributions
D.K. and W.L.: study design, data analysis, manuscript writing, visualization. S.K.: study design, manuscript writing. S.S. and H.S.: manuscript writing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks Perry G. Beasley-Hall and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Christina Karlsson Rosenthal.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Kim, DY., Kim, S., Lim, W. et al. Reply to: Uncertainties in the phylogeny and biogeography of cave crickets. Commun Biol 8, 1797 (2025). https://doi.org/10.1038/s42003-025-09325-9
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s42003-025-09325-9
