Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Investigating the replicability of the social and behavioural sciences

Abstract

Pursuing replicability — independent evidence for previous claims — is important for creating generalizable knowledge1,2. Here we attempted replications of 274 claims of positive results from 164 quantitative papers published from 2009 to 2018 in 54 journals in the social and behavioural sciences. Replications were high powered on average to detect the original effect size (median of 99.6%), used original materials when relevant and available, and were peer reviewed in advance through a standardized internal protocol. Replications showed statistically significant results in the original pattern for 151 of 274 claims (55.1% (95% confidence interval (CI) 49.2–60.9%)) and for 80.8 of 164 papers (49.3% (95% CI 43.8–54.7%)), weighed for replicating multiple claims per paper. We observed modest variation in replication rates across disciplines (42.5–63.1%), although some estimates had high uncertainty. The median Pearson’s r effect size was 0.25 (95% CI 0.21–0.27) for original studies and 0.10 (95% CI 0.09–0.13) for replication studies, an 82.4% (95% CI 67.8–88.2%) reduction in shared variance. Thirteen methods for evaluating replication success provided estimates ranging from 28.6% to 74.8% (median of 49.3%). Some decline in effect size and significance is expected based on power to detect original effects and regression to the mean because we replicated only positive results. We observe that challenges for replicability extend across social–behavioural sciences, illustrating the importance of identifying conditions that promote or inhibit replicability3,4.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Replication success rates across 13 binary assessments for papers.
The alternative text for this image may have been generated using AI.
Fig. 2: Correlation matrix among binary assessments of replication success across papers.
The alternative text for this image may have been generated using AI.
Fig. 3: Scatterplot of Pearson’s r effect sizes for original and replication studies.
The alternative text for this image may have been generated using AI.
Fig. 4: Scatterplot of Pearson’s r effect sizes for original and replication outcomes for new data and secondary data replication attempts.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

Data, materials and code associated with this research that can be shared without restriction are publicly available in a living OSF repository (https://doi.org/10.17605/OSF.IO/G5SNY)48. The living OSF repository represents improvements, fixes and additions that occur post-publication. Readers can also access a registered, archived version of this repository that is precisely the data, code and documentation as they existed upon publication of this paper (https://doi.org/10.17605/OSF.IO/BZFGY). The repository includes all available documentation for replication attempts regardless of whether they were completed. This includes most of the data and code from the individual replication attempts, save for any data that is proprietary or protected that will not be made available, or for which analyst teams were uncertain or unable to confirm that they were allowed to share secondary data. It is possible that some data, materials or code that could be shared openly is not available at the time of publication. Readers are encouraged to contact the corresponding author or the authors of the relevant sub-project (Supplementary Table 3) to see if more research content can be shared in the living repository. This paper is part of a collection of papers reporting on the SCORE program. Documentation, data and code for the entire program are available at https://doi.org/10.17605/OSF.IO/DTZX4.

Code availability

Code for individual replication projects is available alongside data and materials for each project in the OSF repository (https://doi.org/10.17605/OSF.IO/G5SNY). This includes a push button package with all code and data used to produce all statistics, figures and tables, and code that populates them directly into the manuscript from a template. Also available is a registered, archived version of the repository containing precisely the data, code and documentation used to generate the outcomes reported in this paper (https://doi.org/10.17605/OSF.IO/BZFGY).

References

  1. Nosek, B. A. & Errington, T. M. What is replication? PLoS Biol. 18, e3000691 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Reproducibility and Replicability in Science (National Academies of Sciences, Engineering and Medicine, 2019).

  3. Nosek, B. A. et al. Replicability, robustness, and reproducibility in psychological science. Annu. Rev. Psychol. 73, 719–748 (2022).

  4. Munafò, M. R. et al. A manifesto for reproducible science. Nat. Hum. Behav. 1, 0021 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  5. Open Science Collaboration. Estimating the reproducibility of psychological science. Science 349, aac4716 (2015).

    Article  Google Scholar 

  6. Klein, R. A. et al. Many Labs 2: investigating variation in replicability across samples and settings. Adv. Methods Pract. Psychol. Sci. 1, 443–490 (2018).

    Article  Google Scholar 

  7. Klein, R. A. et al. Investigating variation in replicability: a “many labs” replication project. Soc. Psychol. 45, 142–152 (2014).

    Article  Google Scholar 

  8. Ebersole, C. R. et al. Many Labs 5: testing pre-data collection peer review as an intervention to increase replicability. Preprint at OSF https://doi.org/10.31234/osf.io/sxfm2 (2019).

  9. Ebersole, C. R. et al. Many Labs 3: evaluating participant pool quality across the academic semester via replication. J. Exp. Soc. Psychol. 67, 68–82 (2016).

    Article  Google Scholar 

  10. Camerer, C. F. et al. Evaluating replicability of laboratory experiments in economics. Science 351, 1433–1436 (2016).

    Article  ADS  CAS  PubMed  Google Scholar 

  11. Camerer, C. F. et al. Evaluating the replicability of social science experiments in Nature and Science between 2010 and 2015. Nat. Hum. Behav. 2, 637–644 (2018).

    Article  PubMed  Google Scholar 

  12. Cova, F. et al. Estimating the reproducibility of experimental philosophy. Rev. Philos. Psychol. 12, 9–44 (2018).

  13. Errington, T. M. et al. Investigating the replicability of preclinical cancer biology. eLife 10, e71601 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Ioannidis, J. P. A. Why most published research findings are false. PLoS Med. 2, e124 (2005).

    Article  PubMed  PubMed Central  Google Scholar 

  15. Button, K. S. et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 14, 365–376 (2013).

    Article  CAS  PubMed  Google Scholar 

  16. Wagenmakers, E.-J., Wetzels, R., Borsboom, D., van der Maas, H. L. & Kievit, R. A. An agenda for purely confirmatory research. Perspect. Psychol. Sci. 7, 632–638 (2012).

    Article  PubMed  Google Scholar 

  17. Simmons, J. P., Nelson, L. D. & Simonsohn, U. False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychol. Sci. 22, 1359–1366 (2011).

    Article  PubMed  Google Scholar 

  18. Greenwald, A. G. Consequences of prejudice against the null hypothesis. Psychol. Bull. 82, 1–20 (1975).

    Article  Google Scholar 

  19. Rosenthal, R. The file drawer problem and tolerance for null results. Psychol. Bull. 86, 638–641 (1979).

    Article  Google Scholar 

  20. Nosek, B. A., Ebersole, C. R., DeHaven, A. C. & Mellor, D. T. The preregistration revolution. Proc. Natl Acad. Sci. USA 115, 2600–2606 (2018).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  21. Giner-Sorolla, R. Science or art? How aesthetic standards grease the way through the publication bottleneck but undermine science. Perspect. Psychol. Sci. 7, 562–571 (2012).

    Article  PubMed  Google Scholar 

  22. Nosek, B. A., Spies, J. R. & Motyl, M. Scientific utopia: II. Restructuring incentives and practices to promote truth over publishability. Perspect. Psychol. Sci. 7, 615–631 (2012).

    Article  PubMed  PubMed Central  Google Scholar 

  23. Nosek, B. A. et al. Promoting an open research culture. Science 348, 1422–1425 (2015).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  24. Freese, J. & Peterson, D. Replication in social science. Annu. Rev. Sociol 43, 147–165 (2017).

    Article  Google Scholar 

  25. Andrews, I. & Kasy, M. Identification of and correction for publication bias. Am. Econ. Rev. 109, 2766–2794 (2019).

    Article  Google Scholar 

  26. Patil, P., Peng, R. D. & Leek, J. T. What should researchers expect when they replicate studies? A statistical view of replicability in psychological science. Perspect. Psychol. Sci. 11, 539–544 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  27. Valentine, J. C. et al. Replication in prevention science. Prev. Sci. 12, 103–117 (2011).

    Article  PubMed  Google Scholar 

  28. Abatayo, A. L. et al. Credibility assessments in the social and behavioral sciences. Preprint at MetaArXiv https://doi.org/10.31222/osf.io/7u58q_v1 (2025).

  29. Fanelli, D. “Positive” results increase down the hierarchy of the sciences. PLoS ONE 5, e10068 (2010).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  30. Fanelli, D. Negative results are disappearing from most disciplines and countries. Scientometrics 90, 891–904 (2012).

    Article  Google Scholar 

  31. Heyard, R. et al. A scoping review on metrics to quantify reproducibility: a multitude of questions leads to a multitude of metrics. R. Soc. Open Sci. 12, 242076 (2025).

  32. Muradchanian, J., Hoekstra, R., Kiers, H. & van Ravenzwaaij, D. How best to quantify replication success? A simulation study on the comparison of replication success metrics. R. Soc. Open Sci. 8, 201697 (2021).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  33. Peels, R. Replicability and replication in the humanities. Res. Integr. Peer Rev. 4, 2 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  34. Peels, R. & Bouter, L. The possibility and desirability of replication in the humanities. Palgrave Commun. 4, 95 (2018).

    Article  Google Scholar 

  35. TalkadSukumar, P. & Metoyer, R. Replication and transparency of qualitative research from a constructivist perspective. Preprint at OSF https://doi.org/10.31219/osf.io/6efvp (2019).

  36. Hubbard, D. W. & Carriquiry, A. L. Quality control for scientific research: addressing reproducibility, responsiveness, and relevance. Am. Stat. 73, 46–55 (2019).

    Article  MathSciNet  Google Scholar 

  37. Alipourfard, N. et al. Systematizing Confidence in Open Research and Evidence (SCORE). Preprint at SocArXiv https://doi.org/10.31235/osf.io/46mnb (2021).

  38. Miske, O. et al. Investigating the reproducibility of the social and behavioral sciences. Nature https://doi.org/10.1038/s41586-026-10203-5 (2025).

  39. Ebersole, C. R. et al. Many Labs 5: testing pre-data-collection peer review as an intervention to increase replicability. Adv. Methods Pract. Psychol. Sci. 3, 309–331 (2020).

    Article  Google Scholar 

  40. Held, L., Pawel, S. & Micheloud, C. The assessment of replicability using the sum of p-values. R. Soc. Open Sci. 11, 240149 (2024).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  41. Micheloud, C., Balabdaoui, F. & Held, L. Assessing replicability with the sceptical p-value: type-I error control and sample size planning. Stat. Neerlandica https://doi.org/10.1111/stan.12312 (2023).

  42. Gronau, Q. F., Heck, D. W., Berkhout, S. W., Haaf, J. M. & Wagenmakers, E.-J. A primer on Bayesian model-averaged meta-analysis. Adv. Methods Pract. Psychol. Sci. https://doi.org/10.1177/25152459211031256 (2021).

    Article  Google Scholar 

  43. Simonsohn, U. Small telescopes: detectability and the evaluation of replication results. Psychol. Sci. 26, 559–569 (2015).

    Article  PubMed  Google Scholar 

  44. Verhagen, J. & Wagenmakers, E.-J. Bayesian tests to quantify the result of a replication attempt. J. Exp. Psychol. Gen. 143, 1457–1475 (2014).

    Article  PubMed  Google Scholar 

  45. Ly, A., Etz, A., Marsman, M. & Wagenmakers, E.-J. Replication Bayes factors from evidence updating. Behav. Res. Methods 51, 2498–2508 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  46. Steiner, P. M., Sheehan, P. & Wong, V. C. Correspondence measures for assessing replication success. Psychol. Methods https://doi.org/10.1037/met0000597 (2023).

  47. Ben-Shachar, M., Lüdecke, D. & Makowski, D. effectsize: Estimation of effect size indices and standardized parameters. J. Open Source Softw. 5, 2815 (2020).

    Article  ADS  Google Scholar 

  48. Haber N. et al. Materials for “Investigating the replicability of the social and behavioral sciences”. OSF https://doi.org/10.17605/OSF.IO/G5SNY (2025).

Download references

Acknowledgements

This work was supported by the DARPA under cooperative agreements no. N660011924015 (to principal investigator B.A.N.) and HR00112020015 (to principal investigator T.M.E.). The views, opinions, findings and conclusions or recommendations expressed in this material are those of the authors and should not be interpreted as representing the official views or policies of the Department of Defense or the US Government. We thank B. Arendt, A. Denis, M. Dirzo, Z. Loomas, B. Luis, L. Markham, E. S. Parsons and A. Russell for their contributions to this project.

Author information

Authors and Affiliations

Authors

Contributions

A.H.T., S.F., M.K.S., F.A., W.J.C., N. Fiala, N. Fox, A.L.J.F., K.E.L., S.L., F.N., R.R., J. Samaha, S. Sankaran, A.N.S., D.T., D.W., B.A.N. and T.M.E. conceptualized the study. A.H.T., N.A.H. and J. MacDonald curated the data. A.H.T., N.A.H., M.K.S., C.L.A., M.A., N.A., P.J.A., M. Andreychik, E. Axxe, F.A., B. Bago, J. Bailey, M. Bakker, G.C.B., E.B., A. Batruch, A. Beatteay, Z.B., M. Bognar, C. Bokhove, R. Bouwman, T.F.B., G.B.J., T.B., R. Briker, G.D.A.B., R.C.M.v.A., K.C., T. Capitán, J. Chandler, T. Charles, C.R.C., K.J.C., W.J.C., V.E.C., G.C., T. Coupé, J. Cummins, A.C.-K., J.d.L., D.D., J.N.D., J.D., M. Dujmović, P.K.D., C.E., T.R.E., A.F., B.F.-C., N. Fiala, J.G.F., L.M.G., I.H.G., J. Gooty, S.M.G., N.G., B.H., P.H.P.H., E.E.H., S. Harper, M.J.H., N.C.H., S. Hippel, S. Hoeppner, S. Hong, T.J.H., B. Jaeger, K.J., J.J.-N., A.M.J., M. Juanchich, P.K., H.K., S.K., M.K., D.K., M.L., L.L., A.C.E.L., J.M.L.F., K.E.L., J.L., S.C.L., C.L., A.-C.L., J. MacDonald, P.M., D.J.M., A. McBride, C. McHugh, G. McMillan, E.M., M.P.M., L. Micheli, M. Milyavskaya, D.C.M., A.G.M., A. Morrow, C. Moya, L.B.M., K.A.M., A.N., H.H.N., F.N., P.L.L.N., J.Oh, A. Oshana, A. Oyebanjo, R. Panczak, J.P., Y.G.P., S.P., M.P., K.P., C.P., V.P., N.D.P., D.P., M.F.P., A.P., M. Ramljak, W.R.R., S.P.R., J.P.R., I.R., J. Rothschild, J. Samaha, S. Sankaran, D.S., A.C.S., K.S., A.N.S., B.A. Schuetze, M.S.-M., E.L.S., S. Shaki, S. Shakya, M. Sirota, M.R.S., M.M.S., L.R.S., L. Smalarz, J.S.S., N.S., F.S., C.N.H.S., M. Syed, A. Szabelska, E. Szumowska, A.T., S. Täuber, L. Tay, S. Thapa, L. Tummers, E.T., M.v.A., N.v.d.V., E.J.V., L.A.V., D.T.H.V., V.V., D.W., L. Walasek, N.W., A.L.W., J.W., J.R.W., K.W., R.W., J.N.W., V.X.Y., S.Y., I. Žeželj and Y.Z. conducted formal analysis. A. Abbasi, F.A., G.B.J., D.D., A.L.J.F., J. Gooty, J.P.R., J. Samaha, A.C.S., A.N.S., B.A.N. and T.M.E. acquired funding. A.H.T., A.L.A., M. Daley, S.F., N. Fox, K.M.H., M.K.S., B.M., O.M., C.K.S., A.A., B.A., M.A., N.A., P.J.A., M. Andreychik, E. Axxe, F.A., M.D.B., B. Bago, J. Bailey, G.B., G.C.B., E.B., A. Batruch, A. Beatteay, S.M.B., N.B., Z.B., J.B., B. Bodroža, M. Bognar, C. Bokhove, D.B., R. Bouwman, T.F.B., S.R.B., G.B.J., T.B., R. Briker, G.D.A.B., K.C., S.C., T. Capitán, J. Chandler, T. Charles, C.R.C., K.J.C., W.J.C., V.E.C., C.C.C., G.C., T. Coupé, J. Cummins, A.C.-K., J.d.L., D.D., J.N.D., J.D., M. Dujmović, C.E., T.R.E., N. Fiala, J.G.F., N. Fong, M.A.F., A.L.J.F., J.F., J. Geng, L.M.G., I.H.G., D.P.G., J. Gooty, C.G., S.M.G., L.G., M.G., B.H., P.H.P.H., E.E.H., S. Harper, M.J.H., L.H., N.C.H., S. Hippel, S. Hoeppner, S. Hong, T.J.H., B. Jaeger, K.J., J.J.-N., M. Jensen, B. Jokić, D.J., A.M.J., M. Juanchich, A. Keljanovic, S.K., M.K., U.K., M.L., M.J.L., L.L., A.C.E.L., J.M.L.F., K.E.L., S.L., J.L., S.C.L., C.L., H.L., A.-C.L., E.A.L., J. MacDonald, P.M., D.J.M., A. McBride, C. McHugh, G. McMillan, M. Metzger, M.P.M., J. Michalak, L. Micheli, M. Milyavskaya, D.C.M., A.G.M., D. Moreau, A. Morrow, L.B.M., K.A.M., K.N., C.N., G. Nave, H.H.N., F.N., G. Nilsonne, E.O., J. Oettinghaus, J. Oh, A. Oshana, T.O., R.P.O., J.P., I.P., Y.G.P., S.P., M.P., K.P., V.P., N.D.P., D.P., M.F.P., A.P., M. Ramljak, W.R.R., M. Ritchie, S.P.R., J.P.R., I.R., J. Rothschild, J. Saal, H.S., J. Samaha, M. Sanchez, S. Sankaran, A.C.S., K.S., L. Schnabel, A.N.S., S.W.S., B.A. Schuetze, A. Schütz, E.L.S., E. Shackleton, R.M.S., S. Shaki, S. Shakya, M. Sirota, L.R.S., L. Smalarz, C.T.S., J.S.S., N.S., F.S., G.S., M. Syed, A. Szabelska, B.S., E. Szumowska, A.T., S. Täuber, L. Tay, J.T., L. Tummers, E.T., M.V.T., K.U., N.v.d.V., R.v.d.G., E.J.V., L.A.V., J.M.V., D.T.H.V., V.V., D.W., L. Walasek, F. Walter, L. Warmelink, L. Wei, M.I.W., N.W., A.L.W., J.W., J.R.W., K.W., V.X.Y., Y.Y., S.Y., I. Žeželj, Y.Z. and T.M.E. performed the investigation. A.H.T., A.L.A., S.F., N. Fox, N.A.H., M.K.S., B.M., O.M., P.S., C.K.S., M.A., N.A., P.J.A., M. Andreychik, E. Axxe, F.A., M.D.B., B. Bago, J. Bailey, G.C.B., E.B., A. Batruch, A. Beatteay, S.M.B., Z.B., B. Bodroža, M. Bognar, C. Bokhove, R. Bouwman, T.F.B., G.B.J., T.B., R. Briker, G.D.A.B., K.C., S.C., T. Capitán, J. Chandler, T. Charles, C.R.C., W.J.C., G.C., T. Coupé, J. Cummins, A.C.-K., J.d.L., P.K.D., C.E., T.R.E., J.G.F., J. Geng, L.M.G., I.H.G., J. Gooty, C.G., S.M.G., L.G., B.H., P.H.P.H., S. Harper, M.J.H., L.H., N.C.H., T.J.H. K.J., J.J.-N., B. Jokić, D.J., P.J., A.M.J., M. Juanchich, S.K., L.L., J.M.L.F., K.E.L., S.C.L., C.L., H.L., A.-C.L., E.A.L., J. MacDonald, D.J.M., C. McHugh, G. McMillan, M.P.M., L. Micheli, M. Milyavskaya, D.C.M., C. Moya, L.B.M., A.N., A.L.N., J.Oh, R.P.O., J.P., I.P., Y.G.P., S.P., M.P., K.P., V.P., N.D.P., D.P., M. Ramljak, M. Ritzau, S.P.R., R.R., J.P.R., I.R., J. Saal, J. Samaha, M. Sanchez, S. Sankaran, K.S., L. Schnabel, A.N.S., B.A. Schuetze, M.S.-M., E.L.S., R.M.S., S. Shaki, S. Shakya, M. Sirota, L.R.S., L. Smalarz, J.S.S., N.S., F.S., M. Syed, A. Szabelska, E. Szumowska, A.T., S. Täuber, L. Tay, D.T., K.U., M.v.A., N.v.d.V., E.J.V., L.A.V., S.V., D.T.H.V., D.W., L. Walasek, A.L.W., J.W., J.R.W., K.W., F. Wort, V.X.Y., S.Y., I. Žeželj, I. Ziano, B.A.N. and T.M.E. formulated the methodology. A.H.T., O.M., A.C.E.L., B.A.N. and T.M.E. provided project admin. A.H.T., N.A.H., A.L.A., M.K.S., T. Stankov, N.A., M. Andreychik, E. Axxe, F.A., M.D.B., B. Bago, J. Bailey, E.B., A. Batruch, A. Beatteay, S.M.B., Z.B., M. Bognar, C. Bokhove, R. Bouwman, T.F.B., R. Briker, R.C.M.v.A., K.C., T. Capitán, J. Chandler, T. Charles, C.R.C., K.J.C., V.E.C., G.C., T. Coupé, J. Cummins, J.d.L., P.K.D., T.R.E., B.F.-C., J.G.F., J.Geng, I.H.G., S.M.G., P.H.P.H., S. Harper, M.J.H., N.C.H., S. Hippel, S. Hoeppner, T.J.H., K.J., J.J.-N., D.J., A.M.J., M. Juanchich, P.K., S.K., L.L., S.C.L., C.L., H.L., A.-C.L., J. MacDonald, D.J.M., A. McBride, C. McHugh, L. Micheli, D.C.M., A. Morrow, C. Moya, A. Oshana, R.P.O., J.P., Y.G.P., K.P., C.P., V.P., N.D.P., M.F.P., A.P., M. Ramljak, S.P.R., J.P.R., I.R., J. Samaha, M. Sanchez, S. Sankaran, D.S., K.S., L. Schnabel, B.A. Schuetze, E.L.S., S. Shaki, S. Shakya, M. Sirota, M.R.S., L.R.S., J.S.S., N.S., F.S., C.N.H.S., M. Syed, A. Szabelska, E. Szumowska, A.T., L.Tay, E.J.V., L.A.V., D.T.H.V., V.V., D.W., L. Walasek, M.I.W., A.L.W., J.W., J.N.W., V.X.Y., S.Y. and Y.Z. provided software. A.H.T., K.M.H., M.K.S., O.M., C.K.S., F.A., G.C.B., E.B., A.N.B., R.C.M.v.A., T. Capitán, C.R.C., W.J.C., K.M.E., J. Gooty, A.G.-K., L.G., E.E.H., M.I., K.E.L., J.W.L., M. Milyavskaya, A. Morrow, G. Nilsonne, V.P., N.D.P., K.A.Q., W.R.R., R.R., J.P.R., J. Samaha, A.N.S., A. Schütz, E.L.S., S. Shaki, C.T.S., J.S.S., B.A. Spellman, D.T., E.-J.W., A.L.W., C.Z., B.A.N. and T.M.E. provided supervision. A.H.T., A.L.A., N.A.H., O.M., P.S., N.A., E. Awtrey, F.A., J. Bailey, G.C.B., E.B., J.B., L.B., S.R.B., G.B.J., C. Brick, A.N.B., T. Capitán, R.C., W.J.C., J. Cummins, M. Dujmović, D.J.D., K.M.E., B.F.-C., N. Fiala, S.J.G., L.M. Geven, I.H.G., A.G.-K., K.I., B. Jokić, P.J., H.K., K.E.L., C.L., J.W.L., J. MacDonald, D.J.M., D. Marinazzo, C.S.M., J. Matacotta, J.K.M., D. Moreau, A. Morrow, L. Mudrik, H.H.N., A.L.N., G. Nilsonne, J.Oh, A. Oshana, N.D.P., J.M.P., M.F.P., K.A.Q., J.P.R., H.S., S. Sankaran, D.S., M. Sauter, K.S., L. Schnabel, A.N.S., E.L.S., R.M.S., S. Shaki, S. Shakya, L. Smalarz, B.A. Spellman, N.S.-B., E. Strømland, T. Sundelin, A. Szabelska, L. Tummers, K.U., A.E.v.‘t.V., N.v.d.V., E.J.V., V.V., E.-J.W., D.W., A.L.W., J.W., X.X., I. Ziano, C.Z., Z.Z., R.A.Z. and T.M.E. performed validation. A.H.T., N.A.H., B.M., F.A., G.B.J., M.Dujmović, N.C.H., J.MacDonald, J. Matacotta, A. Morrow, A. Oshana, M. Ramljak, J. Samaha, S. Sankaran, S. Shaki, D.W., A.L.W. and B.A.N. conducted visualization. A.H.T., B.M., O.M., F.A., G.B.J., W.J.C., B.C., C.C.C., A.F., B.F.-C., N. Fiala, S.K., K.E.L., S.L., J. MacDonald, J. Matacotta, A. Morrow, C. Moya, H.H.N., F.N., J. Oh, A. Oshana, M.F.P., M. Ritchie, R.R., D.S., A.C.S., A.N.S., S. Shaki, S. Shakya, N.S., S. Täuber, E.J.V., V.V., D.W., N.W., B.A.N. and T.M.E. wrote the original draft of the manuscript. A.H.T, M. Daley, N.A.H., B.M., O.M., P.S., M.A., N.A., F.A., M. Bakker, G.B., G.C.B., B. Bodroža, G.B.J., C. Brick, R. Briker, R.C.M.v.A., J. Chandler, C.R.C., W.J.C., B.C., C.C.C., A.C.-K., M. Dujmović, B.F.-C., N. Fiala, J.G.F., M.A.F., S.J.G., D.P.G., J. Gooty, A.G.-K., C.G., L.G., P.H.P.H., N.C.H., K.J., J.J.-N., B. Jokić, P.K., J.M.L.F., K.E.L., S.L., S.C.L., D.L., J. MacDonald, J. Matacotta, J.K.M., M. Milyavskaya, D.C.M., A. Morrow, C. Moya, H.H.N., F.N., J.Oh, T.O., R. Panczak, Y.G.P., M.F.P., K.A.Q., M. Ritchie, R.R., J.P.R., I.R., J. Samaha, S. Sankaran, D.S., L. Schnabel, A.N.S., B.A. Schuetze, A. Schütz, E.L.S., S. Shakya, M. Sirota, M.M.S., C.T.S., J.S.S., N.S., N.S.-B., C.N.H.S., E. Strømland, T. Sundelin, B.S., S. Täuber, D.T., N.v.d.V., E.J.V., J.M.V., V.V., D.W., N.W., A.L.W., B.A.N. and T.M.E. reviewed and edited the manuscript.

Corresponding author

Correspondence to Brian A. Nosek.

Ethics declarations

Competing interests

A.H.T., M.D., N.H., K.H., O.M., T. Stankov, B.A.N. and T.M.E. are employees of the non-profit organization Center for Open Science, which has a mission to increase openness, integrity and trustworthiness of research.

Peer review

Peer review information

Nature thanks Antica Culina, Malgorzata Lagisz and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Proportion of papers with a completed replication by discipline.

Proportion of papers by discipline for which a replication attempt was finished (purple), never attempted (blue), or for which a replication team was sourced but the replication study was not started or completed (other colors). OSF = Open Science Framework. This is presented as Supplementary Fig. 7 with additional narrative context in the Supplementary Information.

Extended Data Fig. 2 Proportion of papers with a completed replication by year.

Proportion of papers by publication year for which a replication attempt was finished (purple), never attempted (blue), or for which a replication team was sourced but the replication study was not started or completed (other colors). OSF = Open Science Framework. This is presented as Supplementary Fig. 8 with additional narrative context in the Supplementary Information.

Extended Data Fig. 3 Proportion of papers with a completed replication by journal.

Proportion of papers by journal for which a replication attempt was finished (purple), never attempted (blue), or for which a replication team was sourced but the replication study was not started or completed (other colors). Sample sizes per journal ranged from 5 to 10. This is presented as Supplementary Fig. 9 with additional narrative context in the Supplementary Information.

Extended Data Fig. 4 Retrospective review of papers that were not matched to replication teams to conduct a new data replication by discipline.

Y-axis indicates the proportion of available papers per discipline sample. “Plausible” means that there were no clear barriers to conducting a replication other than capacity within the project. “Secondary data” means that these papers were more appropriate for a secondary data replication. This is presented as Supplementary Fig. 10 with additional narrative context in the Supplementary Information.

Extended Data Fig. 5 Retrospective review of papers that were not matched to replication teams to conduct a secondary data replication by discipline.

Y-axis indicates the proportion of available papers per discipline sample. “Plausible” means that there were no clear barriers to conducting a replication other than capacity within the project. “Primary data” means that these papers were more appropriate for new data replications. Admin. = Administrative. This is presented as Supplementary Fig. 11 with additional narrative context in the Supplementary Information.

Extended Data Fig. 6 Correlation matrix among binary assessments of replication success across claims.

Correlation values are right of the diagonal, and correlation magnitude is visualized left of the diagonal with darker shading indicating stronger correlations. CI = confidence interval. This is presented as Supplementary Fig. 12 with additional narrative context in the Supplementary Information.

Extended Data Fig. 7 Replication success or failure for 13 binary assessments by the effect size difference between the replication and original studies.

Data points are differences in effect sizes for individual claims. Data points on top of each graph are successful replications, and data points on the bottom are failed replications, according to the graph’s metric. This is presented as Supplementary Fig. 13 with additional narrative context in the Supplementary Information.

Extended Data Fig. 8 Replication success rates across 13 binary assessments for claims.

The vertical white line for each row is the estimate, and the 95% confidence interval around the estimate is represented by the dark bar. CI = confidence interval. This is presented as Supplementary Fig. 14 with additional narrative context in the Supplementary Information.

Extended Data Fig. 9 Percentage of replicated papers that were automatically identified as using each method or technique.

Two LLMs (GPT-4.1 and Kimi K2) identified the range of methods or techniques used across all abstracts (prompt: “What statistical techniques or analytic approaches are used?”). They then coded each abstract for the presence (1) or absence (0) of each—a method/technique is considered present if at least one of the models identified it as being present. Error bars = 95% confidence intervals. This is presented as Supplementary Fig. 15 with additional narrative context in the Supplementary Information.

Extended Data Fig. 10 Percentage of replicated papers that were automatically identified as citing each theoretical framework or paradigm.

Two LLMs (GPT-4.1 and Kimi K2) identified the range of frameworks or paradigms used across all abstracts (prompt: “What are the main theoretical frameworks/paradigms being cited?”). They then coded each abstract for the presence (1) or absence (0) of each—a framework/paradigm is considered present if at least one of the models identified it as being present. Error bars = 95% confidence intervals. This is presented as Supplementary Fig. 16 with additional narrative context in the Supplementary Information.

Supplementary information

Supplementary Information (download PDF )

Supplementary Methods and Results including Supplementary Figs. 1–16, Supplementary Tables 1–20 and Supplementary References – see Contents for details.

Reporting Summary (download PDF )

Supplementary Data (download ZIP )

Source data and code package.

Peer Review File (download PDF )

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tyner, A.H., Abatayo, A.L., Daley, M. et al. Investigating the replicability of the social and behavioural sciences. Nature 652, 143–150 (2026). https://doi.org/10.1038/s41586-025-10078-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41586-025-10078-y

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing