Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
Whole-genome resequencing and genetic diversity of five indigenous cattle breeds from China
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 21 January 2026

Whole-genome resequencing and genetic diversity of five indigenous cattle breeds from China

  • Wei Wang1,
  • Linxiang Li2,
  • Ying Chen1,
  • Xiaoqin Ma1,
  • Yueda Aguo1,
  • Jia Gan1,
  • Donghui Fang1,
  • Xiaodong Deng1,
  • Xiaoyun Chen1,
  • Fang He1,
  • Yi Shi1,
  • Changfeng Wu2,
  • Zhixin Yi2,
  • Yihui Chen2,
  • Maozhong Fu1 &
  • …
  • Jun Yi1 

Scientific Data , Article number:  (2026) Cite this article

  • 659 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Animal breeding
  • Structural variation

Abstract

China’s abundant indigenous yellow cattle resources are of great significance for studying environmental adaptability evolution, genetic resource conservation, and breeding improvement. The majority of the cattle population consists of indigenous breeds. Understanding the genetic architecture of these cattle breeds is essential for effective management and conservation efforts. In this study, we collected DNA samples from five local cattle breeds (n = 56) and obtained whole-genome sequencing (WGS) data for 10 Jinchuan (JC) yak samples from the NCBI database as the outgroup. Whole-genome resequencing generated approximately 2.3 TB of paired-end data, achieving an average depth of 13X and a depth range of 9.75X to 39.03X across the 66 samples. The sequencing data were pre-processed and mapped to the cattle reference genome (ARS-UCD1.2) with an alignment rate of 99.5%. Finally, the variant calling process produced approximately 31 million high-quality SNPs. These data enhance our understanding of cattle genetic architecture, enabling the discovery of functional variants and evolutionary insights to inform breeding strategies for climate-resilient and sustainable cattle production.

Similar content being viewed by others

Whole genome sequences of 70 indigenous Ethiopian cattle

Article Open access 05 June 2024

Whole-Genome Sequencing of 54 Dengchuan Cattle (Bos taurus) from Southwest China

Article Open access 01 July 2025

Whole genome sequences of 289 native cattle from Finland, the Netherlands, and Portugal

Article Open access 03 December 2025

Data availability

The raw sequencing data generated in this study have been deposited in both the NCBI Sequence Read Archive under BioProject accession PRJNA1369724 (SRA: SRP655464)21 and the China National GeneBank DataBase (CNGBdb) under accession number CNP000755222. The final variant sets (including SNPs, InDels, SVs, and CNVs) are available in the Figshare repository23.

Code availability

Data analyses were primarily performed using standard bioinformatics tools within a Linux operating system environment. Detailed information regarding software versions and parameter settings is available at: https://github.com/triple-y/WGS-Chinese-yellow-cattle.

References

  1. Lyu, Y. et al. Recent selection and introgression facilitated high-altitude adaptatio n in cattle. Sci Bull (Beijing) 69, 3415–3424, https://doi.org/10.1016/j.scib.2024.05.030.

  2. Zhang, G. X. et al. Genetic diversity and population structure of indigenous yellow cattle breeds of China using 30 microsatellite markers. Anim Genet 38, 550–559, https://doi.org/10.1111/j.1365-2052.2007.01644.x.

  3. Friedrich, J. & Wiener, P. Selection signatures for high-altitude adaptation in ruminants. Anim Genet 51, 157-165, https://doi.org/10.1111/age.12900.

  4. Gong, Y., Li, Y., Liu, X., Ma, Y. & Jiang, L. A review of the pangenome: how it affects our understanding of genomic variation, selection and breeding in domestic animals? J Anim Sci Biotechnol 14, 73, https://doi.org/10.1186/s40104-023-00860-1.

  5. Zhang, Y.-W., Wu, S.-X., Wang, G.-W., Wan, R.-D. & Yang, Q.-E. Single-cell analysis identifies critical regulators of spermatogonial development and differentiation in cattle-yak bulls. J Dairy Sci 107, 7317–7336, https://doi.org/10.3168/jds.2023-24442.

  6. Liu, Y. et al. Discovery of Genomic Characteristics and Selection Signatures in Southern Chinese Local Cattle. Front Genet 11, 533052, https://doi.org/10.3389/fgene.2020.533052.

  7. Zhang, W. et al. Genome-wide assessment of genetic diversity and population structure i nsights into admixture and introgression in Chinese indigenous cattle. BMC Genet 19, 114, https://doi.org/10.1186/s12863-018-0705-9.

  8. Moore, A. J. & Kukuk, P. F. Quantitative genetic analysis of natural populations. Nat Rev Genet 3, 971–978, https://doi.org/10.1038/nrg951.

  9. Li, Z. et al. Impacts of SNP genotyping call rate and SNP genotyping error rate on i mputation accuracy inHolsteincattle. Yi Chuan 41, 644-652, https://doi.org/10.16288/j.yczz.18-319.

  10. Hulsegge, I., Oldenbroek, K., Bouwman, A., Veerkamp, R. & Windig, J. Selection and Drift: A Comparison between Historic and Recent Dutch Fr iesian Cattle and Recent Holstein Friesian Using WGS Data. Animals (Basel) 12, 329, https://doi.org/10.3390/ani12030329.

  11. Xia, X. et al. Assessing genomic diversity and signatures of selection in Jiaxian Red cattle using whole-genome sequencing data. BMC Genomics 22, 43, https://doi.org/10.1186/s12864-020-07340-0.

  12. Yang, B. et al. Genetic Diversity Estimation and Genome-Wide Selective Sweep Analysis of the Bazhou Yak. Animals (Basel) 15, 849, https://doi.org/10.3390/ani15060849.

  13. Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884-i890, https://doi.org/10.1093/bioinformatics/bty560.

  14. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (Oxford, England) 25, 1754-1760, https://doi.org/10.1093/bioinformatics/btp324.

  15. Pham, M., Tu, Y. & Lv, X. Accelerating BWA-MEM Read Mapping on GPUs. ICS 2023, 155-166, https://doi.org/10.1145/3577193.3593703.

  16. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078-2079, https://doi.org/10.1093/bioinformatics/btp352.

  17. Grant, J. R. et al. A large structural variant collection in Holstein cattle and associate d database for variant discovery, characterization, and application. BMC Genomics 25, 903, https://doi.org/10.1186/s12864-024-10812-2.

  18. Abyzov, A., Urban, A. E., Snyder, M. & Gerstein, M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21, 974–984, https://doi.org/10.1101/gr.114876.110.

  19. McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next- generation DNA sequencing data. Genome Res 20, 1297-1303, https://doi.org/10.1101/gr.107524.110.

  20. Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughpu t sequencing data. Nucleic acids research 38, e164, https://doi.org/10.1093/nar/gkq603.

  21. NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP655464 (2025).

  22. Wenjie, Han. Population Genetic Analysis of Chinese indigenous cattle. China National GeneBank Database https://doi.org/10.26036/CNP0007552 (2025).

  23. Wang, W. Population Genetic Analysis of Chinese indigenous cattle. figshare https://doi.org/10.6084/m9.figshare.30759347 (2025).

Download references

Acknowledgements

This work was supported by the Sichuan Province Science and Technology Planning Project (2021YFYZ0001), the “5 + 1” Special Project for Breakthroughs in Cutting-Edge Agricultural Technologies (5 + 1QYGG003), the Sichuan Beef Cattle Innovation Team Project (SCCXTD-2025-13), the Sichuan Fiscal Operations Special Program (SASA2025CZYX003), the Basic Research Projects of Scientific Research Institutes (SASA202505), and the Sichuan Province Science and Technology Support Project (2024ZYD0283).

Author information

Authors and Affiliations

  1. Animal Genetic Breeding and Reproduction Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu, Sichuan, 610066, China

    Wei Wang, Ying Chen, Xiaoqin Ma, Yueda Aguo, Jia Gan, Donghui Fang, Xiaodong Deng, Xiaoyun Chen, Fang He, Yi Shi, Maozhong Fu & Jun Yi

  2. Bazhong Academy of Agriculture and Forestry Sciences, Bazhong, Sichuan, 636000, China

    Linxiang Li, Changfeng Wu, Zhixin Yi & Yihui Chen

Authors
  1. Wei Wang
    View author publications

    Search author on:PubMed Google Scholar

  2. Linxiang Li
    View author publications

    Search author on:PubMed Google Scholar

  3. Ying Chen
    View author publications

    Search author on:PubMed Google Scholar

  4. Xiaoqin Ma
    View author publications

    Search author on:PubMed Google Scholar

  5. Yueda Aguo
    View author publications

    Search author on:PubMed Google Scholar

  6. Jia Gan
    View author publications

    Search author on:PubMed Google Scholar

  7. Donghui Fang
    View author publications

    Search author on:PubMed Google Scholar

  8. Xiaodong Deng
    View author publications

    Search author on:PubMed Google Scholar

  9. Xiaoyun Chen
    View author publications

    Search author on:PubMed Google Scholar

  10. Fang He
    View author publications

    Search author on:PubMed Google Scholar

  11. Yi Shi
    View author publications

    Search author on:PubMed Google Scholar

  12. Changfeng Wu
    View author publications

    Search author on:PubMed Google Scholar

  13. Zhixin Yi
    View author publications

    Search author on:PubMed Google Scholar

  14. Yihui Chen
    View author publications

    Search author on:PubMed Google Scholar

  15. Maozhong Fu
    View author publications

    Search author on:PubMed Google Scholar

  16. Jun Yi
    View author publications

    Search author on:PubMed Google Scholar

Contributions

W.W. and J.Y. conceived and designed the study. W.W. were involved in the review and writing process, L.L., Y.C., X.M., Y.A., J.G., D.F., X.D., X.C., F.H., Y.S., C.W., Z.Y., M.F. performed experiments and analyzed data. W.W. and J.Y. supervised the project and acquired funding. All authors made critical contributions to the manuscript drafts.

Corresponding authors

Correspondence to Wei Wang or Jun Yi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, W., Li, L., Chen, Y. et al. Whole-genome resequencing and genetic diversity of five indigenous cattle breeds from China. Sci Data (2026). https://doi.org/10.1038/s41597-026-06610-y

Download citation

  • Received: 12 September 2025

  • Accepted: 09 January 2026

  • Published: 21 January 2026

  • DOI: https://doi.org/10.1038/s41597-026-06610-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing