A chromosomal level genome assembly of Nguni Sheep, Ovis aries

Nesengani, Lucky Tendani; Tshilate, Thendo; Mdyogolo, Sinebongo; Smith, Rae; Masebe, Tracy; Raphulu, Thomas; Moila, Acclaim; Sharaf, Abdoallah; Muzemil, Sadik; Houaga, Isidore; Muigai, Anne; Djikeng, A.; Kuja, Josiah; Katee, Sally; Osuji, Julian O.; Mungloo-Dilmohamud, Zahra; Botes, Marietjie; Mbizeni, Skhumbuzo; Jarvis, Erich; Ebenezer, ThankGod Echezona; Ntanganedzeni, Mapholi

doi:10.1038/s41597-025-05514-7

Download PDF

Data Descriptor
Open access
Published: 10 July 2025

A chromosomal level genome assembly of Nguni Sheep, Ovis aries

Lucky Tendani Nesengani ORCID: orcid.org/0000-0003-2678-8425¹,
Thendo Tshilate¹,
Sinebongo Mdyogolo¹,
Rae Smith ORCID: orcid.org/0000-0002-1379-3949¹,
Tracy Masebe ORCID: orcid.org/0000-0002-1300-9077¹,
Thomas Raphulu²,
Acclaim Moila³,
Abdoallah Sharaf ORCID: orcid.org/0000-0002-3436-9290^4,5,
Sadik Muzemil⁶,
Isidore Houaga^1,7,
Anne Muigai^8,9,10,
A. Djikeng^1,11,
Josiah Kuja¹²,
Sally Katee¹¹,
Julian O. Osuji¹³,
Zahra Mungloo-Dilmohamud¹⁴,
Marietjie Botes¹⁵,
Skhumbuzo Mbizeni¹,
Erich Jarvis¹⁰,
ThankGod Echezona Ebenezer¹⁶ &
…
Mapholi Ntanganedzeni ORCID: orcid.org/0000-0003-4507-6669¹

Scientific Data volume 12, Article number: 1193 (2025) Cite this article

4479 Accesses
1 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Nguni sheep (Ovis aries) are indigenous to the Southern Africa region and common within the smallholder and poor resources farming systems. They are well adapted to different agroecological regions. However, limited genomic resources such as high-quality reference genomes have hindered our understanding of its adaptation and establishment of an effective breeding program. To address this, we assembled a chromosomal-level genome of Nguni sheep using a combination of PacBio HiFi reads and Omni-C reads. The genome size was estimated to be 2.9 Gb with a contig/scaffold N50 74 Mb and 99.6 Mb and a genome completeness of 96.1%, as estimated by the Benchmarking Universal Single-Copy Orthologs (BUSCO) program. The final genome encompassed a total of 25,926 protein-coding genes. The findings of this study provide a valuable genomic resource for understanding the adaptability of the Nguni sheep and the establishment of effective breeding programs.

A high-quality chromosome level genome assembly of the South African indigenous Nguni goat (Capra hircus)

Article Open access 31 January 2026

Chromosome-level genome assembly of Guide Black-Fur sheep (Ovis aries)

Article Open access 29 June 2024

A near complete genome assembly of the East Friesian sheep genome

Article Open access 11 July 2024

Background & Summary

The Nguni sheep are indigenous to Southern Africa and comprise four ecotypes, namely Zulu, Pedi, Swazi, and Landim sheep^1,2. Their origin can be traced to around 200 and 400 AD during the migration period of the Nguni people from central Africa to the Southern part of the continent^1,3,4,5. The Nguni sheep are hardy, can walk long distances in search of forage and water and are naturally resistant to gastro-intestinal parasites and tick-borne diseases that are endemic in the region^6,7. They therefore rarely fall sick or need vaccination⁴⁶. They also have a strong foraging instinct making them well adapted to the harsh climate that is usually hot and humid⁷. Due to these traits, the Nguni sheep are therefore low maintenance, making them highly attractive to local smallholder and emerging farmers. Throughout Africa, sheep are important and valuable assets that provide meat, milk, wool, hide, and manure^8,9,46. Sheep also play a critical cultural role in many communities by enhancing cultural practices and ceremonies due to the phenotypic appearance of multi-colored coat colors and patterns^{7,10,11,12,13}. However, in recent times, the introduction of larger-bodied exotic sheep breeds has resulted in the Nguni sheep becoming a vulnerable species largely due to farmers’ preference for the larger-bodied exotic breeds¹⁴. The Nguni sheep have small bodies when compared to the exotic breeds, giving the impression that they are a low-performance breed¹⁵. This perception resulted in indiscriminate crossbreeding practices that aimed to improve their so-called inferior traits, proving to be a great threat to indigenous Nguni sheep. This has prompted the establishment of conservation programs to save these unique and irreplaceable genetic resources^14,16.

The Nguni sheep, like many African indigenous genetic resources¹⁷, does not have a high-quality reference genome. Having a high-quality reference genome that is accessible to researchers is an essential genomic tool to establish efficient breeding programs and conservation strategies to guard against the extinction of indigenous genetic resources^18,19. There are few genetic studies conducted on Nguni sheep. These studies assessed the genetic structure of Nguni sheep using microsatellite markers and Ovine 50 K chip and found that the Nguni sheep, despite the different ecotypes, the genetic admixtures were detected with Damara and Dorper sheep^2,3. These findings are one of the pieces of evidence that we need to conserve the genetics of the Nguni sheep. Selepe et al.¹⁴ also emphasized that the Nguni sheep is currently at a high risk of extinction as they reported the admixture of Nguni sheep with exotic breeds. Therefore, this breed has been reported as one of the important breeds within the South African government conservation program to avoid extinction^20,21,22.

Concurrently, in recent years, climate change has shown the need to have breeds that are well-adapted to changing climatic environments²³. In this context, the development of effective breeding programs to improve production performance and efforts aimed at the conservation of indigenous breeds needs urgent attention. Such efforts require establishing high-quality reference genomes¹⁹. To understand the organism’s genetic architecture requires a comprehensive evaluation of the genome and its function. The generation of an error-free, near-gapless reference genome is a step closer to realise such understanding. The majority of the indigenous species constitute a valuable genetic makeup that is under-studied. This is true in the African indigenous sheep. Indigenous African sheep genetic resources play a crucial role in the provision of food security and socio-economic components. The available sheep reference genome²⁴ may not fully explain the unique genome characteristics of the African indigenous breeds such as the Nguni sheep. This may be due to factors such as genetic diversity, evolutionary history, and the adaptation to environmental factors such as different climates in which the animals are exposed. Understanding the genetics of Nguni sheep is important to safeguard their genetic diversity, improve their production, and enhance their resilience to climate change and conservation. The African BioGenome Project (AfricaBP) is a coordinated Pan-African effort to sequence the genomes of 100,000 endemic and indigenous African species¹⁷. As part of AfricaBP’s initiatives, we have produced a near error-free chromosome-scale complete genome of the indigenous Southern African Nguni sheep (Ovis aries). Here, a high-quality reference genome of a female Nguni sheep was generated using Pacific Biosciences (PacBio) HiFi and Dovetail genomics Omni-C technologies.

Methods

Ethics statement

The sample collection procedure including processing and handling of the animals used in this study was approved by the University of South Africa ethics committee (reference number: AREC-100818-024), Limpopo department of Agriculture and the Department of Agriculture Land Reform and Rural Development (DALRRD) under section 20 of the Animal Diseases Act 1984 (Act 35 of 84) (ref no 12/11/1/1/23 (6508 AC). Furthermore, the ethics procedure was guided by the AfricaBP policy on ethics which emphasises ethical, legal, and social issues throughout the research activities. Finally, this work benefited from compliance consultations with the Department of Forestry, Fisheries and Environment (DFFE) as the Competent National Authority for biodiversity framework and digital sequence information in South Africa. Given that this work is solely for academic purposes with no commercialisation intentions either now or in the future, and the samples were sourced and the genome sequenced in South Africa, a bioprospecting permit is not required from DFFE.

Sample collection and sequencing

In this study, a pure breed of ewe Nguni sheep (Fig. 1A) was selected from Mara research station (Fig. 1B) herd for sampling in Limpopo province of South Africa (23° 05′S, 29° 25′E). Blood was collected from the jugular vein in EDTA tubes by the veterinarian and placed on dry ice, immediately transported to Inqaba Biotech laboratories in Pretoria, South Africa, and stored at −80 °C freezer until further processing. High molecular weight genomic DNA was extracted from 200 ul blood using Nanobind protocol for whole blood high molecular weight (HMW) DNA extraction²⁵ to construct a sequencing library. The protocol was optimised for extraction from 200 ul of whole blood. Library sequencing on the PacBio Sequel IIe platform was done using SMRTbell® prep kit 3.0²⁶ following the manufacturer’s instructions. Dovetail Omni-C library prep was performed from the same sample used for HiFi sequencing following the manufacturer’s instructions. The resulting library was sequenced on NovaSeq 6000 instrument (Illumina). The total HiFi output data was 99 Gb at a coverage of 30 × while the output for Omni-C was 300 million read pairs.

Genome assembly

The genome assembly was conducted through a series of steps (Fig. 2) using the Vertebrate Genome Project (VGP) pipeline for genome assembly²⁷. Firstly, the sequenced PacBio HiFi reads were profiled to assess genome, repeat contents and heterozygosity rate. We employed Cutadapt²⁸ on the HiFi reads to remove adaptors sequences then used meryl to generate k-mers (k = 21) count distribution and GenomeScope²⁹ to estimate genome for profiling the genome. This process was carried out using PacBio HiFi reads. Subsequent to this, the genome assembly was conducted using Hifiasm³⁰ in Hi-C mode, incorporating HiFi and Omni-C reads. This generates two haplotypes (hap 1 and hap 2) which are essentially equivalent. The assembly was further subjected to scaffolding with YaHS³¹ then ran the decontamination using dual decontamination preparation workflow which make use of Kraken2³² as shown in Fig. 3 below. The dual decontamination workflow generates scaffolds versus Omni-C reads contact map with both haplotypes for curations. PretextMap was used to manually curate the scaffolds and by orienting and correcting mis-assemblies using the genomic proximity signal of the Omni-C reads alignment against scaffolds. The genome assembly was submitted to the National Center for Biotechnology Information (NCBI) for validation purposes, to confirm that it had been thoroughly cleaned of any extraneous taxa sequences. The submitted genome was subsequently employed for downstream analyses. The final genome size of Nguni sheep is 2.9 Gb, with a heterozygosity rate of 0.236% (Fig. 3A). The genome was sequenced at a coverage of 30X. The quality of the assembly was further assessed using Merqury completeness, which was 99.6% for both assemblies (Table 1). The genome completeness was further assessed using BUSCO with a dataset lineage of cetartiodactyla_odb10, which had 96.1% complete genes, 0.8% fragmented, and 3.1% missing genes respectively, totalling 13,335 (Fig. 3B).

Table 1 Assembly quality results of merqury completeness.

Full size table

Genome annotation

The preliminary phase in genome annotation entails the identification and masking of repeat elements in the Nguni sheep genome. This process utilizes RepeatMasker³³, a software tool that facilitates the identification and masking of repetitive elements within genomic sequences. The characterization of Nguni_Sheep-specific repeats was facilitated through the implementation of repeat modelling with RepeatModeler v0.9, a software that incorporates tools such as RepeatScout v1.0.6³⁴, RECON v1.5.0³⁵, and TRF v4.09³⁶. The identified repeat elements were then subjected to annotation using the “one_code_to_find_them_all” Perl script³⁷.

Protein coding genes for repeat masked assemblies were predicted using TIBERius v1.1.4³⁸. The resulting predicted protein coding genes were subsequently used to build orthologous gene sets among the selected sheep breeds, Hu (GCA_040805955.1)³⁹, Tibetan (GCA_017524585.1)⁴⁰, and Rambouillet (GCF_016772045.2)²⁴ using Orthofinder⁴¹, which clusters shared orthologous genes.

The assembly results are reported in Table 2, showed a scaffold N50 above 99.6 Mb and a contig N50 of 74 Mb. Our assembly showed a comparable contiguity with the latest sheep reference genomes, which include Ramboiuillet sheep genome from the International Sheep Genomics Consortium²⁴, as well as the Hu³⁹ and Tibetan⁴⁰ sheep genomes (Table 2). The variation among these assemblies is attributable to the sequencing coverage and the sequencing technology employed which likely influence the completeness and accuracy of the genome⁴². Additionally, the results provide the first comprehensive genome information for Nguni sheep, highlighting the genomic differences that may underpin phenotypic variations among breeds. Subsequent to scaffolding, the Nguni sheep genome assembly underwent a decontamination process to eliminate contaminants and further refine the genome. This refinement ensures the reliability of the assembly and enhances its utility as a resource for genomic and phenotypic studies.

Table 2 Assembly stats for Nguni sheep as compared to the assembly stats for Hu, Tibetan, and Rambouillet Sheep.

Full size table

Genome annotation of nguni sheep

The repeat annotation analysis of the Nguni genome revealed the presence of approximately 1435.61 Mb of repetitive sequences, constituting 47.39% of the total sequence length. The major categories of repetitive elements identified in the Nguni genome are summarized in Fig. 4, while a distribution of repetitive elements across the sheep genomes are presented in Table 3. The genome annotation parameters are reported in Table 4. Among the repetitive elements, 91.73% were annotated as known repeat motifs, while 8.27% were classified as uncharacterized elements unique to the Nguni Sheep. The most prevalent class of known repeats was the long interspersed nuclear elements (LINEs), which constituted 19.02% of the genome. While the number of repeat elements can influence genome size and certain levels of duplication within the genome may play a significant role in shaping the diverse gene repertoires in the organisms⁴³, this study has yet to determine the extent to which these genomic factors impact gene composition among sheep breeds.

Table 3 Summary of repeat elements identified in the genome of Nguni sheep.

Full size table

Table 4 Genome annotation parameters of the Nguni sheep.

Full size table

The genome was annotated using a deep learning-based ab initio gene structure prediction tool, which generated gene predictions based on inherent structural evidence. This resulted in 25,926 predicted protein-coding genes. The BUSCO analysis, based on the cetartiodactyla_odb10 lineage dataset showed that 89.7% of the BUSCOs were complete, with 88.1% being single-copy and 1.6% duplicated. The annotation process also identified 1.2% fragmented BUSCOs and 9.1% missing BUSCOs, resulting in a total of 13,335 cetartiodactyla BUSCO groups that were searched.

Further experimental evidence may be necessary to provide a more robust demonstration of the distinct characteristics of the Nguni sheep. The primary objective of our study was to generate a high-quality reference genome. This information serves as the essential tool for studies aimed at improving the Nguni sheep’s production and for conservation purposes. The generated genome reference will be made publicly available and accessible to the researchers for use in further research.

Data Records

The raw sequences were submitted to the NCBI and assigned accession number for SRA (SRR33210015) data of PRJNA1175667⁴⁴. The final genome assembly for the Nguni sheep was submitted to the NCBI under the BioProject: PRJNA1175667, BioSample: SAMN44368269 and assigned the accession number JBLGTL000000000⁴⁵. The annotation data of the genome is shared through figshare⁴⁶.

Technical Validation

The reference genome assembly of Nguni Sheep was supported by VGP workflows for genome assembly which were designed to reduce human error by employing workflows that produces near error-free high-quality reference genome. The assembly was done using Hifiasm on a Hi-C mode. The pipeline is comprised of different workflows that make use of methods such as BUSCO and mercury to assess the integrity of the genome.

Code availability

All the analyses done in the current study were processed by employing the VGP workflows that are publicly available in galaxy (https://galaxyproject.org/projects/vgp/workflows/). All the commands and pipelines were executed following the manual and protocols of the corresponding bioinformatics software. In all the workflows, unless mentioned and where necessary, we used the default parameters.

References

Badenhorst, S. & Magoma, M. Sheep (Ovis aries) of Venda speakers during the second millennium AD in South Africa. Int J Osteoarchaeol 32, 944–950 (2022).
Article Google Scholar
Nxumalo, K. S., Grobler, P., Ehlers, K., Nesengani, L. T. & Mapholi, N. O. The genetic assessment of South African Nguni sheep breeds using the ovine 50 K chip. Agriculture 12, 663 (2022).
Article CAS Google Scholar
Kunene, N. W. et al. Genetic diversity in four populations of Nguni (Zulu) sheep assessed by microsatellite analysis. Ital J Anim Sci 13, 3083 (2014).
Article Google Scholar
Nurse, G. T. Population movement around the northern Kalahari. Afr Stud 42, 153–163 (1983).
Article Google Scholar
Huffman, T. N. The archaeology of the Nguni past. Southern African Humanities 16, 79–111 (2004).
Google Scholar
Ramsay, K., Harris, L. & Kotzé, A. Landrace Breeds: South Africa’s Indigenous and Locally Developed Farm Animals (1998).
Nyamukanza, C. C., Scogings, P. F., Mbatha, K. R. & Kunene, N. W. Forage–sheep relationships in communally managed moist thornveld in Zululand, KwaZulu-Natal, South Africa. Afr J Range Forage Sci 27, 11–19 (2010).
Article Google Scholar
Gorkhali, N. A., Han, J. L. & Ma, Y. H. Mitochondrial DNA variation in indigenous sheep (Ovis aries) breeds of Nepal (2015).
Pollott, G. & Wilson, R. T. Sheep and goats for diverse products and profits (2009).
Kunene, N. W. & Fossey, A. A survey on livestock production in some traditional areas of Northern Kwazulu Natal in South Africa. Livest. Res. Rural Dev 18, 30–33 (2006).
Google Scholar
van Zyl, E. A. & Dugmore, T. J. Imvu–The indigenous sheep of KwaZulu-Natal: A Zulu Heritage.
Kunene, N. W., Bezuidenhout, C. C., Nsahlai, I. V. & Nesamvuni, E. A. A review of some characteristics, socio-economic aspects and utilization of Zulu sheep: implications for conservation. Trop Anim Health Prod 43, 1075–1079 (2011).
Article PubMed Google Scholar
Fossey, A., Nesamvuni, E. A. & Kunene, N. Characterisation of Zulu (Nguni) sheep using linear body measurements and some environmental factors affecting these measurements. S Afr J Anim Sci 37, 11–20 (2007).
Google Scholar
Selepe, M. M., Ceccobelli, S., Lasagna, E. & Kunene, N. W. Genetic structure of South African Nguni (Zulu) sheep populations reveals admixture with exotic breeds. PLoS One 13, e0196276 (2018).
Article PubMed PubMed Central Google Scholar
Dzomba, E. F., Chimonyo, M., Snyman, M. A. & Muchadeyi, F. C. The genomic architecture of South African mutton, pelt, dual‐purpose and nondescript sheep breeds relative to global sheep populations. Anim Genet 51, 910–923 (2020).
Article CAS PubMed Google Scholar
Ribeiro, M. N. et al. Threatened goat breeds from the tropics: the impact of crossbreeding with foreign goats. Sustainable Goat Production in Adverse Environments: Volume I: Welfare, Health and Breeding 101–110 (2017).
Ebenezer, T. E. et al. Africa: sequence 100,000 species to safeguard biodiversity. Nature 603, 388–392 (2022).
Article ADS CAS PubMed Google Scholar
Paez, S. et al. Reference genomes for conservation. Science (1979) 377, 364–366 (2022).
CAS Google Scholar
Brandies, P., Peel, E., Hogg, C. J. & Belov, K. The value of reference genomes in the conservation of threatened species. Genes (Basel) 10, 846 (2019).
Article CAS PubMed Google Scholar
Köhler-Rollefson, I. Farm animal genetic resources: Safeguarding national assets for food security and trade (2004).
Mahlase, E. M. & Fakir, S. The use of indigenous animal genetic resources to promote sustainable rural livelihoods in South Africa. Community-Based Management of Animal Genetic Resources 127 (2003).
Mapiye, C., Chikwanha, O. C., Chimonyo, M. & Dzama, K. Strategies for sustainable use of indigenous cattle genetic resources in Southern. Africa. Diversity (Basel) 11, 214 (2019).
Article Google Scholar
Rojas-Downing, M. M., Nejadhashemi, A. P., Harrigan, T. & Woznicki, S. A. Climate change and livestock: Impacts, adaptation, and mitigation. Clim Risk Manag 16, 145–163 (2017).
Article Google Scholar
Davenport, K. M. et al. An improved ovine reference genome assembly to facilitate in-depth functional annotation of the sheep genome. Gigascience 11, giab096 (2022).
Article PubMed PubMed Central Google Scholar
Mitchell, M. W. et al. High molecular weight DNA extraction and long-read next-generation sequencing of human genomic reference standards (2020).
Ekholm, J. et al. Streamlined SMRTbell® library generation using addition-only, single tube strategy for all library types reduces time to results. J Biomol Tech 30, S38–S38 (2019).
PubMed Central Google Scholar
Larivière, D. et al. Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy. Nat Biotechnol 42, 367–370 (2024).
Article PubMed PubMed Central Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. journal 17, 10–12 (2011).
Article Google Scholar
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18, 170–175 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhou, C., McCarthy, S. A. & Durbin, R. YaHS: yet another Hi-C scaffolding tool. Bioinformatics 39, btac808 (2023).
Article CAS PubMed Google Scholar
Wood, D. E. & Salzberg, S. L. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15, R46 (2014).
Article PubMed PubMed Central Google Scholar
Tarailo‐Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics 25, 4–10 (2009).
Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005).
Article CAS PubMed Google Scholar
Bao, Z. & Eddy, S. R. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res 12, 1269–1276 (2002).
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 27, 573–580 (1999).
Article CAS PubMed PubMed Central Google Scholar
Bailly-Bechet, M., Haudry, A. & Lerat, E. “One code to find them all”: a perl tool to conveniently parse RepeatMasker output files. Mob DNA 5, 1–15 (2014).
Article Google Scholar
Gabriel, L., Becker, F., Hoff, K. J. & Stanke, M. Tiberius: end-to-end deep learning with an HMM for gene prediction. Bioinformatics 40, btae685 (2024).
Article CAS PubMed PubMed Central Google Scholar
Luo, L.-Y. et al. Telomere-to-telomere sheep genome assembly reveals new variants associated with wool fineness trait. bioRxiv 2024–2027 (2024).
Liang, X. et al. Genomic structural variation contributes to evolved changes in gene expression in high-altitude Tibetan sheep. Proceedings of the National Academy of Sciences 121, e2322291121 (2024).
Article CAS Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20, 1–14 (2019).
Article Google Scholar
Tshilate, T. S., Ishengoma, E. & Rhode, C. A first annotated genome sequence for Haliotis midae with genomic insights into abalone evolution and traits of economic importance. Mar Genomics 70, 101044 (2023).
Article CAS PubMed Google Scholar
Meagher, T. R. & Vassiliadis, C. Phenotypic impacts of repetitive DNA in flowering plants. New Phytologist 168, 71–80 (2005).
Article CAS PubMed Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR33210015 (2025).
Nesengani,L.T. and Mapholi,N. Ovis aries breed Pedi, whole genome shotgun sequencing project. Genbank. https://identifiers.org/ncbi/insdc:JBLGTL000000000.1 (2025).
Nesengani, L.T. and Mapholi, N. Nguni Sheep Annotation Data. e https://doi.org/10.6084/m9.figshare.28078811 (2025).

Download references

Acknowledgements

This project is supported by University of South Africa, Africa-nuanced sustainable development goals research support programme. The samples for this project were provided by Mara research station, Limpopo province of South Africa and the technical support was provided by the AfricanBP. A special thanks to Mapholi lab for providing the resources and managing the project through grant of African-Nuanced Sustainable Development Goals Research Support Programme (ASDG-RSP) awarded by the University of South Africa to Mapholi lab. Finally, we would like to thank Mr Thomas Nkhumeleni Mbedzi from the Department of Forestry, Fisheries and Environment for the consultation engagement on this work.

Author information

Authors and Affiliations

College of Agriculture and Environmental Sciences, UNISA Science Campus, Florida, Johannesburg, South Africa
Lucky Tendani Nesengani, Thendo Tshilate, Sinebongo Mdyogolo, Rae Smith, Tracy Masebe, Isidore Houaga, A. Djikeng, Skhumbuzo Mbizeni & Mapholi Ntanganedzeni
Limpopo Department of Agricultur, Polokwane, South Africa
Thomas Raphulu
Inqaba Biotech, Pretoria, South Africa
Acclaim Moila
SequAna Core Facility, Department of Biology, University of Konstanz, Konstanz, Germany
Abdoallah Sharaf
Genetic Department, Faculty of Agriculture, Ain Shams University, Cairo, Egypt
Abdoallah Sharaf
Rothamsted Research, Harpended, United Kingdom
Sadik Muzemil
The Roslin Institute and Centre for Tropical Livestock Genetics and Health (CTLGH), University of Edinburgh, Edinburgh, UK
Isidore Houaga
National Defence University-Kenya, Nakuru, Kenya
Anne Muigai
Jomo Kenyatta University of Agriculture and Technology, Juja, Kenya
Anne Muigai
Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
Anne Muigai & Erich Jarvis
International Livestock Research Institute, Nairobi, Kenya
A. Djikeng & Sally Katee
Bioinformatics Center, University of Copenhagen, Copenhagen, Denmark
Josiah Kuja
Department of Plant Science and Biotechnology, University of Port Harcourt, Port Harcourt, Nigeria
Julian O. Osuji
Department of Digital Technologies, FoICDT, University of Mauritius, Mauritius, Mauritius
Zahra Mungloo-Dilmohamud
School of Law, University of KwaZulu-Natal, Durban, South Africa
Marietjie Botes
African BioGenome Project (AfricaBP), Nairobi, Kenya
ThankGod Echezona Ebenezer

Authors

Lucky Tendani Nesengani
View author publications
Search author on:PubMed Google Scholar
Thendo Tshilate
View author publications
Search author on:PubMed Google Scholar
Sinebongo Mdyogolo
View author publications
Search author on:PubMed Google Scholar
Rae Smith
View author publications
Search author on:PubMed Google Scholar
Tracy Masebe
View author publications
Search author on:PubMed Google Scholar
Thomas Raphulu
View author publications
Search author on:PubMed Google Scholar
Acclaim Moila
View author publications
Search author on:PubMed Google Scholar
Abdoallah Sharaf
View author publications
Search author on:PubMed Google Scholar
Sadik Muzemil
View author publications
Search author on:PubMed Google Scholar
Isidore Houaga
View author publications
Search author on:PubMed Google Scholar
Anne Muigai
View author publications
Search author on:PubMed Google Scholar
A. Djikeng
View author publications
Search author on:PubMed Google Scholar
Josiah Kuja
View author publications
Search author on:PubMed Google Scholar
Sally Katee
View author publications
Search author on:PubMed Google Scholar
Julian O. Osuji
View author publications
Search author on:PubMed Google Scholar
Zahra Mungloo-Dilmohamud
View author publications
Search author on:PubMed Google Scholar
Marietjie Botes
View author publications
Search author on:PubMed Google Scholar
Skhumbuzo Mbizeni
View author publications
Search author on:PubMed Google Scholar
Erich Jarvis
View author publications
Search author on:PubMed Google Scholar
ThankGod Echezona Ebenezer
View author publications
Search author on:PubMed Google Scholar
Mapholi Ntanganedzeni
View author publications
Search author on:PubMed Google Scholar

Contributions

L.T.N.- Literature review, sampling, analyses, diagrams, manuscript drafting, validation and manuscript review; T.T.- analyses, validation and manuscript review; S.M. - sampling and manuscript review; R.S.- diagrams, validation and manuscript review; T.M.- validation and manuscript review; T.P.- sampling; A.M.- review; A.S.- analyses, validation and manuscript review; S.M.- validation and manuscript review, I.H.- validation and manuscript review; AM- literature review, validation and manuscript review; A.D.- manuscript review; JK- review; A.M.- sequencing; S.K.- compliance; J.O.- review; Z.D.- review; Z.M.D.- validation and manuscript review; M.B.- Compliance; S.K.- sampling; E.J.: Validation and review the manuscript; T.E.E.- literature review, validation and manuscript review; N.M.: connived idea, literature review, secured funding validation and manuscript review.

Corresponding authors

Correspondence to Lucky Tendani Nesengani or Mapholi Ntanganedzeni.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Nesengani, L.T., Tshilate, T., Mdyogolo, S. et al. A chromosomal level genome assembly of Nguni Sheep, Ovis aries. Sci Data 12, 1193 (2025). https://doi.org/10.1038/s41597-025-05514-7

Download citation

Received: 02 January 2025
Accepted: 02 July 2025
Published: 10 July 2025
Version of record: 10 July 2025
DOI: https://doi.org/10.1038/s41597-025-05514-7