The genomic structure of complex chromosomal rearrangement at the Fm locus in black-bone Silkie chicken

Sharma, Ashutosh; Vijay, Nagarjun

doi:10.1038/s42003-025-07825-2

Download PDF

Matters Arising
Open access
Published: 01 April 2025

The genomic structure of complex chromosomal rearrangement at the Fm locus in black-bone Silkie chicken

Communications Biology volume 8, Article number: 537 (2025) Cite this article

1551 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Matters Arising to this article was published on 01 April 2025

The Original Article was published on 06 December 2023

arising from F. Zhu et al. Communications Biology https://doi.org/10.1038/s42003-023-05619-y (2023)

High-quality chromosome-level genome assemblies for numerous avian species promise to address longstanding questions in bird evolution and biology. In a recent issue of Communications Biology, Zhu, F., Yin, Z.T., Zhao, Q.S. et al. (ZYZSJ)¹ presented a chromosome-level assembly for the Silkie chicken using a multi-platform high-coverage dataset to obtain accurate and complete sequences spanning the entire genome. A key finding from their genomic analysis is the reconstruction of the structure of the complex rearrangement at the Fm locus, the primary genetic change underlying the rare and conspicuous dermal hyperpigmentation phenotype generally called Fibromelanosis. However, in contrast to their identification of the *Fm_1 scenario, several previously published studies^2,3,4,5,6 claim that *Fm_2 is the valid scenario. Our re-analysis of ZYZSJ’s new assembly (CAU_Silkie) using long-read data from multiple black-bone chickens demonstrates that *Fm_2 is the alternative scenario. The *Fm_1 scenario favoured by ZYZSJ results from an assembly error caused by mosaic haplotypes generated during the de novo assembly step. We recommend post-assembly validation and correction in genome projects to prevent misinterpretation due to assembly artefacts. Enhancing the assembly of haplotypes in such complex regions is essential for unravelling the genetic foundations of traits governed by genes within these areas.

The fibromelanosis phenotype, found in Silkie and other black-bone chicken breeds such as Yeonsan Ogye and Kadaknath, is caused by a complex chromosomal rearrangement (Fm locus) on chromosome 20 with a common origin in all black-bone chicken breeds^{2,3,5,6,7,8,9} (Supplementary Text and Supplementary Figs. S1–4). However, the correct genomic structure of this rearrangement has been challenging to establish based on short-read sequencing, PCR and genetic analysis of crossing experiments alone¹⁰. Three possible hypothetical scenarios (*Fm_1, *Fm_2, and *Fm_3, see Fig. 1) have been proposed³ based on the identified rearrangement junctions. Genetic analysis of crosses between individuals with Fm and wild-type (*N) has favoured the second scenario (*Fm_2)^3,10. However, subsequent studies utilising limited long-read sequencing for de novo assembly have failed to determine the correct scenario^11,12,13 (Supplementary Text).

**Fig. 1: Organisation of Fm locus in Red junglefowl and Silkie black-bone chicken.**

ZYZSJ generated a de novo high-quality multi-platform chromosome-level genome (CAU_Silkie) from a single Fm homozygote (Fm/Fm) silkie individual. In ZYZSJ’s Fig. 1b and Supplementary Figs. 11–15, they identified *Fm_1 scenario (which the authors call FM2) as the arrangement at the Fm locus by relying upon this genome assembly. In a parallel study⁵, we opted for a haplotype phasing approach to resolve the structure of the Fm locus. In this approach, the silkie long-reads mapped to the GRCg6a genome were assigned to two haplotypes (Fig. 2A) corresponding to the two copies of the duplicated regions. The reads mapping at the boundaries of the duplicated region consists of two types: those that spanned the duplicated-unduplicated junctions and those that aligned in the duplicated region but were soft-clipped at the junctions (Supplementary Fig. S4a). The genomic sequences contiguous with these two types of reads were distinguished based on the allelic states at haplotype-defining positions (HDPs). HDPs are sites that separate haplotype-consistent long-read tiling paths extending into the flanks of the duplicated regions. The read assignment step identified two haplotypes (Fig. 2B) spanning Dup1 and Dup2 regions and extending into the flanking regions. Step-wise extension of these haplotype-specific tiling paths identified *Fm_2 scenario as the correct arrangement of the Fm locus (Fig. 2C).

**Fig. 2: Graphical representation of haplotype-based phasing at the Fm locus in CAU Silkie using GRCg6a genome assembly.**

Despite utilising ZYZSJ’s long-read datasets (ONT and PacBio), our analysis consistently supported the *Fm_2 scenario (Supplementary Text, Supplementary Figs. S5–121 and Supplementary Data S1–19). Our haplotype phasing approach excels in resolving the rearrangement at the Fm locus for several reasons: (1) we identify specific haplotype-defining positions (HDPs) spanning the entire length of the Fm locus, (2) we quantify long-read support for each pair of HDPs, and (3) we independently validate the results using ONT and PacBio long-reads. Our method’s intuitive nature and data visualisation at various steps led us to the alternative (*Fm_2) conclusion, which can be verified by examining the raw data (Fig. 2). We observed that the end of the Dup1 region and the start of the Dup2 region have insufficient HDPs (Supplementary Figs. S9a and S10a). Therefore, we relied upon haplotype-specific reads that span these HDP-poor regions.

Evaluating long-read support for the CAU_Silkie assembly revealed multiple haplotype switching events (Supplementary Fig. S122). The absence of haplotype-consistent raw read support for the published CAU_Silkie genome assembly suggests that the de novo assembly approach has failed to resolve the haplotypes. We observed that the allelic states either did not differ between haplotypes or were inconsistent with the long-read dataset (Supplementary Text, Supplementary Figs. S123–163 and Supplementary Data S20–27). Hence, the CAU_Silkie assembly of the Fm locus consists of a mosaic haplotype. Due to a mosaic haplotype in the CAU_Silkie assembly, the previously identified 49 HDPs were insufficient for haplotype phasing. So, we identified 211 new HDPs while re-analysing the long-read data using CAU_Silkie as the reference. All 260 HDPs are concordant with the haplotypes we reconstructed earlier⁵, reinforcing our result that the *Fm_2 is the alternative structure of the Fm locus (Supplementary Data S20–27).

Based on the evidence presented in our re-analysis and the findings from our previously published study⁵, we propose that consistent with early studies^3,10, *Fm_2 represents the accurate arrangement at the Fm locus, with no support for scenarios *Fm_1 and *Fm_3. Our evidence that *Fm_2 is the alternative structure of the Fm locus will be vital to resolving future questions about gene regulation in this region. In a broader context, a black-box approach to genome assembly, lacking downstream validation, can introduce erroneous bases into genome assemblies, potentially impacting evolutionary genomic analysis¹⁴. Given the increasing capability of long-read sequencing methods to resolve complex structural variations, we advocate for caution when employing genome assembly tools. To enhance the reliability of results, we recommend thorough post-assembly evaluation and correction of the genomic sequence, utilising all raw data used in the assembly process and supplementing it with alternative sources of information such as population genetics and methylation-based phasing.

Methods

A schematic description of HDP identification and haplotype-specific read partitioning using the GRCg6a genome is provided in Supplementary Figs. S120 and S121. In our re-analysis, we scrutinised the CAU_Silkie assembly to assess whether the sequencing raw data substantiated the integrity of the genome assembly. Initially, we examined the allelic states at the 49 HDPs identified in our recent study⁵ within the CAU_Silkie assembly after the liftover of positions from GRCg6a. We could not find haplotype-consistent tiling paths based on these HDPs. Considering the possibility that the genome assembler employed a distinct set of HDPs for reconstructing the correct structure of the Fm locus, we identified 211 additional sites that differed between the haplotypes of the CAU_Silkie assembly. We adapted our read-backed phasing script¹⁵ to assess long-read support for each pair of these sites. The complete set of HDPs (previous 49 + newly identified 211 positions) consistently recovered the same haplotypes in mapped long-reads irrespective of whether the GRCg6a or CAU_Silkie assembly was used.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Data are available at https://github.com/Ashu2195/Fm_locus and archived on Zenodo at https://doi.org/10.5281/zenodo.14011071¹⁵.

Code availability

Scripts and code are available at https://github.com/Ashu2195/Fm_locus and archived on Zenodo at https://doi.org/10.5281/zenodo.14011071¹⁵.

References

Zhu, F. et al. A chromosome-level genome assembly for the Silkie chicken resolves complete sequences for key chicken metabolic, reproductive, and immunity genes. Commun. Biol. 61, 1–15 (2023).
CAS Google Scholar
Dorshorst, B., Okimoto, R. & Ashwell, C. Genomic regions associated with dermal hyperpigmentation, polydactyly and other morphological traits in the Silkie chicken. J. Hered. 101, 339–350 (2010).
Dorshorst, B. et al. A complex genomic rearrangement involving the Endothelin 3 locus causes dermal hyperpigmentation in the chicken. PLoS Genet. 7, e1002412 (2011).
Tian, M. et al. Genomic regions associated with the sex-linked inhibitor of dermal melanin in Silkie chicken. Front. Agric. Sci. Eng. 1, 242–249 (2014).
Article Google Scholar
Shinde, S. S., Sharma, A. & Vijay, N. Decoding the fibromelanosis locus complex chromosomal rearrangement of black-bone chicken: genetic differentiation, selective sweeps and protein-coding changes in Kadaknath chicken. Front. Genet. 14, 1180658 (2023).
Article CAS PubMed PubMed Central Google Scholar
Shinomiya, A. et al. Gene duplication of endothelin 3 is closely correlated with the hyperpigmentation of the internal organs (fibromelanosis) in silky chickens. Genetics 190, 627–638 (2012).
Tian, M. et al. Inverted duplication including Endothelin 3 closely related to dermal hyperpigmentation in Silkie chickens. Front. Agric. Sci. Eng. 1, 121–129 (2014).
Article Google Scholar
Bateson, B. W. et al. The inheritance of the peculiar pigmentation of the silky fowl. J. Genet. 1, 185–203 (1911).
Article Google Scholar
Cha, J. et al. Genome-wide association study revealed the genomic regions associated with skin pigmentation in an Ogye x White Leghorn F2 chicken population. Poult. Sci. 102, 102720 (2023).
Dharmayanthi, A. B. et al. The origin and evolution of fibromelanosis in domesticated chickens: Genomic comparison of Indonesian Cemani and Chinese Silkie breeds. PLoS ONE 12, e0173147 (2017).
Li, M. et al. De novo assembly of 20 chicken genomes reveals the undetectable phenomenon for thousands of core genes on microchromosomes and subtelomeric regions. Mol. Biol. Evol. 39, msac066 (2022).
Sohn, J. L. et al. Whole genome and transcriptome maps of the entirely black native Korean chicken breed Yeonsan Ogye. Gigascience 7, 1–14 (2018).
Article CAS Google Scholar
Cho, Y., Kim, J. Y. & Kim, N. Comparative genomics and selection analysis of Yeonsan Ogye black chicken with whole-genome sequencing. Genomics 114, 110298 (2022).
Mittal, P., Jaiswal, S. K., Vijay, N., Saxena, R. & Sharma, V. K. Comparative analysis of corrected tiger genome provides clues to its neuronal evolution. Sci. Rep. 9, 18459 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sharma, A. & Vijay, N. The genomic structure of complex chromosomal rearrangement at the Fm locus in black-bone Silkie chicken. Zenodo https://doi.org/10.5281/ZENODO.14011071 (2024).

Download references

Acknowledgements

The authors thank the University Grants Commission for supporting AS with a Ph.D. scholarship. The Department of Biotechnology, Ministry of Science and Technology, India (Grant no. BT/11/IYBA/2018/03) and Science and Engineering Research Board (Grant no. ECR/2017/001430) provided funds for computational resources (i.e., Har Gobind Khorana Computational Biology cluster) used.

Author information

Authors and Affiliations

Computational Evolutionary Genomics Lab, Department of Biological Sciences, IISER Bhopal, Bhauri, Madhya Pradesh, India
Ashutosh Sharma & Nagarjun Vijay

Authors

Ashutosh Sharma
View author publications
Search author on:PubMed Google Scholar
Nagarjun Vijay
View author publications
Search author on:PubMed Google Scholar

Contributions

Ashutosh Sharma: conceptualisation, formal analysis, investigation, visualisation, validation, writing—original draft, writing—review & editing. Nagarjun Vijay: conceptualisation, resources, writing—original draft, writing—review & editing, funding acquisition, project administration, supervision.

Corresponding author

Correspondence to Nagarjun Vijay.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: George Inglis and Christina Karlsson Rosenthal.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Description of Additional Supplementary Files

Supplementary Data

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Sharma, A., Vijay, N. The genomic structure of complex chromosomal rearrangement at the Fm locus in black-bone Silkie chicken. Commun Biol 8, 537 (2025). https://doi.org/10.1038/s42003-025-07825-2

Download citation

Received: 03 February 2024
Accepted: 26 February 2025
Published: 01 April 2025
Version of record: 01 April 2025
DOI: https://doi.org/10.1038/s42003-025-07825-2

This article is cited by

Selection signature analysis in chickens divergently selected for growth rate reveals novel candidate genes regulating fat deposition
- Hossein Abbasabadi
- Mohammad Reza Bakhtiarizadeh
- Beatriz Gutiérrez-Gil
BMC Genomics (2025)