Abstract
Structural variations (SVs) play a key role in many human diseases and are major causative factors of malignant tumors. High-throughput chromatin conformation capture (Hi-C) technology captures spatial interactions between genomic fragments, thereby enhancing SV identification and localization and compensating for the limitations of sequencing-based approaches in detecting complex variants. However, existing methods based on Hi-C data still suffer from low accuracy, limited applicability, and difficulties in handling multiple types of SVs simultaneously. In this study, we propose VarHiCNet, a novel method for detecting structural variations from Hi-C data. Contact matrices are preprocessed and converted into image-like representations. These representations are then input into an improved RT-DETR network to identify candidate SV regions. Subsequently, a filtering and classification network is applied for precise breakpoint detection. Evaluated on six cancer cell lines, VarHiCNet demonstrates high accuracy and stability in SV identification, with overall performance surpassing that of existing methods. The source code is available at https://github.com/000425/VarHiCNet. Experimental results indicate that VarHiCNet achieves superior performance in detecting structural variations compared to other methods, offering a robust and accurate tool for genomic studies.
Data availability
The high-confidence SV list used in this study is sourced from Reference [25] and can be accessed via the following link: https://github.com/000425/data.
References
Logsdon, G. A. et al. Complex genetic variation in nearly complete human genomes. Nature 644, 430–441 (2025).
Hwang, M. S. et al. Targeting loss of heterozygosity for cancer-specific immunotherapy. Proceedings of the national academy of sciences 118.12 : e2022410118. (2021).
Nezami, M., Hager, S. & Shirazi, R. Transforming growth factor and the role of epigenetic aberrancies in oncogenic amplifications: A new perspective in preventive and therapeutic arena. J. Cancer Therapy. 14 (9), 390–407 (2023).
Sreenivasan, Varun, K. A. Verónica Yumiceba, and malte Spielmann. Structural variants in the 3D genome as drivers of disease. Nat. Rev. Genet. 26 (11), 742–760 (2025).
Su, X. et al. Challenges and prospects in utilizing technologies for gene fusion analysis in cancer diagnostics. Med-X 2 (1), 14 (2024).
Micheal, D. Personalized Medicine in the Genomic Age: Utilizing Whole Genome Sequencing for Specific Therapies. (2025).
He, S. et al. Systematic benchmarking of tools for structural variation detection using short-and long-read sequencing data in pigs. iScience 28 (3), 111983 (2025).
Alves, D. et al. Promoting comprehensive care for people with rare diseases in a tertiary care setting in brazil: protocol for a mixed methods implementation Study[J]. JMIR Res. Protocols. 14 (1), e68949 (2025).
Moustakli, E. et al. Long-Read sequencing and structural variant detection: unlocking the hidden genome in rare genetic disorders. Diagnostics 15 (14), 1803 (2025).
Liu, R. et al. Hi-C, a chromatin 3D structure technique advancing the functional genomics of immune cells. Front. Genet. 15, 1377238 (2024).
Dixon, J. R. et al. Integrative detection and analysis of structural variation in cancer genomes. Nat. Genet. 50 (10), 1388–1398 (2018).
Wang, S. et al. HiNT: a computational method for detecting copy number variations and translocations from Hi-C data. Genome Biol. 21 (1), 73 (2020).
Zhai, H. et al. HiSVision: A method for detecting Large-Scale structural variations based on Hi-C data and detection transformer. Interdisciplinary Sciences: Comput. Life Sci. 17 (3), 519–527 (2024).
Li, J., Gao, L. & Ye, Y. HiSV: a control-free method for structural variation detection from Hi-C data. PLoS Comput. Biol. 19 (1), e1010760 (2023).
Luo, J. et al. BreakNet: detecting deletions using long reads and a deep learning approach. BMC Bioinform. 22 (1), 577 (2021).
Gao, R. et al. INSnet: a method for detecting insertions based on deep learning network. BMC Bioinform. 24 (1), 80 (2023).
Zhao, Y. et al. Detrs beat yolos on real-time object detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. (2024).
Han, K. et al. Transformer in transformer. Adv. Neural. Inf. Process. Syst. 34, 15908–15919 (2021).
Henderi, H., Wahyuningsih, T. & Rahwanto, E. Comparison of Min-Max normalization and Z-Score normalization in the K-nearest neighbor (kNN) algorithm to test the accuracy of types of breast cancer. Int. J. Inf. Inform. Syst. 4 (1), 13–20 (2021).
Schöpflin, R. et al. Integration of Hi-C with short and long-read genome sequencing reveals the structure of germline rearranged genomes. Nat. Commun. 13 (1), 6470 (2022).
Álvarez-Fidalgo, D. & Ortin, F. CLAVE: A deep learning model for source code authorship verification with contrastive learning and transformer encoders. Inf. Process. Manag. 62 (3), 104005 (2025).
Koonce, B. ResNet 50. Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization 63–72 (A, 2021).
Liu, S. et al. Path aggregation network for instance segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. (2018).
Maćkiewicz, A. & Ratajczak, W. Principal components analysis (PCA). Comput. Geosci. 19 (3), 303–342 (1993).
Wang, X., Luan, Y. & Yue, F. EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps. Sci. Adv. 8 (24), eabn9215 (2022).
Funding
This research was supported by the Henan Provincial Department of Science and Technology Research Project (Grant No. 242102210097, 242102210110).
Author information
Authors and Affiliations
Contributions
JQS, HJW and JFW participated in the design of the study and the analysis of the experimental results. JFW and JQS performed the implementation. HJW and HXZ prepared the tables and figures. JQS and JWL summarized the results of the study and checked the format of the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Shen, J., Wang, H., Zhai, H. et al. A method for structural variant detection using Hi-C contact matrix and neural networks. Sci Rep (2026). https://doi.org/10.1038/s41598-026-37678-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-37678-6