Machine learning-driven alignment architecture of heterogeneous data with transient varying semantics

Li, Chaofan; Ma, Zhichao; Zeng, Yangzhi; Yang, Zaizheng; Li, Jiakai; Yang, Zheng; Xiong, Junming; Niu, Shichao; Wang, Zhe; Zhao, Hongwei; Ren, Luquan

doi:10.1038/s41467-026-72377-w

Download PDF

Article
Open access
Published: 23 April 2026

Machine learning-driven alignment architecture of heterogeneous data with transient varying semantics

Nature Communications (2026) Cite this article

3895 Accesses
1 Altmetric
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Via cross-correlation algorithms or synchronized acquisition of signals, the alignment of heterogeneous data with unknown semantic time shifts and intermittent semantic variations cannot be solved. The shift is caused by different data acquisition principles of sensors, different response discrimination principles using heterogeneous data, etc. Here, we report an unsupervised alignment architecture with a supervised learning model as the kernel to overcome the limitations of brain cognition, perception, and storage in aligning complex heterogeneous data. A set of data with a time shift is input into the kernel model of the architecture to predict the semantic labels, features or continuous values corresponding to another set of data. The time shift corresponding to the maximum testing accuracy or the minimum mean squared error is the alignment parameter for the two heterogeneous datasets. This architecture is expected to serve as a preprocessing step for semantic mining of signals and for information fusion.

Unified time series classification framework for explainable artificial intelligence

Article Open access 28 April 2026

Delegation to artificial intelligence can increase dishonest behaviour

Article Open access 17 September 2025

Machine learning-enhanced multi-band metamaterial sensor for early detection of neurological disorders

Article Open access 06 February 2026

Data availability

The authors declare that the main data supporting the findings of this study are available within the article and its Supplementary Information files. Source Data are provided with this paper. All other relevant data are available from the corresponding author upon request. The datasets used for data alignment, as well as training and testing of the arc detection models have been deposited in the public repository (https://www.scidb.cn/en/s/iMnaii). Source data are provided with this paper.

Code availability

The synchronize triggering software and code for data generation, data processing, data alignment, and obtaining arc detection models have been deposited in the public repository⁵³.

References

Bao, F. et al. Heat-assisted detection and ranging. Nature. 619, 743 (2023).
Google Scholar
Xue, H., Hu, G., Hong, N., Dunnick, N. & Jin, Z. How to keep artificial intelligence evolving in the medical imaging world? Challenges and opportunities. Sci. Bull. 68, 648–652 (2023).
Google Scholar
Li, C. et al. Development of an in-situ current-carrying friction testing instrument and experimental analysis under the background of the Fourth Industrial Revolution. Mech. Syst. Signal. Pr. 223, 111936 (2025).
Google Scholar
Zhao, X. et al. JAMIP: an artificial-intelligence aided data-driven infrastructure for computational materials informatics. Sci. Bull. 66, 1973–1985 (2021).
Google Scholar
Ali, M. A., Irfan, M. S., Khan, T., Khalid, M. Y. & Umer, R. Graphene nanoparticles as data generating digital materials in industry 4.0. Sci. Rep-UK. 13, 4945 (2023).
Google Scholar
Liu, X., Zhang, J. & Pei, Z. Machine learning for high-entropy alloys: Progress, challenges and opportunities. Prog. Mater. Sci. 131, 101018 (2023).
Google Scholar
Yang, C. et al. A machine learning-based alloy design system to facilitate the rational design of high entropy alloys with enhanced hardness. Acta. Mater. 222, 117431 (2022).
Google Scholar
Bai, P., Miljkovic, F., John, B. & Lu, H. Interpretable bilinear attention network with domain adaptation improves drug-target prediction. Nat. Mach. Intell. 5, 126–136 (2023).
Google Scholar
Sapoval, N. et al. Current progress and open challenges for applying deep learning across the biosciences. Nat. Commun. 13, 1728 (2022).
Google Scholar
Fang, X. et al. Programmable gear-based mechanical metamaterials. Nat. Mater. 21, 869–876 (2022).
Google Scholar
Jiao, P., Mueller, J., Raney, J., Zheng, X. & Alavi, A. Mechanical metamaterials and beyond. Nat. Commun. 14, 6004 (2023).
Google Scholar
Xu, C., Solomon, S. & Gao, W. Artificial intelligence-powered electronic skin. Nat. Mach. Intell. 5, 1344–1355 (2023).
Google Scholar
Liu, J. et al. Multimodal and flexible hydrogel-based sensors for respiratory monitoring and posture recognition. Biosens. Bioelectron. 243, 115773 (2024).
Google Scholar
Che, Y., Stroe, D., Hu, X. & Teodorescu, R. Semi-Supervised Self-Learning-Based Lifetime Prediction for Batteries. IEEE T. Ind. Inform. 19, 6471–6481 (2023).
Google Scholar
Lu, J., Xiong, R., Tian, J., Wang, C. & Sun, F. Deep learning to estimate lithium-ion battery state of health without additional degradation experiments. Nat. Commun. 14, 2760 (2023).
Google Scholar
Hang, J., Qiu, G., Hao, M. & Ding, S. Improved fault diagnosis method for permanent magnet synchronous machine system based on lightweight multisource information data layer fusion. IEEE T. Power. electr. 39, 13808–13817 (2024).
Google Scholar
Xiao, Y., Shao, H., Wang, J., Yan, S. & Liu, B. Bayesian variational transformer: A generalizable model for rotating machinery fault diagnosis. Mech. Syst. Signal. Pr. 207, 110936 (2024).
Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM. 60, 84–90 (2017).
Google Scholar
Deng, W., Li, Z., Li, X., Chen, H. & Zhao, H. Compound fault diagnosis using optimized MCKD and sparse representation for rolling bearings. IEEE T. Instrum. Meas. 71, 3508509 (2022).
Google Scholar
Song, Q., Jiang, X., Du, G., Liu, J. & Zhu, Z. Smart multichannel mode extraction for enhanced bearing fault diagnosis. Mech. Syst. Signal. Pr. 189, 110107 (2023).
Google Scholar
Ding, S. et al. A meta-learning based multimodal neural network for multistep ahead battery thermal runaway forecasting. IEEE T. Ind. Inform. 17, 4503–4511 (2021).
Google Scholar
Yang, H., Wu, J., Hu, Z. & Lv, C. Real-time driver cognitive workload recognition: attention-enabled learning with multimodal information fusion. IEEE T. Ind. Electron. 71, 4999–5009 (2024).
Google Scholar
Noy, S. & Zhang, W. Experimental evidence on the productivity effects of generative artificial intelligence. Science 381, 187–192 (2023).
Google Scholar
Li, C. et al. Realization of tensile-bending mechanical-thermal coupling fatigue based on a uniaxial tensile-fatigue testing device. IEEE T. Instrum. Meas. 71, 6005709 (2022).
Google Scholar
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning internal representations by error propagation. Nature. 323, 533–536 (1986).
Google Scholar
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Google Scholar
Zhang, M., Feng, Q., Ji, D. & Kang, Z. Invariant feature exploration generalization network for high-speed train brake pad state recognition under variable speeds. IEEE Trans. Ind. Inform. 21, 1921–1930 (2025).
Google Scholar
Qian, Q., Qin, Y., Luo, J., Wang, Y. & Wu, F. Deep discriminative transfer learning network for cross-machine fault diagnosis. Mech. Syst. Signal. Pr. 186, 109884 (2023).
Google Scholar
Pan, C., Shang, Z., Tang, L., Cheng, H. & Li, W. Open-set domain adaptive fault diagnosis based on supervised contrastive learning and a complementary weighted dual adversarial network. Mech. Syst. Signal. Pr. 222, 111780 (2025).
Google Scholar
Xu, J., Kong, H., Li, K. & Ding, X. Generative zero-shot compound fault diagnosis based on semantic alignment. IEEE T. Instrum. Meas. 73, 3508713 (2024).
Google Scholar
Zhu, Y., Liang, X., Wang, T., Xie, J. & Yang, J. Multi-information fusion fault diagnosis of bogie bearing under small samples via unsupervised representation alignment Deep Q-Learning. IEEE T. Instrum. Meas. 72, 3503315 (2023).
Google Scholar
Zhu, H. et al. Visual grounding with joint multimodal representation and interaction. IEEE T. Instrum. Meas. 72, 5031811 (2023).
Google Scholar
Dong, M., Li, H., Yin, S., Wu, Y. & See, K. Y. A postprocessing-technique-based switching loss estimation method for GaN Devices. IEEE T. Power. electr. 36, 8253–8266 (2021).
Google Scholar
Gordon, J. A. & Novotny, D. R. Simultaneous Imaging and Precision Alignment of Two mm Wave Antennas Based on Polarization-Selective Machine-Vision. IEEE T. Instrum. Meas. 61, 3065–3071 (2012).
Google Scholar
Liu, S., Lin, L., Ma, M. & Jiao, B. Improved Fractional Delay Method for Canceling the Self-Interference of Full Duplex. IEEE T. Veh. Technol. 72, 2599–2603 (2023).
Google Scholar
Dawson, A., Michaels, J. E. & Michaels, T. Isolation of ultrasonic scattering by wavefield baseline subtraction. Mech. Syst. Signal. Pr. 70-71, 891–903 (2016).
Google Scholar
Yu, S. et al. TDMSAE: A transferable decoupling multi-scale autoencoder for mechanical fault diagnosis. Mech. Syst. Signal. Pr. 185, 109789 (2023).
Google Scholar
Long, M., Cao, Y., Wang, J., Jordan, M. Learning Transferable Features with Deep Adaptation Networks. In Proceedings of the 32nd International Conference on Machine Learning. (eds F. Bach & D. Blei) 37, 97–105 (2015).
Wang, C. et al. A multi-source domain feature-decision dual fusion adversarial transfer network for cross-domain anti-noise mechanical fault diagnosis insustainable city. Inform. Fusion. 115, 102739 (2025).
Google Scholar
Hua, Z., Shi, J. & Dumond, P. Domain-invariant feature exploration for intelligent fault diagnosis under unseen and time-varying working conditions. Mech. Syst. Signal. Pr. 224, 112193 (2025).
Google Scholar
Li, S. J. & Yu, J. A Multisource Domain Adaptation Network for Process Fault Diagnosis Under Different Working Conditions. IEEE T. Ind. Electron. 70, 6272–6283 (2023).
Google Scholar
Chen, Q. et al. Metric Learning-Based Few-Shot Adversarial Domain Adaptation: A Cross-Machine Diagnosis Method for Ball Screws of Industrial Robots. IEEE T. Instrum. Meas. 73, 3522010 (2024).
Google Scholar
Xie, K., Zhang, X., Hu, L., Chen, J. & Wu, G. Fiber-Optic Time Transfer Based on Bidirectional FDM and Cross Correlation Processing. IEEE T. Instrum. Meas. 73, 5504307 (2024).
Google Scholar
Wang, D., Zhou, L., Zhao, Y. Frame Synchronization for Passive Continuous-Variable Quantum Key Distribution with a Local Local Oscillator. Adv. Quantum Technol. 8, 2400715 (2025).
Kaur, M. & Joshi, H. D. Chirp Signal Based Timing Offset Estimation for GFDM Systems. Wireless. Pers. Commun. 132, 1781–1796 (2023).
Google Scholar
Agrawal, R., Dixon, S. Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. In 2020 28th European Signal Processing Conference (EUSIPCO). 141-145 (2021).
Gao, Q. et al. Enhanced current-carrying tribological properties of copper-based microporous friction pairs containing slow-release polyaniline conductive grease. Tribol. Int. 201, 110240 (2025).
Google Scholar
Qiu, Y. et al. Laser powder bed fusion of in-situ amorphous oxide dispersion strengthened immiscible Cu-316 L bimetallic composite: Formation mechanism and current-carrying wear behavior. Tribol. Int. 200, 110096 (2024).
Google Scholar
Iwama, S. et al. Two common issues in synchronized multimodal recordings with EEG:Jitter and latency. Neurosci. Res. 203, 1–7 (2024).
Google Scholar
Fan, S., Zhang, X. & Song, Z. Imbalanced Sample Selection With Deep Reinforcement Learning for Fault Diagnosis. IEEE T. Ind. Inform 18, 2518–2527 (2022).
Google Scholar
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A. Learning Deep Features for Discriminative Localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2921–2929 (2016).
Hollmann, N. et al. Accurate predictions on small data with a tabular foundation model. Nature 637, 319–326 (2025).
Google Scholar
Li C, et al. Machine Learning-driven Alignment Architecture of Heterogeneous Data with Transient Varying Semantics, Zenodo, https://doi.org/10.5281/zenodo.19679056 (2025).

Download references

Acknowledgements

This work is funded by the National Natural Science Foundation of China No.92266206 (Z.M.), No.52525510 (Z.M.), and No.52550005 (Z.M.), the National Key R&D Program of China No.2023YFF0716800 (Z.M.) and Jilin Province Science and Technology Development Plan No.20240302065GX (Z.M.) and No.20250101004JJ (Z.M.).

Author information

Authors and Affiliations

School of Mechanical and Aerospace Engineering, Jilin University, Changchun, China
Chaofan Li, Zhichao Ma, Yangzhi Zeng, Zaizheng Yang, Jiakai Li, Zheng Yang, Junming Xiong & Hongwei Zhao
Key Laboratory of CNC Equipment Reliability Ministry of Education, Jilin University, Changchun, China
Zhichao Ma & Hongwei Zhao
Key Laboratory of Bionic Engineering Ministry of Education, Jilin University, Changchun, China
Shichao Niu, Zhe Wang & Luquan Ren

Authors

Chaofan Li
View author publications
Search author on:PubMed Google Scholar
Zhichao Ma
View author publications
Search author on:PubMed Google Scholar
Yangzhi Zeng
View author publications
Search author on:PubMed Google Scholar
Zaizheng Yang
View author publications
Search author on:PubMed Google Scholar
Jiakai Li
View author publications
Search author on:PubMed Google Scholar
Zheng Yang
View author publications
Search author on:PubMed Google Scholar
Junming Xiong
View author publications
Search author on:PubMed Google Scholar
Shichao Niu
View author publications
Search author on:PubMed Google Scholar
Zhe Wang
View author publications
Search author on:PubMed Google Scholar
Hongwei Zhao
View author publications
Search author on:PubMed Google Scholar
Luquan Ren
View author publications
Search author on:PubMed Google Scholar

Contributions

C.L. and Z.M. conceived the research and designed the experiments. C.L. carried out the experiments, edited the code and analyzed the data. Y.Z. prepared the specimens. Z.Y.(Zaizheng Yang), J.L. Z.Y.(Zheng Yang) and J.X. checked the code. C.L. wrote and revised the manuscript with input from the other authors. S.N., Z.W. assisted C.L. in revising the manuscript, with input from the other authors. Z.M. examined and polished the manuscript. Z.M., H.Z. and L.R. supervised the research. All authors contributed to the interpretation and drafting of the paper.

Corresponding author

Correspondence to Zhichao Ma.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Anuj Bansal, Shuaibing Li, Santhakumar Sampath and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Movie 1 (download MP4 )

Supplementary Movie 2 (download MP4 )

Supplementary Movie 3 (download MP4 )

Supplementary Movie 4 (download MP4 )

Supplementary Movie 5 (download MP4 )

Transparent Peer Review file (download PDF )

Source data

Source Data (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, C., Ma, Z., Zeng, Y. et al. Machine learning-driven alignment architecture of heterogeneous data with transient varying semantics. Nat Commun (2026). https://doi.org/10.1038/s41467-026-72377-w

Download citation

Received: 08 April 2025
Accepted: 10 April 2026
Published: 23 April 2026
DOI: https://doi.org/10.1038/s41467-026-72377-w