Abstract
Via cross-correlation algorithms or synchronized acquisition of signals, the alignment of heterogeneous data with unknown semantic time shifts and intermittent semantic variations cannot be solved. The shift is caused by different data acquisition principles of sensors, different response discrimination principles using heterogeneous data, etc. Here, we report an unsupervised alignment architecture with a supervised learning model as the kernel to overcome the limitations of brain cognition, perception, and storage in aligning complex heterogeneous data. A set of data with a time shift is input into the kernel model of the architecture to predict the semantic labels, features or continuous values corresponding to another set of data. The time shift corresponding to the maximum testing accuracy or the minimum mean squared error is the alignment parameter for the two heterogeneous datasets. This architecture is expected to serve as a preprocessing step for semantic mining of signals and for information fusion.
Similar content being viewed by others
Data availability
The authors declare that the main data supporting the findings of this study are available within the article and its Supplementary Information files. Source Data are provided with this paper. All other relevant data are available from the corresponding author upon request. The datasets used for data alignment, as well as training and testing of the arc detection models have been deposited in the public repository (https://www.scidb.cn/en/s/iMnaii). Source data are provided with this paper.
Code availability
The synchronize triggering software and code for data generation, data processing, data alignment, and obtaining arc detection models have been deposited in the public repository53.
References
Bao, F. et al. Heat-assisted detection and ranging. Nature. 619, 743 (2023).
Xue, H., Hu, G., Hong, N., Dunnick, N. & Jin, Z. How to keep artificial intelligence evolving in the medical imaging world? Challenges and opportunities. Sci. Bull. 68, 648–652 (2023).
Li, C. et al. Development of an in-situ current-carrying friction testing instrument and experimental analysis under the background of the Fourth Industrial Revolution. Mech. Syst. Signal. Pr. 223, 111936 (2025).
Zhao, X. et al. JAMIP: an artificial-intelligence aided data-driven infrastructure for computational materials informatics. Sci. Bull. 66, 1973–1985 (2021).
Ali, M. A., Irfan, M. S., Khan, T., Khalid, M. Y. & Umer, R. Graphene nanoparticles as data generating digital materials in industry 4.0. Sci. Rep-UK. 13, 4945 (2023).
Liu, X., Zhang, J. & Pei, Z. Machine learning for high-entropy alloys: Progress, challenges and opportunities. Prog. Mater. Sci. 131, 101018 (2023).
Yang, C. et al. A machine learning-based alloy design system to facilitate the rational design of high entropy alloys with enhanced hardness. Acta. Mater. 222, 117431 (2022).
Bai, P., Miljkovic, F., John, B. & Lu, H. Interpretable bilinear attention network with domain adaptation improves drug-target prediction. Nat. Mach. Intell. 5, 126–136 (2023).
Sapoval, N. et al. Current progress and open challenges for applying deep learning across the biosciences. Nat. Commun. 13, 1728 (2022).
Fang, X. et al. Programmable gear-based mechanical metamaterials. Nat. Mater. 21, 869–876 (2022).
Jiao, P., Mueller, J., Raney, J., Zheng, X. & Alavi, A. Mechanical metamaterials and beyond. Nat. Commun. 14, 6004 (2023).
Xu, C., Solomon, S. & Gao, W. Artificial intelligence-powered electronic skin. Nat. Mach. Intell. 5, 1344–1355 (2023).
Liu, J. et al. Multimodal and flexible hydrogel-based sensors for respiratory monitoring and posture recognition. Biosens. Bioelectron. 243, 115773 (2024).
Che, Y., Stroe, D., Hu, X. & Teodorescu, R. Semi-Supervised Self-Learning-Based Lifetime Prediction for Batteries. IEEE T. Ind. Inform. 19, 6471–6481 (2023).
Lu, J., Xiong, R., Tian, J., Wang, C. & Sun, F. Deep learning to estimate lithium-ion battery state of health without additional degradation experiments. Nat. Commun. 14, 2760 (2023).
Hang, J., Qiu, G., Hao, M. & Ding, S. Improved fault diagnosis method for permanent magnet synchronous machine system based on lightweight multisource information data layer fusion. IEEE T. Power. electr. 39, 13808–13817 (2024).
Xiao, Y., Shao, H., Wang, J., Yan, S. & Liu, B. Bayesian variational transformer: A generalizable model for rotating machinery fault diagnosis. Mech. Syst. Signal. Pr. 207, 110936 (2024).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM. 60, 84–90 (2017).
Deng, W., Li, Z., Li, X., Chen, H. & Zhao, H. Compound fault diagnosis using optimized MCKD and sparse representation for rolling bearings. IEEE T. Instrum. Meas. 71, 3508509 (2022).
Song, Q., Jiang, X., Du, G., Liu, J. & Zhu, Z. Smart multichannel mode extraction for enhanced bearing fault diagnosis. Mech. Syst. Signal. Pr. 189, 110107 (2023).
Ding, S. et al. A meta-learning based multimodal neural network for multistep ahead battery thermal runaway forecasting. IEEE T. Ind. Inform. 17, 4503–4511 (2021).
Yang, H., Wu, J., Hu, Z. & Lv, C. Real-time driver cognitive workload recognition: attention-enabled learning with multimodal information fusion. IEEE T. Ind. Electron. 71, 4999–5009 (2024).
Noy, S. & Zhang, W. Experimental evidence on the productivity effects of generative artificial intelligence. Science 381, 187–192 (2023).
Li, C. et al. Realization of tensile-bending mechanical-thermal coupling fatigue based on a uniaxial tensile-fatigue testing device. IEEE T. Instrum. Meas. 71, 6005709 (2022).
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning internal representations by error propagation. Nature. 323, 533–536 (1986).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Zhang, M., Feng, Q., Ji, D. & Kang, Z. Invariant feature exploration generalization network for high-speed train brake pad state recognition under variable speeds. IEEE Trans. Ind. Inform. 21, 1921–1930 (2025).
Qian, Q., Qin, Y., Luo, J., Wang, Y. & Wu, F. Deep discriminative transfer learning network for cross-machine fault diagnosis. Mech. Syst. Signal. Pr. 186, 109884 (2023).
Pan, C., Shang, Z., Tang, L., Cheng, H. & Li, W. Open-set domain adaptive fault diagnosis based on supervised contrastive learning and a complementary weighted dual adversarial network. Mech. Syst. Signal. Pr. 222, 111780 (2025).
Xu, J., Kong, H., Li, K. & Ding, X. Generative zero-shot compound fault diagnosis based on semantic alignment. IEEE T. Instrum. Meas. 73, 3508713 (2024).
Zhu, Y., Liang, X., Wang, T., Xie, J. & Yang, J. Multi-information fusion fault diagnosis of bogie bearing under small samples via unsupervised representation alignment Deep Q-Learning. IEEE T. Instrum. Meas. 72, 3503315 (2023).
Zhu, H. et al. Visual grounding with joint multimodal representation and interaction. IEEE T. Instrum. Meas. 72, 5031811 (2023).
Dong, M., Li, H., Yin, S., Wu, Y. & See, K. Y. A postprocessing-technique-based switching loss estimation method for GaN Devices. IEEE T. Power. electr. 36, 8253–8266 (2021).
Gordon, J. A. & Novotny, D. R. Simultaneous Imaging and Precision Alignment of Two mm Wave Antennas Based on Polarization-Selective Machine-Vision. IEEE T. Instrum. Meas. 61, 3065–3071 (2012).
Liu, S., Lin, L., Ma, M. & Jiao, B. Improved Fractional Delay Method for Canceling the Self-Interference of Full Duplex. IEEE T. Veh. Technol. 72, 2599–2603 (2023).
Dawson, A., Michaels, J. E. & Michaels, T. Isolation of ultrasonic scattering by wavefield baseline subtraction. Mech. Syst. Signal. Pr. 70-71, 891–903 (2016).
Yu, S. et al. TDMSAE: A transferable decoupling multi-scale autoencoder for mechanical fault diagnosis. Mech. Syst. Signal. Pr. 185, 109789 (2023).
Long, M., Cao, Y., Wang, J., Jordan, M. Learning Transferable Features with Deep Adaptation Networks. In Proceedings of the 32nd International Conference on Machine Learning. (eds F. Bach & D. Blei) 37, 97–105 (2015).
Wang, C. et al. A multi-source domain feature-decision dual fusion adversarial transfer network for cross-domain anti-noise mechanical fault diagnosis insustainable city. Inform. Fusion. 115, 102739 (2025).
Hua, Z., Shi, J. & Dumond, P. Domain-invariant feature exploration for intelligent fault diagnosis under unseen and time-varying working conditions. Mech. Syst. Signal. Pr. 224, 112193 (2025).
Li, S. J. & Yu, J. A Multisource Domain Adaptation Network for Process Fault Diagnosis Under Different Working Conditions. IEEE T. Ind. Electron. 70, 6272–6283 (2023).
Chen, Q. et al. Metric Learning-Based Few-Shot Adversarial Domain Adaptation: A Cross-Machine Diagnosis Method for Ball Screws of Industrial Robots. IEEE T. Instrum. Meas. 73, 3522010 (2024).
Xie, K., Zhang, X., Hu, L., Chen, J. & Wu, G. Fiber-Optic Time Transfer Based on Bidirectional FDM and Cross Correlation Processing. IEEE T. Instrum. Meas. 73, 5504307 (2024).
Wang, D., Zhou, L., Zhao, Y. Frame Synchronization for Passive Continuous-Variable Quantum Key Distribution with a Local Local Oscillator. Adv. Quantum Technol. 8, 2400715 (2025).
Kaur, M. & Joshi, H. D. Chirp Signal Based Timing Offset Estimation for GFDM Systems. Wireless. Pers. Commun. 132, 1781–1796 (2023).
Agrawal, R., Dixon, S. Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. In 2020 28th European Signal Processing Conference (EUSIPCO). 141-145 (2021).
Gao, Q. et al. Enhanced current-carrying tribological properties of copper-based microporous friction pairs containing slow-release polyaniline conductive grease. Tribol. Int. 201, 110240 (2025).
Qiu, Y. et al. Laser powder bed fusion of in-situ amorphous oxide dispersion strengthened immiscible Cu-316 L bimetallic composite: Formation mechanism and current-carrying wear behavior. Tribol. Int. 200, 110096 (2024).
Iwama, S. et al. Two common issues in synchronized multimodal recordings with EEG:Jitter and latency. Neurosci. Res. 203, 1–7 (2024).
Fan, S., Zhang, X. & Song, Z. Imbalanced Sample Selection With Deep Reinforcement Learning for Fault Diagnosis. IEEE T. Ind. Inform 18, 2518–2527 (2022).
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A. Learning Deep Features for Discriminative Localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2921–2929 (2016).
Hollmann, N. et al. Accurate predictions on small data with a tabular foundation model. Nature 637, 319–326 (2025).
Li C, et al. Machine Learning-driven Alignment Architecture of Heterogeneous Data with Transient Varying Semantics, Zenodo, https://doi.org/10.5281/zenodo.19679056 (2025).
Acknowledgements
This work is funded by the National Natural Science Foundation of China No.92266206 (Z.M.), No.52525510 (Z.M.), and No.52550005 (Z.M.), the National Key R&D Program of China No.2023YFF0716800 (Z.M.) and Jilin Province Science and Technology Development Plan No.20240302065GX (Z.M.) and No.20250101004JJ (Z.M.).
Author information
Authors and Affiliations
Contributions
C.L. and Z.M. conceived the research and designed the experiments. C.L. carried out the experiments, edited the code and analyzed the data. Y.Z. prepared the specimens. Z.Y.(Zaizheng Yang), J.L. Z.Y.(Zheng Yang) and J.X. checked the code. C.L. wrote and revised the manuscript with input from the other authors. S.N., Z.W. assisted C.L. in revising the manuscript, with input from the other authors. Z.M. examined and polished the manuscript. Z.M., H.Z. and L.R. supervised the research. All authors contributed to the interpretation and drafting of the paper.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Anuj Bansal, Shuaibing Li, Santhakumar Sampath and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, C., Ma, Z., Zeng, Y. et al. Machine learning-driven alignment architecture of heterogeneous data with transient varying semantics. Nat Commun (2026). https://doi.org/10.1038/s41467-026-72377-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-72377-w


