Abstract
Network robustness design is a significant engineering task in complex systems including urban planning, communication programming, and chip designing. With the embedded vulnerability of complex networks, the relationship between network topology and its robustness remains unknown, presenting a significant challenge in designing robust networks. Existing approaches—ranging from empirical manual designs, statistically-driven rules to optimization via Monte Carlo simulations, struggle to meet the design demands of robust networks under multidimensional attacks. Here, we introduce a general framework for designing robust networks based on AI reinforcement learning. This framework establishes an interactive environment between network attack strategies and design models, enabling the learning of effective robustness design strategies against attacks. Our framework enables effective design of robust networks, for a given cost, surpassing existing methods. Notably, we find that during the design process, the network may develop suitable multi-backbones that mitigate its current vulnerability, offering insight into higher-order relations in real-world networks. Our approach can be adopted to various network design scenarios, which provides an integrative intelligent solution for designing robust complex systems.
Similar content being viewed by others
Data availability
The data used for training and testing the RL agent, the trained neural network parameters, the real networks data, the designed networks data, and corresponding source code are available at https://github.com/Zhu-BY/Design_Robust_Network.
Code availability
All codes used for training, network design based on the trained model, and backbone structure analysis in this research can be freely accessed at: https://github.com/Zhu-BY/Design_Robust_Network.
References
Strogatz, S. H. Exploring complex networks. Nature 410, 268–276 (2001).
Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D.-U. Complex networks: structure and dynamics. Phys. Rep. 424, 175–308 (2006).
Gao, J., Buldyrev, S. V., Havlin, S. & Stanley, H. E. Robustness of a network of networks. Phys. Rev. Lett. 107, 195701 (2011).
De Domenico, M. et al. Mathematical formulation of multilayer networks. Phys. Rev. X 3, 041022 (2013).
Scarinci, R., Markov, I. & Bierlaire, M. Network design of a transport system based on accelerating moving walkways. Transp. Res. Part C. 80, 310–328 (2017).
Venkataraman, N. L. & Kumar, R. Design and analysis of application specific network on chip for reliable custom topology. Comput. Netw. 158, 69–76 (2019).
Boyken, S. E. et al. De novo design of protein homo-oligomers with modular hydrogen-bond network–mediated specificity. Science 352, 680–687 (2016).
Elhayatmy, G., Dey, N. & Ashour, A. S. Internet of things based wireless body area network in healthcare. In Internet of Things and Big Data Analytics Toward Next-Generation Intelligence, 3–20 (Springer International Publishing, 2018).
Albert, R., Jeong, H. & Barabási, A.-L. Error and attack tolerance of complex networks. Nature 406, 378–382 (2000).
Motter, A. E. & Lai, Y.-C. Cascade-based attacks on complex networks. Phys. Rev. E 66, 065102 (2002).
Buldyrev, S. V., Parshani, R., Paul, G., Stanley, H. E. & Havlin, S. Catastrophic cascade of failures in interdependent networks. Nature 464, 1025–1028 (2010).
Wang, W., Yang, S., Stanley, H. E. & Gao, J. Local floods induce large-scale abrupt failures of road networks. Nat. Commun. 10, 2114 (2019).
Gao, J., Barzel, B. & Barabási, A.-L. Universal resilience patterns in complex networks. Nature 530, 307–312 (2016).
Liu, X. et al. Network resilience. Phys. Rep. 971, 1–108 (2022).
Radicchi, F. & Bianconi, G. Redundant interdependencies boost the robustness of multiplex networks. Phys. Rev. X 7, 011013 (2017).
Beygelzimer, A., Grinstein, G., Linsker, R. & Rish, I. Improving network robustness by edge modification. Phys. A 357, 593–612 (2005).
Zhao, J. & Xu, K. Enhancing the robustness of scale-free networks. J. Phys. A 42, 195003 (2009).
Cao, X.-B., Hong, C., Du, W.-B. & Zhang, J. Improving the network robustness against cascading failures by adding links. Chaos, Solit. Fractals 57, 35–40 (2013).
Zheng, Y. et al. Spatial planning of urban communities via deep reinforcement learning. Nat. Comput. Sci. 3, 748–762 (2023).
Mirhoseini, A. et al. A graph placement methodology for fast chip design. Nature 594, 207–212 (2021).
Watson, J. L. et al. De novo design of protein structure and function with RFdiffusion. Nature 620, 1089–1100 (2023).
Carchiolo, V., Grassia, M., Longheu, A., Malgeri, M. & Mangioni, G. Network robustness improvement via long-range links. Comput. Soc. Netw. 6, 1–16 (2019).
Alenazi, M. J., Cetinkaya, E. K. & Sterbenz, J. P. Cost-efficient algebraic connectivity optimisation of backbone networks. Opt. Switch. Netw. 14, 107–116 (2014).
Paul, G., Tanizawa, T., Havlin, S. & Stanley, H. E. Optimization of robustness of complex networks. Eur. Phys. J. B 38, 187–191 (2004).
Valente, A. X., Sarkar, A. & Stone, H. A. Two-peak and three-peak optimal complex networks. Phys. Rev. Lett. 92, 118702 (2004).
Alenazi, M. J., Cetinkaya, E. K. & Sterbenz, J. P. Cost-constrained and centrality-balanced network design improvement. In Proc. 6th International Workshop on Reliable Networks Design and Modeling (RNDM) 194–201 (IEEE, 2014).
Herrmann, H. J., Schneider, C. M., Moreira, A. A., Andrade, J. S. & Havlin, S. Onion-like network topology enhances robustness against malicious attacks. J. Stat. Mech. 2011, P01027 (2011).
Zeng, A. & Liu, W. Enhancing network robustness against malicious attacks. Phys. Rev. E 85, 066130 (2012).
Buesser, P., Daolio, F. & Tomassini, M. Optimizing the robustness of scale-free networks with simulated annealing. in Adaptive and Natural Computing Algorithms 167–176 (Springer, 2011).
Artime, O. et al. Robustness and resilience of complex networks. Nat. Rev. Phys. 6, 114–131 (2024).
Rubinstein, R. Y. & Kroese, D. P. Simulation and the Monte Carlo Method (John Wiley & Sons, 2016).
Ashlock, D. Evolutionary Computation for Modeling and Optimization (Springer, 2006).
Ong, Y.-S., Zhou, Z. & Lim, D. Curse and blessing of uncertainty in evolutionary algorithm using approximation. In Proc. IEEE International Conference on Evolutionary Computation 2928–2935 (IEEE, 2006).
Fan, C., Zeng, L., Sun, Y. & Liu, Y.-Y. Finding key players in complex networks through deep reinforcement learning. Nat. Mach. Intell. 2, 317–324 (2020).
Zhang, J. & Wang, B. Dismantling complex networks by a neural model trained from tiny networks. In Proc. 31st ACM International Conference on Information & Knowledge Management 2559–2568 (ACM, 2022).
Grassia, M., De Domenico, M. & Mangioni, G. Machine learning dismantling and early-warning signals of disintegration in complex systems. Nat. Commun. 12, 5190 (2021).
Schneider, C. M., Moreira, A. A., Andrade, J. S., Havlin, S. & Herrmann, H. J. Mitigation of malicious attacks on networks. Proc. Natl. Acad. Sci. Usa. 108, 3838–3841 (2011).
Eswaran, K. P. & Tarjan, R. E. Augmentation problems. SIAM J. Comput. 5, 653–665 (1976).
Schulman, J., Moritz, P., Levine, S., Jordan, M. I. & Abbeel, P. High-dimensional continuous control using generalized advantage estimation. 4th International Conference on Learning Representations (2016).
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., & Klimov, O. Proximal policy optimization algorithms. Preprint at https://arxiv.org/abs/1707.06347 (2017).
Holme, P., Kim, B. J., Yoon, C. N. & Han, S. K. Attack vulnerability of complex networks. Phys. Rev. E 65, 056109 (2002).
Morone, F. & Makse, H. A. Influence maximization in complex networks through optimal percolation. Nature 524, 65–68 (2015).
Ren, X.-L., Gleinig, N., Helbing, D. & Antulov-Fantulin, N. Generalized network dismantling. Proc. Natl. Acad. Sci. USA 116, 6554–6559 (2019).
Braunstein, A., Dall’Asta, L., Semerjian, G. & Zdeborová, L. Network dismantling. Proc. Natl. Acad. Sci. USA 113, 12368–12373 (2016).
Bertsimas, D. & Tsitsiklis, J. Simulated annealing. Stat. Sci. 8, 10–15 (1993).
Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. Science 286, 509–512 (1999).
Latora, V. & Marchiori, M. Efficient behavior of small-world networks. Phys. Rev. Lett. 87, 198701 (2001).
Rousseeuw, P. J. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987).
Squartini, T., van Lelyveld, I. & Garlaschelli, D. Early-warning signals of topological collapse in interbank networks. Sci. Rep. 3, 3357 (2013).
Bauch, C. T., Sigdel, R., Pharaon, J. & Anand, M. Early warning signals of regime shifts in coupled human–environment systems. Proc. Natl. Acad. Sci. USA 113, 14560–14567 (2016).
Dakos, V. & Bascompte, J. Critical slowing down as early warning for the onset of collapse in mutualistic communities. Proc. Natl. Acad. Sci. USA 111, 17546–17551 (2014).
Zhou, D. & Elmokashfi, A. Network recovery based on system crash early warning in a cascading failure model. Sci. Rep. 8, 7443 (2018).
Smith, A. M. et al. Competitive percolation strategies for network recovery. Sci. Rep. 9, 11843 (2019).
Pan, X. & Wang, H. Resilience of and recovery strategies for weighted networks. PLoS ONE 13, e0203894 (2018).
Carlson, J. M. & Doyle, J. Highly optimized tolerance: robustness and design in complex systems. Phys. Rev. Lett. 84, 2529–2532 (2000).
Doyle, J. C. et al. The “robust yet fragile” nature of the Internet. Proc. Natl. Acad. Sci. USA 102, 14497–14502 (2005).
Carlson, J. M. & Doyle, J. Complexity and robustness. Proc. Natl. Acad. Sci. USA 99, 2538–2545 (2002).
Cohen, R., Erez, K., ben-Avraham, D. & Havlin, S. Breakdown of the Internet under intentional attack. Phys. Rev. Lett. 86, 3682–3685 (2001).
University of Washington CSE. Rocketfuel: an ISP topology mapping engine. Available at https://research.cs.washington.edu/networking/rocketfuel/ (2024).
Reis, S. D. S. et al. Avoiding catastrophic failure in correlated networks of networks. Nat. Phys. 10, 762–767 (2014).
Smart, A. G., Amaral, L. A. N. & Ottino, J. M. Cascading failure and robustness in metabolic networks. Proc. Natl. Acad. Sci. USA 105, 13223–13228 (2008).
Duan, J., Li, D. & Huang, H.-J. Reliability of the traffic network against cascading failures with individuals acting independently or collectively. Transp. Res. Part C. 147, 104017 (2023).
Schneider, C. M., Yazdani, N., Araújo, N. A. M., Havlin, S. & Herrmann, H. J. Towards designing robust coupled networks. Sci. Rep. 3, 1969 (2013).
Zeng, G. et al. Multiple metastable network states in urban traffic. Proc. Natl. Acad. Sci. USA 117, 17528–17534 (2020).
Duan, J. et al. Spatiotemporal dynamics of traffic bottlenecks yields an early signal of heavy congestions. Nat. Commun. 14, 8002 (2023).
Marculescu, R. & Bogdan, P. The chip is the network: toward a science of network-on-chip design. Found. Trends Electron. Des. Autom. 2, 371–461 (2009).
Liu, Y. et al. Large-scale network lifetime inference based on universal scaling function. IEEE Internet Things J. 11, 23123–23139 (2024).
Daqing, L., Kosmidis, K., Bunde, A. & Havlin, S. Dimension of spatially embedded networks. Nat. Phys. 7, 481–484 (2011).
Liu, Q., Allamanis, M., Brockschmidt, M. & Gaunt, A. Constrained graph variational autoencoders for molecule design. Advances in Neural Information Processing Systems 31 https://doi.org/10.48550/arXiv.1805.09076 (2018).
Khailany, B. et al. Accelerating chip design with machine learning. IEEE Micro 40, 23–32 (2020).
Mureddu, M. Representation of the German transmission grid for renewable energy sources impact analysis. Preprint at https://arxiv.org/abs/1612.05532 (2016).
Šubelj, L. & Bajec, M. Robust network community detection using balanced propagation. Eur. Phys. J. B 81, 353–362 (2011).
Latora, V., Nicosia, V. & Russo, G. Complex Networks: Principles, Methods and Applications (Cambridge University Press, 2017).
Tornatore, M. et al. A survey on network resiliency methodologies against weather-based disruptions. In Proc. 8th International Workshop on Resilient Networks Design and Modeling (RNDM) 23–34 (IEEE, 2016).
Hadzilacos, V. & Toueg, S. A Modular Approach to Fault-Tolerant Broadcasts and Related Problems (Cornell University, Department of Computer Science, 1994).
Morris, J., Kroening, D. & Koopman, P. Fault tolerance tradeoffs in moving from decentralized to centralized embedded systems. In International Conference on Dependable Systems and Networks 377–386 (IEEE, 2004).
Moussa, N., Hamidi-Alaoui, Z. & El Alaoui, A. E. B. CFTM: a centralized fault tolerant mechanism for wireless sensor networks. In Proc. 5th International Conference on Optimization and Applications (ICOA) 1–6 (IEEE, 2019).
Yuan, X., Hu, Y., Stanley, H. E. & Havlin, S. Eradicating catastrophic collapse in interdependent networks via reinforced nodes. Proc. Natl. Acad. Sci. USA 114, 3311–3315 (2017).
Kfir-Cohen, Y., Vaknin Ben Porath, D. & Havlin, S. Optimization of robustness based on reinforced nodes in a modular network. Europhys. Lett. 137, 41003 (2022).
Oh, S. J., Schiele, B. & Fritz, M. Towards reverse-engineering black-box neural networks. In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning 121–144 (Springer, 2019).
Yu, T. et al. Meta-world: a benchmark and evaluation for multi-task and meta reinforcement learning. Conference on Robot Learning 1094–1100 (PMLR, 2020).
Grover, A., Al-Shedivat, M., Gupta, J., Burda, Y. & Edwards, H. Learning policy representations in multiagent systems. In International Conference on Machine Learning 1802–1811 (PMLR, 2018).
Tampuu, A. et al. Multiagent cooperation and competition with deep reinforcement learning. PLoS ONE 12, e0172395 (2017).
Duan, K. et al. A comprehensive study on large-scale graph training: Benchmarking and rethinking. Adv. Neural Inf. Process. Syst. 35, 5376–5389 (2022).
Gupta, V. et al. GraphScale: a framework to enable machine learning over billion-node graphs. In Proc. 33rd ACM International Conference on Information and Knowledge Management 4514–4521 (ACM, 2024).
Xu, H. et al. Learning to reduce the scale of large graphs: a comprehensive survey. ACM Trans. Knowl. Discov. Data 19, 1–25 (2025).
Zeng, H., Zhou, H., Srivastava, A., Kannan, R. & Prasanna, V. GraphSAINT: Graph sampling based inductive learning method. In International Conference on Learning Representations (ICLR) (OpenReview.net, 2020).
Chu, T., Qu, S. & Wang, J. Large-scale multi-agent reinforcement learning using image-based state representation. In Proc. IEEE 55th Conference on Decision and Control (CDC) 7592–7597 (IEEE, 2016).
Ma, C., Li, A., Du, Y., Dong, H. & Yang, Y. Efficient and scalable reinforcement learning for large-scale network control. Nat. Mach. Intell. 6, 1006–1020 (2024).
Chu, T., Wang, J., Codecà, L. & Li, Z. Multi-agent deep reinforcement learning for large-scale traffic signal control. IEEE Trans. Intell. Transp. Syst. 21, 1086–1095 (2019).
Jin, Y., Wei, S., Yuan, J. & Zhang, X. Hierarchical and stable multiagent reinforcement learning for cooperative navigation control. IEEE Trans. Neural Netw. Learn. Syst. 34, 90–103 (2021).
Tan, T., Chu, T. & Wang, J. Multi-agent bootstrapped deep Q-network for large-scale traffic signal control. In Proc. IEEE Conference on Control Technology and Applications (CCTA) 358–365 (IEEE, 2020).
Christianos, F., Papoudakis, G., Rahman, M. A. & Albrecht, S. V. Scaling multi-agent reinforcement learning with selective parameter sharing. In International Conference on Machine Learning 1989–1998 (PMLR, 2021).
Li, Y.-P., Tan, S.-Y., Deng, Y. & Wu, J. Attacker-defender game from a network science perspective. Chaos 28, 051102 (2018).
Chaoqi, F. et al. Attack-defense game for critical infrastructure considering the cascade effect. Reliab. Eng. Syst. Saf. 216, 107958 (2021).
Wu, Y., Guo, P., Wang, Y. & Zio, E. Attack-defense game modeling framework from a vulnerability perspective to protect critical infrastructure systems. Reliab. Eng. Syst. Saf. 256, 110740 (2025).
Chen, K., Nguyen, T. & Hassanaly, M. Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection. Preprint at https://arxiv.org/abs/2411.12130 (2024).
Brody, S., Alon, U. & Yahav, E. How attentive are graph attention networks? International Conference on Learning Representations (OpenReview.net, 2022).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 770–778 (IEEE, 2016).
Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
Arnaboldi, V., Conti, M., Passarella, A. & Pezzoni, F. Analysis of ego network structure in online social networks. In Proc. International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing 31–40 (IEEE, 2012).
DeJordy, R. & Halgin, D. Introduction to Ego Network Analysis (Boston College and the Winston Center for Leadership & Ethics, 2008).
Bonacich, P. Some unique properties of eigenvector centrality. Soc. Netw. 29, 555–564 (2007).
Kitsak, M. et al. Identification of influential spreaders in complex networks. Nat. Phys. 6, 888–893 (2010).
Ahmed, M., Seraj, R. & Islam, S. M. S. The k-means algorithm: a comprehensive survey and performance evaluation. Electronics 9, 1295 (2020).
Darvariu, V.-A., Hailes, S. & Musolesi, M. Goal-directed graph construction using reinforcement learning. Proc. R. Soc. A 477, 20210168 (2021).
Yang, S., Kaili, M., Wang, B., Yu, T. & Zha, H. Learning to boost resilience of complex networks via neural edge rewiring. Trans. Mach. Learn. Res. (2023).
Hu, Q., Li, R., Deng, Q., Zhao, Y. & Li, R. Enhancing network by reinforcement learning and neural confined local search. In Proc. Thirty-Second International Joint Conference on Artificial Intelligence 2122–2132 (International Joint Conferences on Artificial Intelligence, 2023).
Acknowledgements
This work is supported by the National Natural Science Foundation of China (Grants 72225012, D.L.; 72288101, D.L.; and 71822101, D.L.), the National Key Research and Development Program of China (2023YFB4302901, D.L.), the Fundamental Research Funds for the Central Universities (D.L.).
Author information
Authors and Affiliations
Contributions
B.Z. and D.L. designed the main idea of the research; B.Z. and T.Z. designed the RL framework; B.Z. performed the experiments; B.Z., J.G., S.H. and D.L. conducted the theoretical analysis. All authors contributed to discussing the results and writing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhu, B., Zhu, T., Gao, J. et al. Design of robust networks via reinforcement learning prompts the emergence of multi-backbones. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70745-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-70745-0


