Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs

A preprint version of the article is available at ChemRxiv.

Abstract

Applications of deep learning (DL) to design nanomaterials are hampered by a lack of suitable data representations and training data. Here we report efforts to overcome these limitations and leverage DL to optimize the nonlinear optical properties of core–shell upconverting nanoparticles (UCNPs). UCNPs, which have applications in fields such as biosensing, super-resolution microscopy and three-dimensional printing, can emit visible and ultraviolet light from near-infrared excitations. We report a large-scale dataset of UCNP emission spectra based on accurate but expensive kinetic Monte Carlo simulations (N > 6,000) and use these data to train a heterogeneous graph neural network using a physically motivated representation of UCNP nanostructure. Applying gradient-based optimization on the trained graph neural network, we identify structures with 6.5× higher predicted emission under 800-nm illumination than any UCNP in our training set. Our work reveals design principles for UCNP heterostructures and presents a roadmap for DL-based inverse design of nanomaterials.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: DL approach to enable UCNP heterostructure optimization.
Fig. 2: Description of SUNSET datasets.
Fig. 3: UCNP graph representation and hetero-GNN model architecture.
Fig. 4: Increasing hetero-GNN accuracy using on-the-fly data augmentation to promote subdivision invariance.
Fig. 5: Gradient-based optimization of UCNP heterostructure.

Similar content being viewed by others

Data availability

The SUNSET dataset is freely available via Figshare at https://doi.org/10.6084/m9.figshare.25130921 (ref. 61). Each subset (for instance, SUNSET-1) is presented in Javascript Object Notation (JSON) format; separate JSON files are also provided for ID and OOD collections for each subset. The hetero-GNN models used for UCNP optimization are freely available via Figshare at https://doi.org/10.6084/m9.figshare.27941694.v1 (ref. 62). The nanoparticle structures discovered by optimizing the hetero-GNN model are available via Figshare at https://doi.org/10.6084/m9.figshare.27973206 (ref. 63). Data for Figs. 2 and 5 are available via Figshare at https://doi.org/10.6084/m9.figshare.29916992 (ref. 64).

Code availability

The RNMC program, which contains the NPMC kMC tool, is available via GitHub at https://github.com/BlauGroup/RNMC and via Zenodo at https://doi.org/10.5281/zenodo.14360064 (ref. 65). Code defining the ML representations, data featurization and model training is available via GitHub at https://github.com/BlauGroup/NanoParticleTools and via Zenodo at https://doi.org/10.5281/zenodo.16878169 (ref. 66).

References

  1. Voznyy, O., Sutherland, B. R., Ip, A. H., Zhitomirsky, D. & Sargent, E. H. Engineering charge transport by heterostructuring solution-processed semiconductors. Nat. Rev. Mater. 2, 17026 (2017).

    Article  Google Scholar 

  2. Sun, W. et al. Machine learning–assisted molecular design and efficiency prediction for high-performance organic photovoltaic materials. Sci. Adv. 5, 4275 (2019).

    Article  Google Scholar 

  3. Sanchez-Lengeling, B. & Aspuru-Guzik, A. Inverse molecular design using machine learning: generative models for matter engineering. Science 361, 360–365 (2018).

    Article  Google Scholar 

  4. Wu, S. et al. Machine-learning-assisted discovery of polymers with high thermal conductivity using a molecular design algorithm. npj Comput. Mater. 5, 66 (2019).

    Article  Google Scholar 

  5. Skripka, A. et al. A generalized approach to photon avalanche upconversion in luminescent nanocrystals. Nano Lett. 23, 7100–7106 (2023).

    Article  Google Scholar 

  6. Zhong, Y. et al. Elimination of photon quenching by a transition layer to fabricate a quenching-shield sandwich structure for 800 nm excited upconversion luminescence of Nd3+-sensitized nanoparticles. Adv. Mater. 26, 2831–2837 (2014).

    Article  Google Scholar 

  7. So, S., Mun, J. & Rho, J. Simultaneous inverse design of materials and structures via deep learning: demonstration of dipole resonance engineering using core–shell nanoparticles. ACS Appl. Mater. Interfaces 11, 24264–24268 (2019).

    Article  Google Scholar 

  8. Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 4, 4206 (2018).

    Article  Google Scholar 

  9. Xia, X., Sivonxay, E., Helms, B. A., Blau, S. M. & Chan, E. M. Accelerating the design of multishell upconverting nanoparticles through Bayesian optimization. Nano Lett. 23, 11129–11136 (2023).

    Article  Google Scholar 

  10. Liu, G.-X. et al. Inverse design in quantum nanophotonics: combining local-density-of-states and deep learning. Nanophotonics 12, 1943–1955 (2023).

    Article  Google Scholar 

  11. Hamachi, L. S. et al. Precursor reaction kinetics control compositional grading and size of cdse1-xsx nanocrystal heterostructures. Chem. Sci. 10, 6539–6552 (2019).

    Article  Google Scholar 

  12. Albrecht, M. et al. Magnetic multilayers on nanospheres. Nat. Mater. 4, 203–206 (2005).

    Article  Google Scholar 

  13. Xu, M., Liang, T., Shi, M. & Chen, H. Graphene-like two-dimensional materials. Chem. Rev. 113, 3766–3798 (2013).

    Article  Google Scholar 

  14. Chen, Y. et al. Two-dimensional metal nanomaterials: synthesis, properties, and applications. Chem. Rev. 118, 6409–6455 (2018).

    Article  Google Scholar 

  15. Xie, C., Niu, Z., Kim, D., Li, M. & Yang, P. Surface and interface control in nanoparticle catalysis. Chem. Rev. 120, 1184–1249 (2019).

    Article  Google Scholar 

  16. Haghighatlari, M. et al. Learning to make chemical predictions: the interplay of feature representation, data, and machine learning methods. Chem 6, 1527–1542 (2020).

    Article  Google Scholar 

  17. Krenn, M., Häse, F., Nigam, A., Friederich, P. & Aspuru-Guzik, A. Self-referencing embedded strings (SELFIES): a 100% robust molecular string representation. Mach. Learn. Sci. Technol. 1, 045024 (2020).

    Article  Google Scholar 

  18. Weininger, D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 28, 31–36 (1988).

    Article  Google Scholar 

  19. O’Brien, M. N., Jones, M. R. & Mirkin, C. A. The nature and implications of uniformity in the hierarchical organization of nanomaterials. Proc. Natl Acad. Sci. USA 113, 11717–11725 (2016).

    Article  Google Scholar 

  20. Liang, R. et al. Hierarchically engineered nanostructures from compositionally anisotropic molecular building blocks. Nat. Mater. 21, 1434–1440 (2022).

    Article  Google Scholar 

  21. Brown, K. A., Brittman, S., Maccaferri, N., Jariwala, D. & Celano, U. Machine learning in nanoscience: big data at small scales. Nano Lett. 20, 2–10 (2019).

    Article  Google Scholar 

  22. Zhou, J. et al. 2Dmatpedia, an open computational database of two-dimensional materials from top-down and bottom-up approaches. Sci. Data 6, 86 (2019).

    Article  Google Scholar 

  23. Reker, D. et al. Computationally guided high-throughput design of self-assembling drug nanoparticles. Nat. Nanotechnol. 16, 725–733 (2021).

    Article  Google Scholar 

  24. McCandler, C. A., Dahl, J. C. & Persson, K. A. Phosphine-stabilized hidden ground states in gold clusters investigated via a Aun(PH3)m database. ACS Nano 17, 1012–1021 (2022).

    Article  Google Scholar 

  25. Barnard, A. et al. Nanoinformatics, and the big challenges for the science of small things. Nanoscale 11, 19190–19201 (2019).

    Article  Google Scholar 

  26. Hlaváček, A. et al. Bioconjugates of photon-upconversion nanoparticles for cancer biomarker detection and imaging. Nat. Protoc. 17, 1028–1072 (2022).

    Article  Google Scholar 

  27. Pan, J.-A., Qi, X. & Chan, E. M. Enhanced upconversion and photoconductive nanocomposites of lanthanide-doped nanoparticles functionalized with low-vibrational-energy inorganic ligands. Nanoscale Horiz. 10, 596–604 (2025).

    Article  Google Scholar 

  28. Sanders, S. N. et al. Triplet fusion upconversion nanocapsules for volumetric 3d printing. Nature 604, 474–478 (2022).

    Article  Google Scholar 

  29. He, M. et al. Monodisperse dual-functional upconversion nanoparticles enabled near-infrared organolead halide perovskite solar cells. Angew. Chem. Int. Ed. 55, 4280–4284 (2016).

    Article  Google Scholar 

  30. Teitelboim, A. et al. Energy transfer networks within upconverting nanoparticles are complex systems with collective, robust, and history-dependent dynamics. J. Phys. Chem. C 123, 2678–2689 (2019).

    Article  Google Scholar 

  31. Zichi, L. et al. RNMC: kinetic Monte Carlo implementations for complex reaction networks. J. Open Source Softw. 9, 7244 (2024).

    Article  Google Scholar 

  32. Xu, X. et al. Optimising passivation shell thickness of single upconversion nanoparticles using a time-resolved spectrometer. APL Photonics 4, 026104 (2019).

    Article  Google Scholar 

  33. Wu, X. et al. Dye-sensitized core/active shell upconversion nanoparticles for optogenetics and bioimaging applications. ACS Nano 10, 1060–1066 (2016).

    Article  Google Scholar 

  34. Corso, G., Stark, H., Jegelka, S., Jaakkola, T. & Barzilay, R. Graph neural networks. Nature Rev. Methods Primers 4, 17 (2024).

    Article  Google Scholar 

  35. Shi, R. & Mudring, A.-V. Phonon-mediated nonradiative relaxation in ln3+-doped luminescent nanocrystals. ACS Mater. Lett. 4, 1882–1903 (2022).

    Article  Google Scholar 

  36. Perez, E., Strub, F., De Vries, H., Dumoulin, V. & Courville, A. Film: visual reasoning with a general conditioning layer. Proc. AAAI Conf. Artif. Intell. 32, 39412–3951 (2018).

    Google Scholar 

  37. Brody, S., Alon, U. & Yahav, E. How attentive are graph attention networks. In International Conference on Learning Representations (ICLR, 2022).

  38. Chan, E. M., Levy, E. S. & Cohen, B. E. Rationally designed energy transfer in upconverting nanoparticles. Adv. Mater. 27, 5753–5761 (2015).

    Article  Google Scholar 

  39. Skripka, A. & Chan, E.M. Unraveling the myths and mysteries of photon avalanching nanoparticles. Mater. Horiz. https://doi.org/10.1039/D4MH01798F (2025).

  40. Chan, E. M. Combinatorial approaches for developing upconverting nanomaterials: high-throughput screening, modeling, and applications. Chem. Soc. Rev. 44, 1653–1679 (2015).

    Article  Google Scholar 

  41. Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).

    Article  Google Scholar 

  42. Conn, A. R., Gould, N. I. & Toint, P. L. Trust Region Methods (SIAM, 2000).

  43. Gargas, D. J. et al. Engineering bright sub-10-nm upconverting nanocrystals for single-molecule imaging. Nat. Nanotechnol. 9, 300–305 (2014).

    Article  Google Scholar 

  44. Tian, B. et al. Low irradiance multiphoton imaging with alloyed lanthanide nanocrystals. Nat. Commun. 9, 3082 (2018).

    Article  Google Scholar 

  45. Xie, X. et al. Mechanistic investigation of photon upconversion in Nd3+-sensitized core-shell nanoparticles. J. Am. Chem. Soc. 135, 12608–12611 (2013).

    Article  Google Scholar 

  46. Li, X., Wang, R., Zhang, F. & Zhao, D. Engineering homogeneous doping in single nanoparticle to enhance upconversion efficiency. Nano Lett. 14, 3634–3639 (2014).

    Article  Google Scholar 

  47. Du, Y., Chalapathi, N. & Krishnapriyan, A. Neural spectral methods: self-supervised learning in the spectral domain. In Proc. Twelfth International Conference on Learning Representations (2024).

  48. Chalapathi, N., Du, Y. & Krishnapriyan, A. Scaling physics-informed hard constraints with mixture-of-experts. In Proc. Twelfth International Conference on Learning Representations (2024).

  49. Wahl, C. B. et al. Machine learning-accelerated design and synthesis of polyelemental heterostructures. Sci. Adv. 7, 5505 (2021).

    Article  Google Scholar 

  50. Yan, Y., Such, G. K., Johnston, A. P., Lomas, H. & Caruso, F. Toward therapeutic delivery with layer-by-layer engineered particles. ACS Nano 5, 4252–4257 (2011).

    Article  Google Scholar 

  51. Lee, J.-H., Loya, P. E., Lou, J. & Thomas, E. L. Dynamic mechanical behavior of multilayer graphene via supersonic projectile penetration. Science 346, 1092–1096 (2014).

    Article  Google Scholar 

  52. Gillespie, D. T. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 22, 403–434 (1976).

    Article  MathSciNet  Google Scholar 

  53. Andersen, M., Panosetti, C. & Reuter, K. A practical guide to surface kinetic monte carlo simulations. Front. Chem. https://doi.org/10.3389/fchem.2019.00202 (2019)

  54. Chatterjee, A. & Voter, A. F. Accurate acceleration of kinetic monte carlo simulations through the modification of rate constants. J. Chem. Phys. 132, 194101 (2010).

    Article  Google Scholar 

  55. Pini, F. et al. Spatial and temporal resolution of luminescence quenching in small upconversion nanocrystals. ACS Appl. Mater. Interfaces 14, 11883–11894 (2022).

    Article  Google Scholar 

  56. Jain, A. et al. Fireworks: a dynamic workflow system designed for high-throughput applications. Concurr. Comput. Pract. Exp. 27, 5037–5059 (2015).

    Article  Google Scholar 

  57. Rosen, A. S. et al. Jobflow: computational workflows made simple. J. Open Source Softw. 9, 5995 (2024).

    Article  Google Scholar 

  58. Wu, C.-F. J. Jackknife, bootstrap and other resampling methods in regression analysis. Ann. Stat. 14, 1261–1295 (1986).

    MathSciNet  Google Scholar 

  59. Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. J. Big Data 6, 1–48 (2019).

    Article  Google Scholar 

  60. Batzner, S. et al. E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).

    Article  Google Scholar 

  61. Sivonxay, E., Attia, L., Chan, E. M. & Blau, S. M. SUNSET Dataset (For ‘Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs’). Version v1. Figshare https://doi.org/10.6084/m9.figshare.25130921.v1 (2025).

  62. Attia, L. Hetero-GNN checkpoints. Version v1. Figshare https://doi.org/10.6084/m9.figshare.27941694.v1 (2024).

  63. Attia, L. Optimized nanoparticle structures. Version v1. Figshare https://doi.org/10.6084/m9.figshare.27973206.v1 (2024).

  64. Sivonxay, E., Attia, L., Blau, S. & Chan, E. M. Source data (For ‘Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs’). Version v1. Figshare https://doi.org/10.6084/m9.figshare.29916992.v1 (2025).

  65. Zichi, L. et al. RNMC: kinetic Monte Carlo implementations for complex reaction networks. Version v1.1.1. Zenodo https://doi.org/10.5281/zenodo.14360064 (2024).

  66. Sivonxay, E., Attia, L., Barter, D., Xia, X. & Blau, S. M. BlauGroup/NanoParticleTools. Version Publication. Zenodo https://doi.org/10.5281/zenodo.16878169 (2025).

Download references

Acknowledgements

This work was primarily funded by the Charter Hill Laboratory Directed Research and Development program of Lawrence Berkeley National Laboratory (LBNL), supported by the Office of Science, Office of Basic Energy Sciences (BES), of the US Department of Energy (DOE) under contract no. DE-AC02-05CH11231. This DOE-BES contract also supported work at the Molecular Foundry as well as computational resources at the National Energy Research Scientific Computing Center (NERSC, award no. BES-ERCAP0023292) and the Lawrencium computational cluster provided by the LBNL IT Division. We are especially grateful for the NERSC and Lawrencium low-priority queues, without which this work would not have been possible. L.A. was supported by the DOE Computational Science Graduate Fellowship under award no. DE-SC0022158. E.W.C.S.-S. was supported by the Carnegie Bosch Institute Postdoctoral Fellowship. We thank K. Chiao for helpful discussions related to optimizing C++ code.

Author information

Authors and Affiliations

Authors

Contributions

E.S. and D.B. implemented nanoparticle kMC in C++, and X.X. provided validation against a previous implementation. E.S. and X.X. built simulation analysis capabilities in Python. E.S. implemented the high-throughput workflow infrastructure. E.S. and X.X. ran kMC simulations to construct SUNSET. E.S. implemented RFR, FCNN, CNN and homo-GNN models and trained on SUNSET data. E.S. and L.A. implemented the hetero-GNN model and trained on SUNSET data. L.A. and E.S. implemented the on-the-fly data augmentation scheme. E.S. implemented and carried out global optimization, and then ran kMC on the optimized particles to validate predicted intensities. B.S.-L. provided important input on components of the GNN architecture and global optimization. E.S., L.A., E.W.C.S.-S., E.M.C. and S.M.B. wrote the manuscript. E.M.C. and S.M.B. conceived and supervised the project.

Corresponding authors

Correspondence to Emory M. Chan or Samuel M. Blau.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks Kuan Sun and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Katilin McCardle, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Supplementary Figs. 1–11, Discussion and Tables 1–3.

Peer Review File (download PDF )

Supplementary Video 1 (download MP4 )

Visualization of a local optimization trajectory for a particle with eight regions, illustrating how the layer geometry and dopant concentrations are modified to maximize emission intensity.

Source data

Source Data Fig. 2 (download XLSX )

Data used to generate the subfigures in Fig. 2.

Source Data Fig. 5 (download XLSX )

Data used to plot the heat map in Fig. 5.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sivonxay, E., Attia, L., Spotte-Smith, E.W.C. et al. Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs. Nat Comput Sci 6, 83–95 (2026). https://doi.org/10.1038/s43588-025-00917-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s43588-025-00917-3

Search

Quick links

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics