Abstract
Applications of deep learning (DL) to design nanomaterials are hampered by a lack of suitable data representations and training data. Here we report efforts to overcome these limitations and leverage DL to optimize the nonlinear optical properties of core–shell upconverting nanoparticles (UCNPs). UCNPs, which have applications in fields such as biosensing, super-resolution microscopy and three-dimensional printing, can emit visible and ultraviolet light from near-infrared excitations. We report a large-scale dataset of UCNP emission spectra based on accurate but expensive kinetic Monte Carlo simulations (N > 6,000) and use these data to train a heterogeneous graph neural network using a physically motivated representation of UCNP nanostructure. Applying gradient-based optimization on the trained graph neural network, we identify structures with 6.5× higher predicted emission under 800-nm illumination than any UCNP in our training set. Our work reveals design principles for UCNP heterostructures and presents a roadmap for DL-based inverse design of nanomaterials.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout





Similar content being viewed by others
Data availability
The SUNSET dataset is freely available via Figshare at https://doi.org/10.6084/m9.figshare.25130921 (ref. 61). Each subset (for instance, SUNSET-1) is presented in Javascript Object Notation (JSON) format; separate JSON files are also provided for ID and OOD collections for each subset. The hetero-GNN models used for UCNP optimization are freely available via Figshare at https://doi.org/10.6084/m9.figshare.27941694.v1 (ref. 62). The nanoparticle structures discovered by optimizing the hetero-GNN model are available via Figshare at https://doi.org/10.6084/m9.figshare.27973206 (ref. 63). Data for Figs. 2 and 5 are available via Figshare at https://doi.org/10.6084/m9.figshare.29916992 (ref. 64).
Code availability
The RNMC program, which contains the NPMC kMC tool, is available via GitHub at https://github.com/BlauGroup/RNMC and via Zenodo at https://doi.org/10.5281/zenodo.14360064 (ref. 65). Code defining the ML representations, data featurization and model training is available via GitHub at https://github.com/BlauGroup/NanoParticleTools and via Zenodo at https://doi.org/10.5281/zenodo.16878169 (ref. 66).
References
Voznyy, O., Sutherland, B. R., Ip, A. H., Zhitomirsky, D. & Sargent, E. H. Engineering charge transport by heterostructuring solution-processed semiconductors. Nat. Rev. Mater. 2, 17026 (2017).
Sun, W. et al. Machine learning–assisted molecular design and efficiency prediction for high-performance organic photovoltaic materials. Sci. Adv. 5, 4275 (2019).
Sanchez-Lengeling, B. & Aspuru-Guzik, A. Inverse molecular design using machine learning: generative models for matter engineering. Science 361, 360–365 (2018).
Wu, S. et al. Machine-learning-assisted discovery of polymers with high thermal conductivity using a molecular design algorithm. npj Comput. Mater. 5, 66 (2019).
Skripka, A. et al. A generalized approach to photon avalanche upconversion in luminescent nanocrystals. Nano Lett. 23, 7100–7106 (2023).
Zhong, Y. et al. Elimination of photon quenching by a transition layer to fabricate a quenching-shield sandwich structure for 800 nm excited upconversion luminescence of Nd3+-sensitized nanoparticles. Adv. Mater. 26, 2831–2837 (2014).
So, S., Mun, J. & Rho, J. Simultaneous inverse design of materials and structures via deep learning: demonstration of dipole resonance engineering using core–shell nanoparticles. ACS Appl. Mater. Interfaces 11, 24264–24268 (2019).
Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 4, 4206 (2018).
Xia, X., Sivonxay, E., Helms, B. A., Blau, S. M. & Chan, E. M. Accelerating the design of multishell upconverting nanoparticles through Bayesian optimization. Nano Lett. 23, 11129–11136 (2023).
Liu, G.-X. et al. Inverse design in quantum nanophotonics: combining local-density-of-states and deep learning. Nanophotonics 12, 1943–1955 (2023).
Hamachi, L. S. et al. Precursor reaction kinetics control compositional grading and size of cdse1-xsx nanocrystal heterostructures. Chem. Sci. 10, 6539–6552 (2019).
Albrecht, M. et al. Magnetic multilayers on nanospheres. Nat. Mater. 4, 203–206 (2005).
Xu, M., Liang, T., Shi, M. & Chen, H. Graphene-like two-dimensional materials. Chem. Rev. 113, 3766–3798 (2013).
Chen, Y. et al. Two-dimensional metal nanomaterials: synthesis, properties, and applications. Chem. Rev. 118, 6409–6455 (2018).
Xie, C., Niu, Z., Kim, D., Li, M. & Yang, P. Surface and interface control in nanoparticle catalysis. Chem. Rev. 120, 1184–1249 (2019).
Haghighatlari, M. et al. Learning to make chemical predictions: the interplay of feature representation, data, and machine learning methods. Chem 6, 1527–1542 (2020).
Krenn, M., Häse, F., Nigam, A., Friederich, P. & Aspuru-Guzik, A. Self-referencing embedded strings (SELFIES): a 100% robust molecular string representation. Mach. Learn. Sci. Technol. 1, 045024 (2020).
Weininger, D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 28, 31–36 (1988).
O’Brien, M. N., Jones, M. R. & Mirkin, C. A. The nature and implications of uniformity in the hierarchical organization of nanomaterials. Proc. Natl Acad. Sci. USA 113, 11717–11725 (2016).
Liang, R. et al. Hierarchically engineered nanostructures from compositionally anisotropic molecular building blocks. Nat. Mater. 21, 1434–1440 (2022).
Brown, K. A., Brittman, S., Maccaferri, N., Jariwala, D. & Celano, U. Machine learning in nanoscience: big data at small scales. Nano Lett. 20, 2–10 (2019).
Zhou, J. et al. 2Dmatpedia, an open computational database of two-dimensional materials from top-down and bottom-up approaches. Sci. Data 6, 86 (2019).
Reker, D. et al. Computationally guided high-throughput design of self-assembling drug nanoparticles. Nat. Nanotechnol. 16, 725–733 (2021).
McCandler, C. A., Dahl, J. C. & Persson, K. A. Phosphine-stabilized hidden ground states in gold clusters investigated via a Aun(PH3)m database. ACS Nano 17, 1012–1021 (2022).
Barnard, A. et al. Nanoinformatics, and the big challenges for the science of small things. Nanoscale 11, 19190–19201 (2019).
Hlaváček, A. et al. Bioconjugates of photon-upconversion nanoparticles for cancer biomarker detection and imaging. Nat. Protoc. 17, 1028–1072 (2022).
Pan, J.-A., Qi, X. & Chan, E. M. Enhanced upconversion and photoconductive nanocomposites of lanthanide-doped nanoparticles functionalized with low-vibrational-energy inorganic ligands. Nanoscale Horiz. 10, 596–604 (2025).
Sanders, S. N. et al. Triplet fusion upconversion nanocapsules for volumetric 3d printing. Nature 604, 474–478 (2022).
He, M. et al. Monodisperse dual-functional upconversion nanoparticles enabled near-infrared organolead halide perovskite solar cells. Angew. Chem. Int. Ed. 55, 4280–4284 (2016).
Teitelboim, A. et al. Energy transfer networks within upconverting nanoparticles are complex systems with collective, robust, and history-dependent dynamics. J. Phys. Chem. C 123, 2678–2689 (2019).
Zichi, L. et al. RNMC: kinetic Monte Carlo implementations for complex reaction networks. J. Open Source Softw. 9, 7244 (2024).
Xu, X. et al. Optimising passivation shell thickness of single upconversion nanoparticles using a time-resolved spectrometer. APL Photonics 4, 026104 (2019).
Wu, X. et al. Dye-sensitized core/active shell upconversion nanoparticles for optogenetics and bioimaging applications. ACS Nano 10, 1060–1066 (2016).
Corso, G., Stark, H., Jegelka, S., Jaakkola, T. & Barzilay, R. Graph neural networks. Nature Rev. Methods Primers 4, 17 (2024).
Shi, R. & Mudring, A.-V. Phonon-mediated nonradiative relaxation in ln3+-doped luminescent nanocrystals. ACS Mater. Lett. 4, 1882–1903 (2022).
Perez, E., Strub, F., De Vries, H., Dumoulin, V. & Courville, A. Film: visual reasoning with a general conditioning layer. Proc. AAAI Conf. Artif. Intell. 32, 39412–3951 (2018).
Brody, S., Alon, U. & Yahav, E. How attentive are graph attention networks. In International Conference on Learning Representations (ICLR, 2022).
Chan, E. M., Levy, E. S. & Cohen, B. E. Rationally designed energy transfer in upconverting nanoparticles. Adv. Mater. 27, 5753–5761 (2015).
Skripka, A. & Chan, E.M. Unraveling the myths and mysteries of photon avalanching nanoparticles. Mater. Horiz. https://doi.org/10.1039/D4MH01798F (2025).
Chan, E. M. Combinatorial approaches for developing upconverting nanomaterials: high-throughput screening, modeling, and applications. Chem. Soc. Rev. 44, 1653–1679 (2015).
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Conn, A. R., Gould, N. I. & Toint, P. L. Trust Region Methods (SIAM, 2000).
Gargas, D. J. et al. Engineering bright sub-10-nm upconverting nanocrystals for single-molecule imaging. Nat. Nanotechnol. 9, 300–305 (2014).
Tian, B. et al. Low irradiance multiphoton imaging with alloyed lanthanide nanocrystals. Nat. Commun. 9, 3082 (2018).
Xie, X. et al. Mechanistic investigation of photon upconversion in Nd3+-sensitized core-shell nanoparticles. J. Am. Chem. Soc. 135, 12608–12611 (2013).
Li, X., Wang, R., Zhang, F. & Zhao, D. Engineering homogeneous doping in single nanoparticle to enhance upconversion efficiency. Nano Lett. 14, 3634–3639 (2014).
Du, Y., Chalapathi, N. & Krishnapriyan, A. Neural spectral methods: self-supervised learning in the spectral domain. In Proc. Twelfth International Conference on Learning Representations (2024).
Chalapathi, N., Du, Y. & Krishnapriyan, A. Scaling physics-informed hard constraints with mixture-of-experts. In Proc. Twelfth International Conference on Learning Representations (2024).
Wahl, C. B. et al. Machine learning-accelerated design and synthesis of polyelemental heterostructures. Sci. Adv. 7, 5505 (2021).
Yan, Y., Such, G. K., Johnston, A. P., Lomas, H. & Caruso, F. Toward therapeutic delivery with layer-by-layer engineered particles. ACS Nano 5, 4252–4257 (2011).
Lee, J.-H., Loya, P. E., Lou, J. & Thomas, E. L. Dynamic mechanical behavior of multilayer graphene via supersonic projectile penetration. Science 346, 1092–1096 (2014).
Gillespie, D. T. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 22, 403–434 (1976).
Andersen, M., Panosetti, C. & Reuter, K. A practical guide to surface kinetic monte carlo simulations. Front. Chem. https://doi.org/10.3389/fchem.2019.00202 (2019)
Chatterjee, A. & Voter, A. F. Accurate acceleration of kinetic monte carlo simulations through the modification of rate constants. J. Chem. Phys. 132, 194101 (2010).
Pini, F. et al. Spatial and temporal resolution of luminescence quenching in small upconversion nanocrystals. ACS Appl. Mater. Interfaces 14, 11883–11894 (2022).
Jain, A. et al. Fireworks: a dynamic workflow system designed for high-throughput applications. Concurr. Comput. Pract. Exp. 27, 5037–5059 (2015).
Rosen, A. S. et al. Jobflow: computational workflows made simple. J. Open Source Softw. 9, 5995 (2024).
Wu, C.-F. J. Jackknife, bootstrap and other resampling methods in regression analysis. Ann. Stat. 14, 1261–1295 (1986).
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. J. Big Data 6, 1–48 (2019).
Batzner, S. et al. E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
Sivonxay, E., Attia, L., Chan, E. M. & Blau, S. M. SUNSET Dataset (For ‘Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs’). Version v1. Figshare https://doi.org/10.6084/m9.figshare.25130921.v1 (2025).
Attia, L. Hetero-GNN checkpoints. Version v1. Figshare https://doi.org/10.6084/m9.figshare.27941694.v1 (2024).
Attia, L. Optimized nanoparticle structures. Version v1. Figshare https://doi.org/10.6084/m9.figshare.27973206.v1 (2024).
Sivonxay, E., Attia, L., Blau, S. & Chan, E. M. Source data (For ‘Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs’). Version v1. Figshare https://doi.org/10.6084/m9.figshare.29916992.v1 (2025).
Zichi, L. et al. RNMC: kinetic Monte Carlo implementations for complex reaction networks. Version v1.1.1. Zenodo https://doi.org/10.5281/zenodo.14360064 (2024).
Sivonxay, E., Attia, L., Barter, D., Xia, X. & Blau, S. M. BlauGroup/NanoParticleTools. Version Publication. Zenodo https://doi.org/10.5281/zenodo.16878169 (2025).
Acknowledgements
This work was primarily funded by the Charter Hill Laboratory Directed Research and Development program of Lawrence Berkeley National Laboratory (LBNL), supported by the Office of Science, Office of Basic Energy Sciences (BES), of the US Department of Energy (DOE) under contract no. DE-AC02-05CH11231. This DOE-BES contract also supported work at the Molecular Foundry as well as computational resources at the National Energy Research Scientific Computing Center (NERSC, award no. BES-ERCAP0023292) and the Lawrencium computational cluster provided by the LBNL IT Division. We are especially grateful for the NERSC and Lawrencium low-priority queues, without which this work would not have been possible. L.A. was supported by the DOE Computational Science Graduate Fellowship under award no. DE-SC0022158. E.W.C.S.-S. was supported by the Carnegie Bosch Institute Postdoctoral Fellowship. We thank K. Chiao for helpful discussions related to optimizing C++ code.
Author information
Authors and Affiliations
Contributions
E.S. and D.B. implemented nanoparticle kMC in C++, and X.X. provided validation against a previous implementation. E.S. and X.X. built simulation analysis capabilities in Python. E.S. implemented the high-throughput workflow infrastructure. E.S. and X.X. ran kMC simulations to construct SUNSET. E.S. implemented RFR, FCNN, CNN and homo-GNN models and trained on SUNSET data. E.S. and L.A. implemented the hetero-GNN model and trained on SUNSET data. L.A. and E.S. implemented the on-the-fly data augmentation scheme. E.S. implemented and carried out global optimization, and then ran kMC on the optimized particles to validate predicted intensities. B.S.-L. provided important input on components of the GNN architecture and global optimization. E.S., L.A., E.W.C.S.-S., E.M.C. and S.M.B. wrote the manuscript. E.M.C. and S.M.B. conceived and supervised the project.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Computational Science thanks Kuan Sun and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Katilin McCardle, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information (download PDF )
Supplementary Figs. 1–11, Discussion and Tables 1–3.
Supplementary Video 1 (download MP4 )
Visualization of a local optimization trajectory for a particle with eight regions, illustrating how the layer geometry and dopant concentrations are modified to maximize emission intensity.
Source data
Source Data Fig. 2 (download XLSX )
Data used to generate the subfigures in Fig. 2.
Source Data Fig. 5 (download XLSX )
Data used to plot the heat map in Fig. 5.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sivonxay, E., Attia, L., Spotte-Smith, E.W.C. et al. Gradient-based optimization of complex nanoparticle heterostructures enabled by deep learning on heterogeneous graphs. Nat Comput Sci 6, 83–95 (2026). https://doi.org/10.1038/s43588-025-00917-3
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s43588-025-00917-3


