A generative diffusion model for amorphous materials

Yang, Kai; Schwalbe-Koda, Daniel

doi:10.1038/s41524-025-01901-1

Download PDF

Article
Open access
Published: 10 December 2025

A generative diffusion model for amorphous materials

Kai Yang^1,2 &
Daniel Schwalbe-Koda¹

npj Computational Materials volume 12, Article number: 29 (2026) Cite this article

2667 Accesses
18 Altmetric
Metrics details

Subjects

Abstract

Generative models show great promise for the inverse design of molecules and inorganic crystals, but remain largely ineffective within more complex structures such as amorphous materials. Here, we present a diffusion model that reliably generates amorphous structures up to 3 orders of magnitude times faster than conventional simulations across processing conditions, compositions, and data sources. Generated structures recovered the short- and medium-range order, sampling diversity, and macroscopic properties of silica glass, as validated by simulations and an information-theoretical strategy. Conditional generation allowed sampling large structures at low cooling rates of 10⁻² K/ps to uncover a ductile-to-brittle transition and mesoporous silica structures. Extension to metallic glassy systems accurately reproduced local structures and properties from both computational and experimental datasets, demonstrating how synthetic data can be generated from characterization results. Our methods provide a roadmap for the design and simulation of amorphous materials previously inaccessible to computational methods.

Generating 3D images of material microstructures from a single 2D image: a denoising diffusion approach

Article Open access 18 March 2024

Signatures of paracrystallinity in amorphous silicon from machine-learning-driven molecular dynamics

Article Open access 10 March 2025

Mechanisms and energetics of calcium aluminosilicate glass dissolution through ab initio molecular dynamics-metadynamics simulations

Article Open access 15 March 2024

Introduction

Solving the inverse design of materials is one of the most coveted goals of computational materials science,¹ but sampling the space of materials exhibiting outstanding structures and properties remains challenging. In recent years, deep generative models provided an effective framework for sampling data points conditioned on labels.² Diffusion models, in particular, excel at learning the joint probability distributions of complex data and labels, denoising random inputs into high-fidelity samples, such as images, text, and more^3,4. Within materials science, diffusion models have been shown to produce novel inorganic crystal structures conditioned on properties, symmetries, compositions, and other constraints, in an encouraging step for the inverse design of crystalline materials.^5,6,7,8

Many technologically relevant solids, however, are not crystalline, but amorphous. Polymers, metallic glasses, battery electrolytes, and phase-change memory alloys are examples of materials lacking the long-range order that makes inorganic crystalline systems comparatively simpler to represent and, in many cases, generate.^9,10 Furthermore, the rugged, high-dimensional energy landscapes of amorphous structures demand very long molecular dynamics (MD) or Monte Carlo simulations to achieve adequate convergence¹¹. Thus, modeling synthesis-structure-property relationships that can enable the inverse design of amorphous materials remains challenging. Several works have attempted to use generative models, including variational autoencoders, generative adversarial networks, or diffusion models, to produce amorphous configurations^{12,13,14,15,16,17}. However, comparisons between generated and simulated structures show that most models fail to either produce physically correct samples¹² or sample structures well outside the training set^14,18. Even when structures are reasonable, there is limited evidence that generated samples exhibit the correct macroscopic properties,¹⁷ severely limiting the applicability of generative models in computational studies (Supplementary Text, Sec. S1.1).

Here, we show that a graph neural network (GNN)-based diffusion model can faithfully generate amorphous structures across diverse compositions, densities, processing conditions, and data sources (Fig. 1). Following a denoising diffusion probabilistic model (DDPM) framework³ for atomistic systems,¹⁹ the GNNs are trained to denoise atomic environments added to known amorphous structures (Fig. 1a, b, Methods). Later, trained models are used to sample new amorphous configurations from random distributions of atoms (Fig. 1a, c). Generated amorphous structures recover short- and medium-range order, create distributions of outputs nearly indistinguishable from simulated structures, exhibit elastic and plastic behaviors compatible with their simulated counterparts, and can be conditioned on processing parameters, such as cooling rates (Fig. 1d). The methods are demonstrated with applications on amorphous silica (a-SiO₂), one simulated metallic glass, and one experimentally determined amorphous nanoparticle. The capabilities are used to generate large simulation cells, create processing-property relationships, produce porous structures, and augment experimental characterization data with synthetic results, enabling computational studies previously expensive or impractical (Fig. 1e). These results provide a step toward the inverse design of amorphous materials and an efficient tool for sampling their vast configurational space.

**Fig. 1: Training and validating a generative diffusion model for amorphous materials.**

Results

Generating valid and novel amorphous materials

As a first case study, we trained a GNN-based diffusion model to generate a-SiO₂ structures sampled with MD simulations at a constant cooling rate of 1 K/ps (Methods). Then, we verified whether generated structures displayed the correct short- and medium-range order for these materials. Fig. 2 shows that generated structures exhibited excellent structural agreement with their simulated counterparts adopted as ground-truth. Generated structures captured the short-range order characteristics of a-SiO₂, with partial PDFs and bond angle distributions (BADs) reproducing peak positions and intensities of the reference data (Fig. 2a). At the medium range, Fig. 2c shows that generated structures reproduced ring distributions (Methods) with excellent accuracy, and only slightly overestimated the number of 6-membered rings. Beyond structural distributions, which can average out outliers, macroscopic quantities, such as mechanical properties are highly sensitive to unphysical environments, as they often lead to large variations in energy (see Supplementary Text, Section S1.2 and Tab. S1). Thus, reproducing properties is a rigorous test to validate generative models, but have been largely underutilized within amorphous materials. To enable this test, we calculated the stiffness tensor of simulated and generated structures according to the ground-truth potential used to produce the training data (see Methods for details). Fig. 2d shows that the elastic properties computed for the generated silica structures, including bulk modulus (K), shear modulus (G), Young’s modulus (E), and Poisson’s ratio (ν), are statistically indistinguishable from their simulated counterparts. In contrast with methods, such as reverse Monte Carlo, where structural features are well-reproduced but properties can be severely wrong^20,21, our models produced structures in excellent agreement with reference simulation data across all relevant figures of merit.

**Fig. 2: Structures and properties of generated a-SiO₂ structures are nearly indistinguishable from simulated ones.**

As generative models learn a distribution of input data points, they are not only assessed according to the validity of samples, but also according to their novelty, thus verifying whether the model exhibits “mode collapse”². However, in contrast to crystalline structures or molecular graphs, novelty is challenging to define in the amorphous space (see Sec. S1.3 in the Supplementary Text for a complete discussion). We computed a probability distribution over environments for amorphous structures following an information-theoretical strategy,²² and quantified the overlap between distributions. This comparison between distributions resembles the Fréchet Inception Distance²³, which accounts for novelty in terms of distribution statistics rather than point-wise metrics. Fig. 2e shows that the distribution of overlap scores between different structure pairs is nearly identical (see detailed scores in Fig. S11). Generated structures exhibited as much overlap with simulated structures as the overlap between two simulated structures, ruling out the existence of mode collapse across the system when a large external noise is used to generate structures. On the other hand, Fig. 2f shows that the information entropy of generated samples is unreasonably large if no external noise is used during the denoising process (see Supplementary Text, Sec. S1.4, and Figs. S2–S5). These results demonstrate that our diffusion model generates valid structures according to structural features^12,17,18 and property distributions, as well as novel structures, as quantified by their sampling distributions.

Generating amorphous structures conditioned on processing parameters

Controlling processing conditions, such as cooling rates in glassy materials is essential to model and design amorphous structures and their resulting properties^24,25,26. Given the cost of performing long MD simulations, cooling rates in computational models typically revolve around 10¹² K/s (or 1 K/ps), exceeding typical experimental cooling rates (1–100 K/s) by several orders of magnitude. Thus, learning to sample structures at very low cooling rates can unlock the ability to produce increasingly realistic models for amorphous systems. We trained our generative model on a-SiO₂ structures simulated with cooling rates between 10⁻¹ and 10² K/ps (Methods), then used an input to the GNN that conditioned the generation of amorphous structures to a target cooling rate (Fig. 1c). Fig. 3a demonstrates that structures generated with higher cooling rate targets exhibited higher information entropy even at constant density, reproducing known relationships between configurational entropy and cooling rates in glasses (see Sec. S1.5 and Fig. S6).²⁷ Moreover, Fig. 3b shows that the dependence of average Si-O-Si angles with respect to the thermal history are reasonably captured in the generated a-SiO₂ structures, in agreement with simulated counterparts and the literature.^24,25,26 Beyond structural features, Fig. 3c shows the Young’s modulus (E) for simulated and generated a-SiO₂ structures across different cooling rates. The values of Young’s moduli remain consistent across cooling rates for both simulated and generated a-SiO₂ structures, ranging between 85–100 GPa. This agreement is also true for all other elastic properties, including bulk modulus, shear modulus, and Poisson’s ratio (Figs. S7–S10), confirming our ability to conditionally generate structures across cooling rates while reproducing their property distributions (see Sec. S1.6 in the Supplementary Text for an extended analysis).

**Fig. 3: Conditional generation of a-SiO₂ using diffusion models recover structural features and properties across cooling rates.**

Remarkably, the trends in Fig. 3b, c generalize to both faster (10³ K/ps) and slower (10⁻² K/ps) cooling rates beyond the training regime (see Fig. S11 for more examples), which can be used to generate structures at low cooling rates that are expensive for traditional simulations. To illustrate the impact of this observation, Fig. 3d compares the estimated computational cost (in GPU-h) for simulating amorphous structures with 3000 atoms across cooling rates compared to the time required to evaluate the generative model (see Methods for hardware details). As slow cooling rates can only be simulated by longer MD simulations, the computational cost of simulating realistic glasses follows a power law with the cooling rate, while the generative model produces structures at constant time. At slow cooling rates, such as 0.01 K/ps, the diffusion model produces a-SiO₂ structures at least 2400 times faster (in GPU-h) than MD simulations (Fig. 3d) for a system with 3000 atoms using the Jakse potential²⁸. Performing melt-quench simulations for the largest system in this work (112,848 atoms) at 0.01 K/ps would require about 34,746 GPU-h using MD in our hardware (Methods), a substantial investment of computational resources. The generative process drastically improves the effectiveness of sampling amorphous structures at low cooling rates and large sizes, demonstrating that the correct processing-structure-property relationships can be captured with the model.

Fracture tests with large and slow-cooled amorphous silica

As our generative model samples structures for any input size or shape and for a range of cooling rates, we investigated the fracture behavior of generated a-SiO₂ structures for different length scales and processing conditions. Experimental observations have shown that a-SiO₂ exhibited a brittle-to-ductile transition at the nanoscopic regime^29,30 while computational studies were typically limited by small sizes and fast cooling rates.^24,25,31 We used our diffusion model to generate a-SiO₂ with different sizes and cooling rates, and performed fracture test simulations for each of them (Methods). Fig. 4a shows the stress-strain curves obtained for a-SiO₂ structures generated with a fixed aspect ratio of L_z/L_x = 1.4, as z being the direction of the tensile strain, and L_x varying from 3.5 nm (3735 atoms) to 10.9 nm (112,848 atoms). Generated structures reproduced the increasingly brittle behavior of amorphous silica at large length scales, while smaller simulation lengths (L_x < 6 nm) were more ductile (Supplementary Text, Sec. S1.7). The influence of low cooling rate effects on fracture, however, is often not studied at large length scales due to its associated computational cost^25,32. On the other hand, our generative model allows sampling these large, slow-cooled structures at minimal computational cost. Fig. 4b compares the behavior of slow- and fast-cooled structures for a-SiO₂ structures generated at large sizes (L_z/L_x = 1.4, L_x = 9.4 nm) and different target cooling rates. The results show that lower cooling rates decreased the ductility of the amorphous material,³³ but up to a certain limit. The fracture strain for samples generated with cooling rates of 0.1 and 0.01 K/ps are nearly identical even though the structures are slightly different (Fig. 3b), showing how the fracture behavior converges at very low cooling rates. This contrasts with the continuous trend observed in simulated glasses at much smaller system sizes of 3000 atoms, for which the ductility was drastically overestimated and continued to decrease at low cooling rates (Figs. S12, S13).

**Fig. 4: Shape- and size-agnostic amorphous structure generation enables simulations on fracture and structural porosity.**

Generating mesoporous amorphous structures

Mesoporous silica particles exhibiting pores with diameters between 2–50 nanometers are ubiquitous in many applications ranging from catalysis and separation to drug delivery^34,35. Despite their wide use in experiments, computational sampling of mesoporous silica structures remains challenging. Traditional melt-quench methods fail to preserve hollow porous architectures, requiring a “pore drilling" approach that often produces incorrect surface configurations.³⁶ Moreover, accurately representing surface silanol groups, which are critical for many applications, requires simultaneously maintaining structural stability while correctly terminating Si-O bonds at pore surfaces^36,37. We used our generative approach to produce a porous a-SiO₂ structure by denoising a structure from atoms sampled in a hexagonal lattice with a cylindrical pore (Methods, Fig. 4c). Despite never been trained on surfaces or porous structures, our diffusion model generated excellent amorphous surfaces and a wall that exhibited the right short-range ordering. Interestingly, generated porous structures correctly captured the atomic structure at the pore surface, comprised mostly of pristine SiO₄ tetrahedra and non-bridging oxygens (NBO). The concentrations of NBOs were also in agreement with typical ranges of 2-4 NBOs/nm² from experimental data^38,39, and were a function of the initial wall density used as starting configuration for the denoiser. At higher densities, the pore tended to shrink and form more densed NBOs at the pore wall, while low-density walls tended to form less NBOs at the surface (Fig. S14). As NBOs are responsible for a variety of properties of mesoporous silica in the presence of hydrogen, such as catalytic or hydrophobicity, obtaining these substructures directly from the generative approach can drastically accelerate computational modeling in these fields.

Generating metallic glasses

To extend our generative approach beyond a-SiO₂, we tested our methodology against Cu-Zr metallic glass structures and properties, a non-covalent amorphous system with a challenging potential energy surface. Amorphous Cu-Zr structures with a fixed composition of Cu₅₀Zr₅₀ and 5000 atoms were obtained from ref. ⁴⁰ to train a diffusion model without conditional targets. Then, given a box with randomly sampled atoms at a chosen composition and density, we used the diffusion model to generate the structure of the metallic glass, which can be followed by a short, post-denoising MD equilibration (Methods). Figure 5a shows that the partial PDFs between simulated and generated Cu₅₀Zr₅₀ glasses are in excellent agreement with each other, with generated structures showing near-perfect peak positions and only slightly lower intensities compared to the reference structures. This is true even for long-range peaks around 7 and 9 Å, which were outside of the cutoff of the GNN (5 Å), but were treated indirectly by the message-passing layers. The atomic volume in Fig. 5b and fractions of Voronoi indices in Fig. 5c for simulated and generated Cu₅₀Zr₅₀ also show good agreement with each other and only underestimated 〈0, 0, 12, 0〉 icosahedral motifs in generated samples. Given the sensitivity of Voronoi polyhedra to the structural fidelity, the ability of the model to correctly recover these trends is remarkable. Furthermore, Fig. 5d shows that the elastic and plastic regimes of the generated amorphous Cu-Zr structures (six independent samples) are also compatible with the reference curves (two samples). Though the yield stress is slightly underestimated in the generated structures, mirroring the results in Fig. S13, the strength curves of both metallic glasses are within good agreement with each other, especially when considering the sensitivity of stress-strain curves from these metallic glasses depending on the sample preparation⁴¹ and the limited training data used for the denoiser model. This proves that the generative model is not limited to amorphous silica networks, but can also produce valid amorphous structures and properties for metallic glassy systems.

**Fig. 5: Generating metallic glass structures from simulated and experimental data sources.**

Augmenting experimental characterization data on amorphous structures

Given that characterization data on amorphous structures is challenging to obtain at high resolutions, generative models can be used to augment experimental datasets with realistic, synthetically generated data. We trained our model on the atomistic structure of a multi-component metallic glass from Yang et al.⁴² obtained using atomic electron tomography (AET) (Fig. 5e). The data contained a finite metallic glass nanoparticle with three different element classes denoted as types 1, 2, and 3. Generated structures reproduced the distribution of polyhedral faces (Fig. 5f) and Voronoi indices distribution (see Supplementary Text, Sec. S1.8 and Tab. S2) from the experimental data, indicating that the model captured the correct geometrical distribution of the data. Furthermore, Fig. 5g, h show that the diffusion model approximated the total PDF of the experimental data at realistic levels despite being trained on a single example of experimental structure. Discrepancies between the two PDFs in Fig. 5g, especially at longer ranges, are due to the mismatch between the radial data computed for a finite particle in the experimental data and the generated bulk structure represented with periodic boundary conditions. Nevertheless, the generated structure reproduced the experimental behavior observed in the medium-range, where certain substructures were shown to form crystal-like “superclusters” that created medium-range order for specific atom pairs, and atypical PDFs for specific atom pairs⁴². Fig. 5h illustrates that the model correctly captured the correct short-range ordering and pairwise distributions in the experimental data (see Fig. S15 for all partial PDFs). In typical pair distribution functions, such as that for 1–3 pairs in Fig. 5h, generated structures exhibited the right peak positions and their relative intensities of the PDF, including medium-range features between 5–6 Å. Remarkably, even for atypical partial PDFs, such as the ones for 3–3 pairs in the experimental data, the model captured the correct distribution of atomistic species, as demonstrated by the agreement in peak positions around 2.9, 3.8, and 4.8 Å, with the third peak being the most prominent one. At medium-range, the model also captured the shoulder in the PDF between 5.5–6.5 Å, and the longer-range peak around 7.3 Å. This shows that the generative model can quantitatively reproduce experimental datasets and can be used to produce synthetically generated structures that mimic the experimental results at arbitrary sizes and shapes, or assist in the development of atomistic models for structural characterization.

Discussion

Our generative diffusion model substantially advances computational modeling of amorphous materials, accurately producing physically realistic structures and properties across various compositions, data sources, and processing conditions. Unlike previous generative methods for amorphous materials, whose unphysical artifacts or constrained structural diversity limited their broad use in simulations, our model consistently replicates key short- and medium-range structural characteristics and mechanical properties, highlighting its robustness and reliability. Importantly, this framework offers efficient computational performance, drastically reducing the computational cost typically associated with MD simulations. Our work provides a roadmap not only for the training process of diffusion models for amorphous materials, but also how to quantitatively assess model quality in terms of structural novelty, properties, and applications.

As with any model, our approach exhibits limitations. Occasional outlier environments, although rare, require short MD simulations to refine the generated structure. Furthermore, though several properties are predicted with high accuracy, yield strengths are still slightly underestimated, showing this metric may be a good indicator of model quality within generative models for amorphous structures as the field progresses. Future research can extend this approach toward broader classes of complex materials, including metallic or semiconducting alloys or polymers. Additionally, integrating generative methods with experimental characterization can automate structural elucidation, particularly in fields where experimental data is scarce or costly to obtain. The adaptability and diversity of applications demonstrated here suggest that generative diffusion models can already be used in a range of computational materials science problems, accelerating their modeling and discovery.

Methods

Score-based denoiser for amorphous structures

To create a generative model for amorphous structures, we adapted the denoising diffusion probabilistic models (DDPM) approach to amorphous materials by learning a mapping between a random distribution of atoms and a physically meaningful amorphous structure using a GNN^3,19. Figure 1a exemplifies the training and generative denoising processes for amorphous SiO₂ (a-SiO₂), with their partial pair distribution functions (PDFs) illustrating how short- and medium-range order varies across the system. The forward diffusion process (left to right in Fig. 1a) gradually introduces Gaussian noise to atomic positions from simulated amorphous structures until the final structure resembles a random distribution of particles. After training, the model is used to generate amorphous structures from randomly sampled particles with the denoising process (Fig. 1a, right to left). Given a target cooling rate and an initial configuration with fixed density and shape, the model sequentially denoises the configuration in a process that resembles “Langevin dynamics” with a learned score. This process adds an extra noise at each denoising step, which acts as an effective “temperature” for the system and escaping local minima in the learned score function (see Sec. S1.4 in the Supplementary Text). Within the DDPM, this extra noise follows a schedule and becomes progressively smaller throughout the denoising process³.

Mathematically, a generative model approximates the distribution p(x) over atomic structures x of the material system of interest. Under this DDPM strategy for atomistic structures, we train a model to approximate the distribution of atomic environments of amorphous configurations p(x) inferred from a dataset of known configurations. This approximation of p(x) is first performed by considering an input structure x₀ and defining a Markov chain process such that

$${{\bf{x}}}_{0}\to {{\bf{x}}}_{1}\to \ldots \to {{\bf{x}}}_{T},$$

(1)

where T is the number of diffusion steps. This forward process is defined by a distribution q(x_t∣x_t−1), 1≤t≤T. For simplicity, the distribution q is often chosen as a Gaussian distribution, where

$$q({{\bf{x}}}_{t}| {{\bf{x}}}_{t-1})={\mathcal{N}}\left({{\boldsymbol{\mu }}}_{t}({{\bf{x}}}_{t-1}),{\sigma }_{t}^{2}{\mathbb{I}}\right),$$

(2)

with (μ_t, σ_t) functions of the denoising timestep t and ${\mathbb{I}}$ the identity matrix. Within the diffusion model framework, given that the ground-truth distribution p is often intractable, a model p_θ with parameters θ is trained to approximate, at each step t, the score function of the noise distribution q(x_t∣x_t−1),⁴³

$$\mathop{\min }\limits_{\theta }{{\mathbb{E}}}_{t}\left[{\sigma }_{t}^{2}{{\mathbb{E}}}_{{{\bf{x}}}_{0} \sim {\mathcal{D}}}{{\mathbb{E}}}_{q({{\bf{x}}}_{t}| {{\bf{x}}}_{0})}\left[\parallel {p}_{\theta }({{\bf{x}}}_{t},t)-{\nabla }_{{{\bf{x}}}_{t}}\log q({{\bf{x}}}_{t}| {{\bf{x}}}_{0}){\parallel }^{2}\right]\right].$$

(3)

Within this notation, p_θ(x_t, t) is called the score model, and the expectation ${{\mathbb{E}}}_{{{\bf{x}}}_{0} \sim {\mathcal{D}}}$ is performed over the dataset ${\mathcal{D}}$ of initial (training) structures x₀.

When the distribution q is chosen to be Gaussian with μ_t = 0 and we define x_t = x_t−1 + σ_tϵ, ${\boldsymbol{\epsilon }} \sim {\mathcal{N}}({\bf{0}},{\mathbb{I}})$, its score can be written directly as

$${\nabla }_{{{\bf{x}}}_{t}}\log q({{\bf{x}}}_{t}| {{\bf{x}}}_{t-1})=-\frac{{\boldsymbol{\epsilon }}}{{\sigma }_{t}},$$

(4)

which defines a per-atom displacement ε_t that maps the noisy atomic positions x_t to its previous, denoised state x_t−1.³ By redefining the model p_θ(x_t, t) as ϵ_θ(x_t, t) = − σ_tp_θ(x_t, t) and adopting a denoising score-matching loss,⁴⁴ which simplifies the objective function of Eq. (3) to

$$\mathop{\min }\limits_{\theta }{{\mathbb{E}}}_{t}\left[{{\mathbb{E}}}_{{{\bf{x}}}_{0} \sim {\mathcal{D}}}{{\mathbb{E}}}_{q({{\bf{x}}}_{t}| {{\bf{x}}}_{t-1})}\left[{\left\Vert {{\boldsymbol{\epsilon }}}_{\theta }({{\bf{x}}}_{t},t)-{\boldsymbol{\epsilon }}\right\Vert }^{2}\right]\right].$$

(5)

Equation (5) indicates that a model ϵ_θ is given an input structure x_t and is trained to predict the noise ϵ that maps from the denoised state x_t−1 into x_t given a noise schedule σ_t.

Model architecture and training

Following previous approaches¹⁹, the denoiser model uses a customized version of the E(3)-equivariant NequIP model⁴⁵ to implement the training from in Eq. (5). The model utilizes an initial embedding layer that maps atomic species information onto an 8-dimensional scalar feature space (8 × 0e) for both node attributes and atomic numbers, with interactions truncated at the specified cutoff radius (5 Å). The network architecture consists of three convolution layers (num_convs = 3) within the message-passing framework. Hidden layer representations are configured as 64 scalar features and 32 vector (rotation-equivariant) features. Edge features are represented with 4 × 0e + 4 × 1o + 2 × 2e features encoding scalar, vector, and rank-2 tensor components. The radial basis functions are parameterized through a two-layer neural network with 16 and 64 neurons, respectively, to model distance-dependent interactions. The model considers up to 12 nearest neighbors per atom (num_neighbors = 12) and produces vector outputs, ensuring that the predicted noises are equivariant under rotations and translations while maintaining the fundamental symmetries of the physical system.

To incorporate cooling rate information into the model architecture, we implemented a Gaussian basis embedding module, which maps a scalar cooling rate value into a high-dimensional feature vector. Specifically, we project each cooling rate onto a set of 9 Gaussian basis functions with means distributed uniformly across the normalized cooling rate range in a ${\log }_{10}$ scale, thus mapping the interval of cooling rates from [10⁻², 10³] K/ps to [−2, 3]. Each basis function’s activation represents the similarity between the input cooling rate and that basis function’s mean value, effectively creating a smoothly varying representation. These activations are then processed through a two-layer neural network with a SoftPlus nonlinearity, producing a dense embedding vector that captures the cooling rate’s influence on structural properties. This embedding is summed to the hidden representation at each convolution layer of the GNN.

The model was trained on a single NVIDIA RTX A6000 GPU. The per-coordinate, per-atom noise perturbation ϵ was added at train time and sampled from a normal distribution ${\mathcal{N}}(0,\sigma^2 )$, where σ ranged from 0.001 to 0.75 Å based on the noise schedule. At train time, the schedule t is sampled from a uniform distribution, where 1 ≤ t ≤ T. Maximum displacement parameters were calibrated to approximately 50% of typical bond lengths, ensuring that the diffusion process leads to fully random or highly noised configurations. Cooling rates associated with individual training configurations were embedded using a Gaussian basis set, as details described in the above section. The conditional denoiser model was trained for 40,000 iterations with a batch size of 16 and learning rate of 2 × 10⁻⁴, with validation loss monitored to assess convergence. For unconditional denoiser models (CuZr and experimental nanoparticle systems), the number of training iterations was 20,000.

Inference and generative denoising process

The number of atoms and lattice parameters were selected as inputs based on desired density, lattice shape, and atom types. All atoms were initially sampled from a uniform distribution within the simulation cell, meaning each of their fractional coordinates (f_x, f_y, f_z) was sampled from ${\mathcal{U}}(0,1)$. The denoiser model was applied to generate atomic structures from these initialized configurations with extra noise (Fig. 1c). The per-atom, per-coordinate extra noise is sampled from a normal distribution ${\mathcal{N}}(0,{\sigma }_{t}^2)$ with mean 0 and standard deviation σ_t represents the noise magnitude following a noise schedule dependent on the denoising step t. The noise magnitude was decreased linearly to zero over the extent of 2900 generation steps. Thus, each initial configuration underwent 2900 denoising steps with extra noise (with σ_t=0 = 1.0) followed by 100 denoising steps without extra noise. Despite being denoised from the same initial structure, adding “large” initial extra noise and using different random seeds ensures that generated structures are different from each other.

Generated configurations were subsequently refined using MD simulations consisting of 25 ps in the NVT ensemble followed by 25 ps in the NPT ensemble using the potential and settings for MD described below. This last-step MD refinement was performed for a-SiO₂ and Cu₅₀Zr₅₀ systems but was omitted for structures used to reproduce experimental data.

For conditional denoising based on different cooling rates, simulation boxes are kept constant during the denoising process, but the cooling rate is known to influence the density. To account for this fact, we extracted densities from simulated structures across different cooling rates and identified a reasonably linear correlation between a-SiO₂ density and the logarithm of the cooling rate (see Fig. S16 and Supplementary Text, Sec. S1.6) within the ranges of interest^24,25. Thus, when selecting the targeted cooling rate, we fix the density of the initial, randomly sampled configuration using a linear regression model valid within the range of cooling rates of interest. Future work can focus on sampling amorphous structures with variable densities throughout the denoising process.

To create an amorphous a-SiO₂ structure, we performed the denoising process described above with a different initial condition, where atoms were sampled according to a non-uniform distribution throughout the lattice. Atoms were sampled from a random distribution over a hexagonal lattice with predefined wall density, number of atoms, and lattice length. However, an excluded volume in the shape of a cylinder prevented atoms from being sampled within the “pore”, thus mimicking a uniform atomic distribution of atoms for all regions outside the pore (Fig. 4c). Then, the porous structure was denoised following the exact same procedure described above, with a target cooling rate of 1 K/ps.

MD simulations

All MD simulations were performed using the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) software⁴⁶ (v. 17 Apr 2024). A fixed integration timestep of 1.0 fs was used throughout all simulations.

To generate training data for the SiO₂ systems, we conducted MD simulations of amorphous silica comprised of 3000 atoms across multiple cooling rates. We executed six independent simulations at each cooling rate, maintaining the same settings while varying initial configurations. We adopt the Jakse interatomic potential²⁸, parametrized from ab initio calculations, to model a-SiO₂. Short-range interactions are calculated with an 8.0 Å cutoff. Coulomb interactions are calculated by adopting the Fennell damped shifted force model with a damping parameter of 0.25 Å⁻¹ and a global cutoff of 8.0 Å. Periodic boundary conditions were applied in all three directions during MD simulations.

All a-SiO₂ samples were prepared via conventional melt-quench method. Initial structures were generated by randomly placing SiO₂ molecules in cubic boxes using PACKMOL (v. 20.13.0)⁴⁷, enforcing a 2.0 Å minimum separation between molecules to prevent unphysical overlaps. Following energy minimization, configurations underwent sequential 50 ps relaxations in the canonical (NVT) and isothermal-isobaric (NPT) ensembles at 300 K. Complete melting was achieved with a 50 ps-long NPT equilibration at 5000 K and zero pressure, eliminating the “memory” of the initial configuration. Subsequently, liquid systems were cooled from 5000 to 300 K under zero pressure NPT conditions at rates of 10⁻¹, 10⁰, 10¹, and 10² K/ps. The obtained glass structures are further relaxed one last time at 300 K for 100 ps in the NPT ensemble. This process follows the standards from previous work²⁶ shown to reproduce multiple experimental features from a-SiO₂ glasses.

We adopted the published dataset from Wang et al.⁴⁰, which used a set of optimized embedded-atom method (EAM) potentials to simulate CuZr metallic glass models⁴⁸. Specifically, we used two of their Cu₅₀Zr₅₀ configurations, each containing 5000 atoms, to train our denoiser model. The potential from Cheng et al.⁴⁸ was later used to perform the short MD refinement and the fracture test (described below) for generated CuZr structures. The post-denoising MD simulation was 25 ps-long at the NVT ensemble, followed by a 25 ps-long simulation at the NPT ensemble, both at 300 K.

Computation of elastic properties and fracture test

The stiffness tensor C_ij of the equilibrated glasses is computed by performing a series of six deformations (i.e., three axial and three shear deformations along the three axes) and calculating the curvature of the potential energy U:

$${C}_{ij}=\frac{1}{V}\frac{{\partial }^{2}U}{\partial {e}_{i}\partial {e}_{j}},$$

(6)

where V is the glass volume, e is the strain, and i, j are the indices corresponding to each Cartesian direction. All simulated configurations exhibited near-complete isotropy. Bulk modulus (K), shear modulus (G), Young’s modulus (E), and Poisson’s ratio (ν) are then calculated from the stiffness tensors.

Fracture simulations are performed on different starting bulk configurations by deforming the structures along a single direction during an MD simulation. Uniaxial deformation is achieved by imposing a constant strain rate of 10⁹ s⁻¹ along the z direction while allowing lateral dimensions (x and y) to relax freely under zero lateral pressure, approximating realistic uniaxial tension conditions. The deformation process is controlled using an NPT ensemble maintained at 300 K and 1 bar. True stress is calculated from the negative z-component of the stress tensor.

Computational cost estimates

We used a single NVIDIA RTX A6000 GPU to compare the computational cost of MD simulations and denoiser inference for an a-SiO₂ system of 3000 atoms, as shown in Fig. 3d. The entire MD simulation process includes initialization, melting, cooling and relaxation parts as detailed above, where only the cooling part is affected by the choice of cooling rate. Note that, the choice of potential, simulation size, and other aspects can affect the computational time for MD simulation. The LAMMPS package was compiled with Kokkos to perform the GPU-accelerated simulations⁴⁹. The denoising process (inference) consists of a total of 3000 denoising steps (2900 steps with extra noise and 100 steps without extra noise).

To estimate the computational cost of traditional melt-quench MD simulations for large systems at slow cooling rates, we benchmarked structures at target sizes for 25 ps. All simulations include fixed computational overhead include 50 ps NVT initialization, 50 ps NPT initialization, 50 ps NPT melting, and 100 ps NPT relaxation (250 ps total). The quenching process duration varies with cooling rate, requiring 47, 470, 4,700, 47,000 and 470,000 ps for 10², 10¹, 10⁰, 10⁻¹, and 10⁻² K/ps, respectively. Total computational cost is calculated by multiplying the average time per step obtained for a 10 ps MD simulation by the total number of steps to perform the simulation. For the large system estimate, the cost of performing a 10 ps MD simulation of 112,848 atoms within a-SiO₂ and the simulation settings described above is 0.74 GPU-h. This results on the total estimated cost of 34,746 GPU-h for the same size under 0.01 K/ps cooling rate described in the main text.

We also used a single CPU core of Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.20 GHz from UCLA’s Hoffman2 Cluster to estimate the computational cost of MD simulation and denoiser generation on CPU. The simulate and generation process are the same as described above, and the results are shown in Fig. S17.

Analysis of amorphous structures

All structural features of simulated a-SiO₂ structures were performed with six independent samples of 3000 atoms each. Samples are obtained either through the traditional melt-quench process, or using the generative models. All structural properties reported in the main text (e.g., Figs. 2, 3) are reported as either averages over these six samples (e.g., structural features in Fig. 2a–c) or distributions (e.g., mechanical properties in Fig. 2d). This allows us to perform statistically relevant comparisons between generated and simulated structures by accounting for the distribution of structures and properties in amorphous materials.

Short-range structural ordering was characterized through pair distribution function (PDF) analysis. All PDFs were calculated using OVITO⁵⁰ with 100 bins between 0 and 8 Å. The medium-range order structure of amorphous silica was characterized through ring size distribution analysis. These calculations were performed using the RINGS package,⁵¹ with Si-O bonds defined using a 2.0 Å cutoff distance for ring identification within the network structure, which is in agreement with the partial PDF for Si-O pairs in Fig. 2a.

We quantified the concentration of non-bridging oxygens (NBOs) on generated mesoporous silica surfaces by calculating the ratio of NBO atoms to the estimated surface area of the pore. As described in the main text, the pore morphology exhibits density-dependent shape and distortions from ideal cylindrical structures, as illustrated in Fig. S14. To address this non-ideal geometry, we analyzed the radial distribution of Si and O atoms relative to the central simulation axis. The effective pore radius was estimated as the distance at which the density of Si and O atoms plateaus, indicating the boundary between pore space and silica framework. Whereas our generation has been performed only for the SiO₂ composition, NBOs are of high relevance to applications of mesoporous silica, where they can become silanol groups in the presence of hydrogen.

We employed Voronoi analysis in OVITO⁵⁰ to compute the atomic volumes of CuZr structures. Particle radii of 1.35 Å for Cu and 1.55 Å for Zr atoms are applied for analysis, with relative face area threshold set to 1%. Voronoi indices for CuZr structures were also computed using OVITO using the same particle radii. For Voronoi indices, surface atoms occupying less than 1% of the total surface area were excluded to minimize effect from experimental measurement and structural reconstruction errors. Voronoi indices are represented using reduced Schlaefli notation, 〈n₃, n₄, n₅, n₆〉, where n_i denotes the number of polyhedral faces containing i edges. The polyhedral face distribution quantifies the frequency of faces with varying edge numbers. We adopted the analysis functions published by Yang et al.⁴² to analyze Voronoi tessellation of atomic structures for both experimental and generated nanoparticles for consistency with that work. Surface atoms occupying less than 1% of the total surface area were excluded to minimize effect from experimental measurement and structural reconstruction errors.

Information entropy and QUESTS method

The representation of atomic environments was computed as described in our previous work²². In summary, a number of k = 32 neighbors was used to represent the atomic environment, with a cutoff of 5 Å. No distinction was made between element types, which is implicitly recovered based on the coordination environments, and thus captured in the information entropy of the system.

The information entropy of descriptor distributions was computed as described before.²² Given a set of feature vectors {X}, their information entropy is computed as follows:

$${\mathcal{H}}(\{{{\bf{X}}}\})=-\frac{1}{n}\mathop{\sum }\limits_{i=1}^{n}\,{\text{log}}\,\left[\frac{1}{n}\mathop{\sum }\limits_{j = 1}^{n}{K}_{h}({{\boldsymbol{X}}}_{i},{{\boldsymbol{X}}}_{j})\right],$$

(7)

where our choice of the kernel K_h is the Gaussian kernel,

$${K}_{h}({{\bf{X}}}_{i},{{\bf{X}}}_{j})=\,{\text{exp}}\,\left(\frac{-| | {{\bf{X}}}_{i}-{{\bf{X}}}_{j}| {| }^{2}}{2{h}^{2}}\right),$$

(8)

with a constant bandwidth h = 0.015 Å⁻¹, as studied before.²²

We define the differential entropy $\delta {\mathcal{H}}$ of a data point Y with respect to a reference dataset {X} as

$$\delta {\mathcal{H}}(\,{\text{Y}}\,| \{{\bf{X}}\})=-\,{\text{log}}\,\left[\mathop{\sum }\limits_{i=1}^{n}{K}_{h}({\bf{Y}},{{\bf{X}}}_{i})\right].$$

(9)

Throughout this work, the natural logarithm was used for the entropy in Eq. (7), which scales the information to natural units (nats).

From the definition of $\delta {\mathcal{H}}$, the overlap between two discrete distributions of atomic environments {Y} and {X} is the fraction of environments Y_i ∈ {Y} with $\delta {\mathcal{H}}({{\bf{Y}}}_{i}| \{{\bf{X}}\})\le 0$. Therefore, a zero overlap between two distributions indicates that the distributions have disjoint supports, whereas a 100% overlap indicates that the two distributions have the same support, even when the point-wise probabilities are different. Within the context of generative models described in the main text, a high overlap indicates that the generated structures follow the same distribution of atomic environments, while a low overlap suggests that the structures do not share the same environments.

Data Availability

The datasets generated in this work are available at https://github.com/digital-synthesis-lab/2025-dm2-data. The datasets of metallic glasses were obtained from the original source at: https://figshare.com/articles/dataset/Heterogeneous_thermal_activation_energy_in_Cu-Zr_metallic_glasses/12485795. The datasets of the experimental nanoparticle was obtained from the original source at: https://github.com/AET-MetallicGlass/Supplementary-Data-Codes.

Code availability

The code, trained models and demos for training and denoising are available at https://github.com/digital-synthesis-lab/DM2. This work was based on the code from Hsu et al.,¹⁹ available at https://github.com/LLNL/graphite. The code for computing Voronoi indices of amorphous nanoparticles was from Yang et al.,⁴² available at https://github.com/AET-MetallicGlass/Supplementary-Data-Codes. The information theoretical analysis was performed with the QUESTS package from Schwalbe-Koda et al.,²² available at https://github.com/dskoda/quests. A webpage demonstrating this work with interactive visualizations can be accessed at https://kaiyang1010.github.io/DM2/.

References

Zunger, A. Inverse design in search of materials with target functionalities. Nat. Rev. Chem. 2, 0121 (2018).
Article CAS Google Scholar
Goodfellow, I. J. et al. Generative adversarial nets. In Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N. & Weinberger, K. (eds.) Advances in Neural Information Processing Systems, vol. 27 (NIPS, 2014).
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 33, 6840–6851 (2020).
Google Scholar
Yang, L. et al. Diffusion models: A comprehensive survey of methods and applications. ACM Comput. Surv. 56, 1–39 (2023).
Article Google Scholar
Xie, T., Fu, X., Ganea, O.-E., Barzilay, R. & Jaakkola, T. S. Crystal diffusion variational autoencoder for periodic material generation. In International Conference on Learning Representationshttps://openreview.net/forum?id=03RLpj-tc_ (ICLR, 2022).
Zeni, C. et al. A generative model for inorganic materials design. Nature 639, 624–632 (2025).
Article CAS PubMed PubMed Central Google Scholar
Guo, G. et al. Ab initio structure solutions from nanocrystalline powder diffraction data via diffusion models. Nat. Mater.https://doi.org/10.1038/s41563-025-02220-y (2025).
Zhong, P., Dai, X., Deng, B., Ceder, G. & Persson, K. A. Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials. Mater. Horiz. 12, 9669−9678 (2025).
Schwalbe-Koda, D. & Gómez-Bombarelli, R. Generative models for automatic chemical design. In Machine Learning Meets Quantum Physics, 445–467 (Springer, 2020).
Liu, Y., Madanchi, A., Anker, A. S., Simine, L. & Deringer, V. L. The amorphous state as a frontier in computational materials design. Nat. Rev. Mater. 10, 228–241 (2024).
Article CAS Google Scholar
Urata, S., Bertani, M. & Pedone, A. Applications of machine-learning interatomic potentials for modeling ceramics, glass, and electrolytes: A review. J. Am. Ceram. Soc. 107, 7665–7691 (2024).
Article CAS Google Scholar
Comin, M. & Lewis, L. J. Deep-learning approach to the structure of amorphous silicon. Phys. Rev. B 100, 094107 (2019).
Article CAS Google Scholar
Kilgour, M., Gastellu, N., Hui, D. Y., Bengio, Y. & Simine, L. Generating multiscale amorphous molecular structures using deep learning: a study in 2d. J. Phys. Chem. Lett. 11, 8532–8537 (2020).
Article CAS PubMed Google Scholar
Chen, Q., Annamareddy, A., Li, Y.-F., Morgan, D. & Wang, B. Physical regularized hierarchical generative model for metallic glass structural generation and energy prediction. arXiv:2505.09977 (2025).
Li, H. et al. Conditional generative modeling for amorphous multi-element materials. arXiv:2503.07043 (2025).
Xu, X. & Hu, J. A generative adversarial networks (GAN) based efficient sampling method for inverse design of metallic glasses. J. Non-Crystal. Solids 613, 122378 (2023).
Article CAS Google Scholar
Kwon, H. et al. Spectroscopy-guided discovery of three-dimensional structures of disordered materials with diffusion models. Mach. Learn. Sci. Technol. 5, 045037 (2024).
Article Google Scholar
Yong, A. X. B., Su, T. & Ertekin, E. Dismai-Bench: benchmarking and designing generative models using disordered materials and interfaces. Digit. Discov. 3, 1889–1909 (2024).
Article Google Scholar
Hsu, T. et al. Score-based denoising for atomic structure identification. npj Comput. Mater. 10, 155 (2024).
Article CAS Google Scholar
McGreevy, R. L. Reverse monte carlo modelling. J. Phys. Condens. Matter 13, R877 (2001).
Article CAS Google Scholar
Maffettone, P. M. et al. When can we trust structural models derived from pair distribution function measurements? Faraday Discuss. 255, 311–324 (2025).
Article PubMed Google Scholar
Schwalbe-Koda, D., Hamel, S., Sadigh, B., Zhou, F. & Lordi, V. Model-free estimation of completeness, uncertainties, and outliers in atomistic machine learning using information theory. Nat. Commun. 16, 4014 (2025).
Article CAS PubMed PubMed Central Google Scholar
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B. & Hochreiter, S. GANs trained by a two time-scale update rule converge to a local nash equilibrium. Adv. Neural Inf. Process. Syst. 30, 6629–6640 (2017).
Vollmayr, K., Kob, W. & Binder, K. Cooling-rate effects in amorphous silica: a computer-simulation study. Phys. Rev. B 54, 15808 (1996).
Article CAS Google Scholar
Lane, J. M. D. Cooling rate and stress relaxation in silica melts and glasses via microsecond molecular dynamics. Phys. Rev. E 92, 012320 (2015).
Article Google Scholar
Li, X. et al. Cooling rate effects in sodium silicate glasses: bridging the gap between molecular dynamics simulations and experiments. J. Chem. Phys. 147, 074501 (2017).
Berthier, L., Ozawa, M. & Scalliet, C. Configurational entropy of glass-forming liquids. J. Chem. Phys. 150, 160902 (2019).
Article PubMed Google Scholar
Bouhadja, M., Jakse, N. & Pasturel, A. Structural and dynamic properties of calcium aluminosilicate melts: a molecular dynamics study. J. Chem. Phys. 138, 224510 (2013).
Article CAS PubMed Google Scholar
Célarié, F. et al. Glass breaks like metal, but at the nanometer scale. Phys. Rev. Lett. 90, 075504 (2003).
Article PubMed Google Scholar
Luo, J. et al. Size-dependent brittle-to-ductile transition in silica glass nanofibers. Nano Lett. 16, 105–113 (2016).
Article CAS PubMed Google Scholar
Yuan, F. & Huang, L. Molecular dynamics simulation of amorphous silica under uniaxial tension: From bulk to nanowire. J. Non-Crystal. Solids 358, 3481–3487 (2012).
Article CAS Google Scholar
Zhang, Z., Ispas, S. & Kob, W. The critical role of the interaction potential and simulation protocol for the structural and mechanical properties of sodosilicate glasses. J. Non-Crystal. Solids 532, 119895 (2020).
Article CAS Google Scholar
Fan, M. et al. Effects of cooling rate on particle rearrangement statistics: rapidly cooled glasses are more ductile and less reversible. Phys. Rev. E 95, 022611 (2017).
Article PubMed Google Scholar
Kresge, aC., Leonowicz, M. E., Roth, W. J., Vartuli, J. & Beck, J. Ordered mesoporous molecular sieves synthesized by a liquid-crystal template mechanism. Nature 359, 710–712 (1992).
Article CAS Google Scholar
Tang, F., Li, L. & Chen, D. Mesoporous silica nanoparticles: synthesis, biocompatibility and drug delivery. Adv. Mater. 24, 1504–1534 (2012).
Article CAS PubMed Google Scholar
Calderon V, S., Ribeiro, T., Farinha, J. P. S., Baleizão, C. & Ferreira, P. J. On the structure of amorphous mesoporous silica nanoparticles by aberration-corrected STEM. Small 14, 1802180 (2018).
Article Google Scholar
Fought, E. L. et al. Modeling of linear nanopores in a-SiO2 tuning pore surface structure. Microporous Mesoporous Mater. 341, 112077 (2022).
Article CAS Google Scholar
Zhao, X. S., Lu, G., Whittaker, A., Millar, G. & Zhu, H. Comprehensive study of surface chemistry of MCM-41 using 29Si CP/MAS NMR, FTIR, pyridine-TPD, and TGA. J. Phys. Chem. B 101, 6525–6531 (1997).
Article CAS Google Scholar
Kozlova, S. A. & Kirik, S. D. Post-synthetic activation of silanol covering in the mesostructured silicate materials MCM-41 and SBA-15. Microporous Mesoporous Mater. 133, 124–133 (2010).
Article CAS Google Scholar
Wang, Q. et al. Predicting the propensity for thermally activated β events in metallic glasses via interpretable machine learning. npj Comput. Mater. 6, 194 (2020).
Article Google Scholar
Nawano, A., Jin, W., Schroers, J., Shattuck, M. D. & O’Hern, C. S. Characterizing the mechanical response of metallic glasses to uniaxial tension using a spring network model. Phys. Rev. Mater. 7, 073605 (2023).
Article CAS Google Scholar
Yang, Y. et al. Determining the three-dimensional atomic structure of an amorphous solid. Nature 592, 60–64 (2021).
Article CAS PubMed Google Scholar
Hyvärinen, A. Estimation of non-normalized statistical models by score matching. J. Mach. Learn. Res. 6, 695–709 (2005).
Google Scholar
Vincent, P. A connection between score matching and denoising autoencoders. Neural Comput. 23, 1661–1674 (2011).
Article PubMed Google Scholar
Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
Article CAS PubMed PubMed Central Google Scholar
Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comp. Phys. Comm. 271, 108171 (2022).
Article CAS Google Scholar
Martínez, L., Andrade, R., Birgin, E. G. & Martínez, J. M. PACKMOL: a package for building initial configurations for molecular dynamics simulations. J. Comput. Chem. 30, 2157–2164 (2009).
Article PubMed Google Scholar
Cheng, Y., Ma, E. & Sheng, H. Atomic level structure in multicomponent bulk metallic glass. Phys. Rev. Lett. 102, 245501 (2009).
Article CAS PubMed Google Scholar
Trott, C. R. et al. Kokkos 3: programming model extensions for the exascale era. IEEE Trans. Parallel Distrib. Syst. 33, 805–817 (2022).
Article Google Scholar
Stukowski, A. Visualization and analysis of atomistic simulation data with OVITO-the open visualization tool. Model. Simul. Mater. Sci. Eng. 18, 015012 (2010).
Article Google Scholar
Le Roux, S. & Jund, P. Ring statistics analysis of topological networks: new approach and application to amorphous GeS2 and SiO2 systems. Comput. Mater. Sci. 49, 70–83 (2010).
Article Google Scholar

Download references

Acknowledgements

This work was supported by Toyota Research Institute under the Synthesis Advanced Research Challenge. This work used computational and storage services associated with the Hoffman2 Shared Cluster provided by UCLA Office of Advanced Research Computing’s Research Technology Group. The authors thank Linda Hung, Amalie Trewartha, Steven Torrisi, Fei Zhou, Jiawei Guo, and Long Qi for discussions and suggestions regarding this work. The authors also thank Jianwei (John) Miao and his group for making the experimental dataset available.

Author information

Authors and Affiliations

Department of Materials Science and Engineering, University of California, Los Angeles, CA, USA
Kai Yang & Daniel Schwalbe-Koda
Department of Civil and Environmental Engineering, University of California, Los Angeles, CA, USA
Kai Yang

Authors

Kai Yang
View author publications
Search author on:PubMed Google Scholar
Daniel Schwalbe-Koda
View author publications
Search author on:PubMed Google Scholar

Contributions

D.S.-K. conceived the research. K.Y. and D.S.-K. conducted the investigations, developed the methodology, and performed the formal analysis. K.Y. curated the data, developed the software, and validated the results. Both authors contributed to the visualizations and manuscript writing. D.S.-K. supervised the research and was responsible for funding acquisition.

Corresponding author

Correspondence to Daniel Schwalbe-Koda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, K., Schwalbe-Koda, D. A generative diffusion model for amorphous materials. npj Comput Mater 12, 29 (2026). https://doi.org/10.1038/s41524-025-01901-1

Download citation

Received: 30 August 2025
Accepted: 25 November 2025
Published: 10 December 2025
Version of record: 14 January 2026
DOI: https://doi.org/10.1038/s41524-025-01901-1