Multi-target digital material design via a conditional denoising diffusion probability model

Yue, Wei; Gao, Yuan; Pan, Zhenliang; Sui, Fanping; Lin, Liwei

doi:10.1038/s41524-025-01759-3

Download PDF

Article
Open access
Published: 11 August 2025

Multi-target digital material design via a conditional denoising diffusion probability model

Wei Yue¹^na1,
Yuan Gao¹^na1,
Zhenliang Pan²,
Fanping Sui¹ &
…
Liwei Lin¹

npj Computational Materials volume 11, Article number: 260 (2025) Cite this article

1398 Accesses
Metrics details

Subjects

Abstract

Multi-target digital material design has been challenging due to the expansive design space and instability of traditional methods in satisfying multiple objectives. This work proposes and demonstrates a customizer based on a classifier-free, conditional denoising diffusion probability model (cDDPM) to efficiently create the layouts of digital materials meeting the design goal of multiple mechanical properties all together. A case study has been conducted based on a micro mechanical resonator with four pre-assigned resonant frequencies. Using 29,430 samples generated via finite element analysis (FEA), the cDDPM is trained to simultaneously customize up to four vibrational modes, achieving over 95% prediction accuracy. Furthermore, the cDDPM approach also shows superior performances in the single-target customization for up to 99% in prediction accuracy when compared with traditional conditional generative adversarial networks (cGANs). As such, the proposed design framework provides a highly customizable and robust methodology for the design of complicated digital materials.

Microstructure reconstruction of 2D/3D random materials via diffusion-based deep generative models

Article Open access 29 February 2024

Data-driven discovery of 2D materials by deep generative models

Article Open access 11 November 2022

Self-supervised optimization of random material microstructures in the small-data regime

Article Open access 21 March 2022

Introduction

The rapid development of additive manufacturing technologies such as 3D printing^1,2,3 has produced highly refined and complex digital material structures. As manufacturing complexity greatly increases, traditional design methodologies encounter inevitable challenges in realizing optimal structural design among an extremely large number of possible solutions. For example, conventional approaches based on theoretical analysis⁴, bioinspiration^5,6, or Edisonian methods^7,8 are not effective to find global optimum for additive manufacturing. Recently, the concept of digital material (DM) has been introduced to integrate advanced design methods with additive manufacturing^9,10. In principle, complex structures are discretized into a distribution of voxels to eliminate parameters related to macroscopic shapes, such as curvature and aspect ratio, and simple mathematical forms, such as matrices, are used in advanced design processes based on machine learning schemes. By leveraging neural networks, diverse and complex functions can construct and connect the underlying relationships between design inputs and outputs, which removes explicit physical explanations and accelerates design progresses. For instance, supervised learning schemes with efficient predictors have been utilized to speedily estimate the mechanical properties in the design process, such as stiffness and toughness¹¹, thermal conductivity¹², material defects¹³, and vibration frequency¹⁴ as compared with conventional numerical simulations. The inverse design problem of DM structures has also seen good progress by gradient-based topology optimizations^15,16 and reinforcement learning approaches¹⁷. However, one drawback for these DM design approaches is that they have to be reevaluated from the beginning even if there is a small change in setting up the targets.

The rise of generative models offers a promising solution to this problem. By applying variational autoencoders (VAEs)^18,19 and conditional generative adversarial networks (cGANs)^20,21, a single model could address the single-target customization problem with a one-time effort. On the other hand, practical problems often involve multiple requirements and targets simultaneously. For example, DM designs may aim for a structure with both high stiffness and high toughness or a structure with precisely controlled frequency response across multiple vibrational modes, and either VAEs or cGANs struggle to provide effective solutions. Specifically, VAEs are based on a multivariate Gaussian distribution for all data, which often leads to suboptimal solutions. Meanwhile, cGANs require the simultaneous training of both generator and discriminator networks and suffer from mode collapse issues and the loss of diversity^22,23,24. Recently, AI-based painting has utilized various generative models, including stable diffusion²⁴ and DALLE-2²⁵ to successfully generate images by simulating the reverse process of physical diffusions for arbitrary image generations^26,27. Furthermore, conditional diffusion models (CDMs) have been utilized for topology optimization²⁸ and metamaterial design²⁹, with superior performances over cGANs. In this work, the cDDPM^26,30 algorithm is used to efficiently address multi-target customization challenges in DM designs. Specifically, a planar resonator structure is used as a case study to demonstrate efficient, stable, and accurate multi-target customization by simultaneously meeting four modal frequency requirements under various material and geometric constraints. A total of 29,430 FEA simulation cases are used as the dataset and feasible designs are obtained within 600 milliseconds with 95% accuracy for all four target frequencies via an A100 GPU. This case exemplifies the efficiency and accuracy of the proposed customization scheme which could be applicable for other DM design problems.

Results

System architecture

The system architecture follows a similar process in a text-to-image problem, where text descriptions are used as the target inputs and images containing these descriptions are the outputs. Specifically, the proposed customizer accepts a series of target properties as the inputs while designs meeting these requirements are the outputs as illustrated in Fig. 1a. Using a planar resonator structure in a typical MEMS (Microelectromechanical Systems) surface manufacturing process^31,32 as an example, this pixelated system consists of an anchor at the center, an outer ring, and an adjustable labyrinth-like pattern between them^33,34. Appropriate patterns are to be generated to endow the resonator with target frequencies corresponding to one or more vibrational modes. The denoising process starts with pixelated random noises and continues with the generation of labyrinth-like patterns, which gradually evolve into coarse structures and later fine structures. Random orphan pixels are gradually agglomerated into larger pixel blocks to form intricate beam structures. The flow chart is depicted in Fig. 1b. The denoising process operates in the continuous floating-point space and begins with a Gaussian noise, ${x}_{N}$, which is a randomly sampled representation constituting one quarter of a labyrinth-like pattern under the symmetry constraint. A trained U-Net performs denoising at each step, with network parameters modified based on the current time step, $t$, and target property, $c$, to progressively refine the design from ${x}_{t}$ to ${x}_{t-1}$. The U-Net follows a standard encoder-decoder structure with advanced features such as time embedding, condition embedding, attention mechanism, and residual connection to enhance learning. After $N=800$ denoising steps, the process concludes at $t=0$ and outputs the desired design ${x}_{0}$. The details of the network architecture and the training process are provided in Methods section and Figs. S1 and 2.

**Fig. 1: Overview of the customization process for digital material (DM) design with multiple-target properties.**

Dataset preparation

A total of 29,430 resonators are randomly generated by the depth-first search (DFS) approach, starting from the center anchor and extending to the outer ring with fully connected labyrinth under material and geometric constraints. A representative resonator is shown in Fig. S3. The resonator structure is made of 2 μm-thick polycrystalline silicon, with a Young’s modulus of 150 GPa, a density of 2.3 × 10³kg/m³, and a Poisson’s ratio of 0.29. The anchor is placed at the center by a square-shape stem with a side length of 1.76 μm and a thickness of 0.7 μm. The outer ring has the inner and outer diameter of 30.8 μm and 44.0 μm respectively. The connection points between springs and the outer ring are fixed, and the entire structure has two symmetrical axes. Both the beam width and the gap between the beams are 0.88 μm^34,35,36. Finite element analysis (FEA) simulations are used to calculate the resonant frequencies of each vibrational mode for every resonator and serve as “ground truth” labels. With parallelization and an Intel® Core™ Ultra 9 185H CPU (2.30 GHz), all simulations are completed in approximately 12 h. Figure 2 shows four representative fundamental modes, including the X-tilting and Y-tilting (torsion around two symmetric axes), Rotational (rotating around the central axis), and Flexural mode (vertical oscillation). All motions can be decomposed into translational motions along the x, y, and z axes as ${X}_{T}$, ${Y}_{T}$, and ${Z}_{T},$ and rotational motions along with these axes as ${X}_{R}$, ${Y}_{R}$, and ${Z}_{R}$, while the mass participating in each motion is referred as the effective mass of the vibrational mode. In the X-tilting mode, the effective mass is mainly concentrated in the y-direction translation and rotation about the x-axis. In the Y-tilting mode, the effective mass is concentrated in the x-direction translation and rotation about the y-axis. The Rotational mode and Flexural mode are primarily concentrated in rotation about the z-axis and translation in the z-direction, respectively. By comparing the effective mass distribution from the natural frequency analysis across different modes, automated mode identification and extraction can be achieved (see details in Methods section).

**Fig. 2: Representative vibrational modes and effective mass distribution of the DM-based resonator.**

Single-target customization

For the single-target customization, design results for the X-tilting mode are shown in Fig. 3a. The left side plot is the structural frequency versus the target frequency, where each point represents the best candidate selected from 32 generated candidates (the same batch size is used for the multi-target customization in this work) to achieve a high matching accuracy of over 99% throughout the feasible design domain by using FEA results as “ground truths”. When the target frequency falls into the physically infeasible domain (either too high or too low), the accuracy drops sharply and forms two nearly flat regions parallel to the x-axis. The blue, green and purple arrows are structures for lower bound, midpoint, and upper bound frequencies, respectively, and they satisfy all material and geometric constraints, including the beam width, gap distance, connectivity of the labyrinth-like pattern, and the connection points at the outer ring. The effective stiffness of the labyrinth-like patterns influences the resonant frequency. For example, a low-frequency design typically features a long beam along the x-axis near the central anchor, while a high-frequency design typically has a long beam along the y-axis at the center. These asymmetric structures in the diagonal direction help reducing the stiffness in one direction while increasing the stiffness in the perpendicular direction. The single-target customization results for the Y-tilting, Rotational, and Flexural modes are presented in Fig. S4 with an accuracy of better than 99% across the entire physically feasible domain for all modes.

**Fig. 3: Customization accuracy and representative designs for single-, dual-, and triple-target scenarios.**

The architecture from a previous study based on the cGAN method²¹ is employed to train customizers for single target tasks for comparison, using the same dataset of 29,430 data points. Four distinct cGANs are trained separately for four different vibrational modes, with their training processes shown in Fig. S5, where both the discriminator and generator stabilize after 3000 training steps. In contrast, this work utilizes a single conditional denoising diffusion probability model (cDDPM) to address all four customization tasks. As shown in Fig. S6, the performance of cGANs is inferior to that of the cDDPM scheme for single-target customization tasks to achieve barely 80% in accuracy for the X-tilting and Y-tilting modes and only 50% in accuracy for the other two modes.

Dual- and triple-target customization

Dual-target customizations are conducted for the six pairwise combinations of the four vibrational modes. The accuracy heatmap and representative resonators are shown in Fig. 3b and Figs. S7–S8, where 95% and 90% iso-accuracy lines are illustrated and the single-target customization results are used to guide the testing range. The combination of the X-tilting and Y-tilting modes leads to a traversal region of [0.32 MHz, 0.90 MHz] $\times$ [0.32 MHz, 0.90 MHz] with the reduction in accuracy when compared with those of the single-target customization tasks across the entire testing domain. In general, the average accuracy falls below 90% in less than 5% of all results (0.4%, 0.7%, 5.2%, 1.7%, 4.7%, and 4.3% for the six combinations, respectively) and an average accuracy of 95% or higher is achieved in more than 80% of the region (92.4%, 94.5%, 81.9%, 91.4%, 83.2%, and 80.0% for the six combinations, respectively). The three resonator designs on the right side satisfy all material and geometric constraints with different labyrinth-like patterns. As the number of the target frequency increases, the number of qualified resonator designs decreases such that generated resonator designs will progressively be deterministic. The accuracy of the dual-target customization is constrained by two main factors. First, it has more constraints and fewer relevant data points as compared with those of the single-target customization task. Increased size and quality of the dataset can help alleviate the problem. Second, the interdependence between different modes may shrink the physically feasible domain. In general, a two-dimensional region is determined by the physically feasible domains of the two resonant modes. However, some points within this region are not feasible as different modes can be interrelated and are not independent. For example, some points in the combination of the rotational mode and flexural mode require an extremely low rotational frequency and an extremely high flexural frequency, simultaneously. However, results show a positive correlation between the rotational frequency and flexural frequency (see Supplementary Note 1 and Fig. S9) such that extremely high flexural frequency and low rotational frequency or vice versa cannot coexist. For these cases, designs close to target resonant frequencies are provided. On the other hand, regions with low accuracy also imply physically infeasible design domains.

The single-target customization task is like finding a plane in a three-dimensional space and the dual-target customization task is like finding the intersection line between two planes. Adding the third target is like finding the intersection of three planes, or a single point, which is extremely difficult. While cGAN can achieve a certain level of accuracy in single-target customization, it is not effective to produce meaningful outputs in the DM design for dual-target or triple-target tasks. For triple-target customization, the customizer developed in this work exhibits excellent performance to match the specified resonant frequencies of three different vibrational modes. A traversal generation process is conducted within the three-dimensional region defined by the physically feasible domains of each vibrational mode. The accuracy results are presented in multi-sectional views, as shown in Fig. 3c (for the combination of X-tilting, Y-tilting, and Rotational modes) and Figs. S10 and S11 (for the other three combinations). Red and black solid lines represent the 95% and 90% iso-accuracy lines, respectively. These results indicate that 95% accuracy regions cover 73.2%, 62.9%, 63.6%, and 62.5% of the three-dimensional region for the four different combinations, while 90% accuracy regions cover 88.5%, 83.8%, 87.0%, and 84.7% of the three-dimensional region for the four different combinations, respectively. In the triple-target customization task, the interdependence between different modes further reduces the physically feasible domain, which results in more unsolvable regions. Extremely high or low modal frequencies are more likely to produce unsolvable regions as shown in the top and bottom sectional views in Fig. 3c with larger low-accuracy areas. In contrast, the mid-sectional view more closely resembles the results in Fig. 3b because the Rotational frequency constraint at 0.51 MHz is relatively weak. Three design examples are shown in Fig. 3c with identical X-tilting and Y-tilting frequencies and different rotational frequencies by using distinctly different labyrinth-like patterns. Together with the second resonator shown in Fig. 3b, a total of four different designs meeting the X-tilting and Y-tilting frequency requirement of 0.6 MHz have been presented. This demonstrates the versatility for the cDDPM-based customizer in contrast to the mode collapse issues commonly seen when using the cGAN scheme.

Customization for specific applications

The resonator in this study may respond to external forces, such as electrostatic³⁷, piezoelectric³⁸, or optical actuation³⁹, with its vibrational behavior dependent on the applied force frequency, location, and amplitude. These parameters influence which vibrational modes are predominantly excited and their relative amplitudes. In this study, the four primary fundamental vibrational modes—X-tilting, Y-tilting, Rotational, and Flexural mode—can be selectively excited by tuning the applied force characteristics, as demonstrated in Loads #1–#4 in Fig. 4a. In contrast, Load #5 in Fig. 4a represents a linear superposition of Loads #1, #2, and #4 (see Supplementary Note 2), designed to simultaneously excite three vibrational modes with a dominant z-directional displacement (excluding the rotational mode). Under this driving force, the response curve of the resonators varies according to the frequencies of these three vibrational modes. In order to optimize the performance under Load #5, the amplitude-frequency curve measured at Node A is characterized, where the z-directional displacement is a superposition of the movements due to these three modes. If the goal is to enhance the resonator’s output at a specific frequency, one possible arrangement is to align the frequencies of selected vibrational modes. To achieve this, the resonant frequencies for the X-tilting, Y-tilting, Rotational, and Flexural modes are set at 0.55 MHz, 0.55 MHz, 0.6 MHz, and 1.75 MHz, respectively. This configuration ensures that the X-tilting and Y-tilting modes overlap at 0.55 MHz, reinforcing their contribution, while the Flexural mode is positioned far away to minimize interference. For this case, the Rotational frequency is not affecting the outcome, while it is given a specific frequency to simultaneously demonstrate the performance of four-target customization. A frequency domain analysis is conducted with four different loading conditions (Loads #1-#4) to characterize the amplitude-frequency curves at Node A as shown in Fig. 4b. The amplitude of the Rotational mode is represented by the x-directional component, while the other modes are represented by the z-directional component with further details in Methods section. The load profile #1-#4 can each activate a different vibrational mode to result in a unique amplitude-frequency curve to represent the resonant frequency of the corresponding mode. Results show the structure has X-tilting frequency, Y-tilting frequency, Rotational frequency, and Flexural frequency of 0.56 MHz, 0.56 MHz, 0.60 MHz, and 1.77 MHz, respectively, for a high average accuracy of 99.0% to the pre-assigned frequencies. Under the application of Load #5, the response of the resonator with respect to frequency is shown in Fig. 4c. It is observed that the two amplitude peaks appear at 0.56 MHz and 1.77 MHz, respectively. The high peak at 0.56 MHz is due to the superposition of the first two modes with an amplitude approximately 14 times higher than that of the second peak at 1.77 MHz. The large frequency gap of 1.21 MHz between the two peaks further reduces the influences between these two peaks.

**Fig. 4: Multi-target customizations for specific applications.**

In the second case, the goal is to arrange the X-tilting, Y-tilting, and Flexural frequencies in an evenly spaced manner to increase the effective bandwidth^40,41,42 by setting the X-tilting mode, Y-tilting mode, and Flexural mode at 0.6 MHz, 0.85 MHz, and 1.1 MHz, respectively. The Rotational frequency can be left unspecified but is set at 0.55 MHz, which represents the dataset’s average value. The resulting amplitude-frequency curve is shown in Fig. 4d to have amplitude peaks at 0.60 MHz, 0.84 MHz, 0.56 MHz, and 1.14 MHz, respectively, which match the pre-set frequencies well with an average accuracy of 98.3%. Figure 4e shows the simulated amplitude-frequency response at Node A under Load #5 with a critical damping fraction of 0.2 as characterized in a previous study¹⁴. The curve exhibits three peaks in ascending order of frequency (with the third peak being almost invisible), corresponding to the X-tilting mode, Y-tilting mode, and Flexural mode, respectively. The red dashed line in the figure corresponds to the half of the maximum peak to evaluate the Full Width at Half Maximum (FWHM) value. It can be observed that the three modes are spaced evenly to effectively increase the FWHM to about 0.731 MHz (between 0.257 MHz to 0.988 MHz), which is 60% higher than the FWHM of the original X-tilting mode (see Supplementary Note 3).

Robustness

In practical applications, the robustness of the customizer is essential for stable performances even when users provide unreasonable or unsolvable requirements. Here, various physically infeasible requirements are used in the robustness tests for single-target and dual-target customization cases. Figure 5a–d demonstrate the single-target customization results for four different vibrational modes. For each vibrational mode, two physically infeasible frequencies of extremely high and low outliers, and one physically feasible frequency are tested. The physically infeasible targets are selected sufficiently far from the dataset range: 0.18 MHz, 1.04 MHz for X-tilting and Y-tilting modes; 0.25 MHz, 0.71 MHz for the Rotational mode; and 0.53 MHz, 2.77 MHz for the Flexural mode. The physically feasible targets are selected near the medium values of the dataset: 0.61 MHz for X-tilting and Y-tilting modes, 0.48 MHz for the Rotational mode, and 1.65 MHz for the Flexural mode. For each task, the probability distribution for frequencies generated from 300 designs is plotted. Results show physically feasible designs (blue vertical dashed lines) are tightly clustered around the target frequency with vast majority of them over 99% in accuracy and an average accuracy of 99.5%, 99.4%, 99.6%, and 99.4% for the four modes, respectively. For physically infeasible designs with red (low frequency) and green (high frequency) dashed lines, spread-out frequency distributions are found without reaching targe frequencies, while all designs are relatively close to target frequencies. This illustrates that the customizer can generate valid designs close to the target without losing functionality within the physically feasible domain to prevent malfunction for best possible solutions. For example, if a user needs to find a micro resonator design with the highest possible X-tilting frequency, they can simply set the target frequency to an extremely large value, and the customizer will generate a micro resonator with the highest achievable X-tilting frequency. Furthermore, the best designs generated from infeasible targets have modal frequencies close to or beyond the boundaries of the dataset. For example, multiple designs have shown higher Flexural frequencies than that of the dataset’s upper limit at 2.57 MHz in Fig. S12.

The robustness of the customizer also applies to multi-target customizations. In the case of the dual-target customization in the physically feasible domain in Fig. 5e, a micro resonator should satisfy both an X-tilting frequency and Y-tilting frequency of 0.61 MHz. The 600 generated designs form a tight distribution around the target point (red dot) with an accuracy of over 95% (the average accuracy is 97.7%), matching the observations in Fig. 3b. The one-dimensional probability distributions for both modal frequencies are also shown in the figure with similar shapes to those in the single-target customization tasks, symmetrically centered around the target frequency. In the case of the dual-target customization in the physically infeasible domain in Fig. 5f, a micro resonator should satisfy both an X-tilting and Y-tilting frequencies of 1.04 MHz. The 600 generated designs form a band-like distribution in the upper right corner of the figure with the direction parallel to the line y = -x. All generated designs in this band exhibit roughly the same average accuracy close to 77.2%. This result shows that the customizer attempts to provide feasible solutions with good accuracy, which is valuable for near-optimal solutions. In this case, the generated designs show two peaks around (0.72 MHz, 0.88 MHz) and (0.88 MHz, 0.72 MHz) with a valley near (0.8 MHz, 0.8 MHz). This implies that there are fewer designs near (0.8 MHz, 0.8 MHz), where the two modal frequencies are equal.

Discussion

An effective methodology to design DM-based resonators with multiple target properties has been developed and demonstrated. By using the conditional denoising diffusion probabilistic model (cDDPM), the proposed scheme successfully achieves high accuracy in both single- and multi-target customization tasks and significantly outperforms traditional methods such as cGAN. The customizer is capable of handling complex multi-modal frequency structures to achieve over 95% accuracy across four vibrational modes in this demonstration. The robustness of the scheme is proved to provide optimal solutions with physically infeasible or ill-posed inputs. This helps possible designs to maintain functional integrity and maximizes user satisfaction under diverse customization conditions. Results from single-target, dual-target, and multi-target customizations together with the two application-specific case studies illustrate the potential of cDDPM in DM design, paving the way for broader applications.

Methods

Training of cDDPM

The main structure of cDDPM is adopted based on the classifier-free diffusion guidance method³⁰ and DDPM²⁶. A data point is first sampled from the real data distribution ${x}_{0} \sim q({x}_{0})$ and a forward process is defined using a Markov chain to generate latent variables ${\{x}_{1},\ldots ,{x}_{T}\}$. At every timestep, Gaussian noise with variance ${\beta }_{t}\in (\mathrm{0,1})$ is added to the latent variables:

$$q\left({x}_{1},\ldots ,{x}_{T}|{x}_{0}\right)\,{{:= }}\,\mathop{\prod }\limits_{t=1}^{T}q\left({x}_{t}|{x}_{t-1}\right)$$

(1)

$$q\left({x}_{t}|{x}_{t-1}\right)\,{{:= }}\,N\left({x}_{t};\sqrt{1-{\beta }_{t}}{x}_{t-1},{\beta }_{t}I\right)$$

(2)

With a sufficiently large $T$ and an appropriate schedule for ${\beta }_{t}$, the final latent variable ${x}_{T}$ approaches an isotropic Gaussian distribution. An important property of this forward process is that ${x}_{t}$ can be calculated in closed form:

$$q\left({x}_{t}|{x}_{0}\right)=n\left({x}_{t};\sqrt{{\bar{\alpha }}_{t}}{x}_{0},\left(1-{\bar{\alpha }}_{t}\right)I\right),{\alpha }_{t}\,{{:= }}\,1-{\beta }_{t},{\bar{\alpha }}_{t}\,{{:= }}\,\mathop{\prod}\limits_{s=1}^{t}{\alpha }_{s}$$

(3)

The model learns the reverse process to generate the desired data (design) that meets the target properties, starting from random noises. Target properties are encoded as condition $c$, and the timestep is denoted as $t$. The model is defined as:

$${p}_{\theta }\left({x}_{t-1}|{x}_{t}\right)\,{{:= }}\,n\left({x}_{t-1};{\mu }_{\theta }\left({x}_{t},t,c\right),{\varSigma }_{\theta }\left({x}_{t},t,c\right)\right)$$

(4)

The combination of $q$ and $p$ forms a variational auto-encoder, and the loss function can be defined based on the Variational Lower Bound (VLB):

$$\begin{array}{ll}L={E}_{q}\left[{D}_{{KL}}(q({x}_{T}|{x}_{0})||{p}_{\theta }({x}_{T}))+\mathop{\sum}\limits_{t=2}^{T}{D}_{{KL}}(q({x}_{t-1}|{x}_{t},{x}_{0})||{p}_{\theta}({x}_{t-1}|{x}_{t}))\right.\\\qquad\left.{-log}\left({p}_{\theta }\left({x}_{0},|,{x}_{1}\right)\right)\right]={E}_{q}[{L}_{T}+\mathop{\sum}\limits_{t=2}^{T}{L}_{t-1}+{L}_{0}]\end{array}$$

(5)

The reverse conditional probability is tractable when conditioned on ${x}_{0}$:

$$q\left({x}_{t}|{x}_{t-1},{x}_{0}\right)=N\left({x}_{t};\widetilde{\mu }\left({x}_{t},{x}_{0}\right),{\widetilde{\beta }}_{t}I\right)$$

(6)

$${\widetilde{\mu }}_{t}\left({x}_{t},{n}_{t}\right)=\frac{1}{\sqrt{{\alpha }_{t}}}\left({x}_{t}-\frac{1-{\alpha }_{t}}{\sqrt{1-{\bar{\alpha }}_{t}}}{n}_{t}\right)$$

(7)

$${\widetilde{\beta }}_{t}=\frac{1-{\bar{\alpha }}_{t-1}}{1-{\bar{\alpha }}_{t}}{\beta }_{t}$$

(8)

In the model, the variance ${\varSigma }_{\theta }\left({x}_{t},t\right)$ of the Gaussian distribution is set to a constant value, as defined by the posterior variance schedule ${\widetilde{\beta }}_{t}I$. For the mean ${\mu }_{\theta }\left({{\rm{x}}}_{{\rm{t}}},{\rm{t}}\right)$, the model uses a parameterization that predicts the noise ${\epsilon }_{\theta }$:

$${\mu }_{\theta }\left({x}_{t},t,c\right)=\frac{1}{\sqrt{{\alpha }_{t}}}\left({x}_{t}-\frac{{\beta }_{t}}{\sqrt{1-{\bar{\alpha }}_{t}}}{n}_{\theta }\left({x}_{t},t,c\right)\right)$$

(9)

The loss function can be simplified by removing unnecessary coefficients:

$$\begin{array}{ll}{L}_{{simple}}={E}_{t,{x}_{0},\epsilon \sim N\left(0,I\right)}\left[{\rm{||}}\epsilon -{\epsilon }_{\theta }({x}_{t},t,c)|{|}^{2}\right]\\\qquad\quad={E}_{t,{x}_{0},\epsilon \sim N\left(0,I\right)}\left[{\rm{||}}\epsilon -{\epsilon }_{\theta }\left(\sqrt{\bar{{\alpha }_{t}}}{x}_{0}+\sqrt{1-\bar{{\alpha }_{t}}}\epsilon ,t,c\right)|{|}^{2}\right]\end{array}$$

(10)

Training the model with respect to $\theta$ is conducted using the simplified loss function ${L}_{{simple}}$, based on randomly sampled ${x}_{0}$, $t$, $c$ and $\epsilon$. The U-Net architecture, depicted in Fig. S1, is employed to predict the noise in the reverse process. After training is complete (with the loss curve shown in Fig. S2), the model generates the desired design through a sampling process. Starting from ${x}_{T} \sim N(0,I)$, ${x}_{t}$ is generated as t decreases to 0:

$$\begin{array}{c}z \sim N\left(0,I\right){if\; t} > 1,{else\; z}=0\\ {x}_{t-1}=\frac{1}{\sqrt{{{\rm{\alpha }}}_{{\rm{t}}}}}\left({x}_{t}-\frac{1-\alpha }{\sqrt{1-\bar{{\alpha }_{t}}}}{\epsilon }_{\theta }\left({x}_{t},t,c\right)\right)+{\sigma }_{t}z\end{array}$$

(11)

where $c$ represents the encoded condition corresponding to target properties.

Simulation details

A commercial software package, ABAQUS, is utilized for finite element analysis. In all simulations, the resonator structures are discretized using first-order, eight-node elements with reduced integration (C3D8R). The planar dimensions of each element correspond to the size of one pixel in the binary pattern shown in Fig. S3. The entire structure, including the anchor, outer ring, and labyrinth-like pattern, is made of polysilicon with a density of 2.3 g/cm³, a Young’s modulus of 150 GPa, and a Poisson’s ratio of 0.29.

During the natural frequency analysis (using the *frequency command in ABAQUS), a fixed boundary condition is imposed on the nodes located on the bottom surface of the anchor. A subspace eigensolver (Lanczos) is employed to determine the natural frequencies, mode shapes, and effective mass distributions for each mode, with the solver configured to output the first ten natural modes of the resonator. An automated Python interface is then used to extract the four modes of interest for this study based on the effective mass distribution.

For the four-target customization, the amplitude frequency response FEA (steady-state dynamics, modal) is conducted immediately following the natural frequency analysis, using the modal information as intermediate results. The mesh and boundary conditions are identical to those used in the natural frequency analysis. The five types of loads shown in Fig. 4a are applied as concentrated forces, with the total amplitude of each load set to be the same (the sum of all concentrated forces equals 10 N). The frequency sweep is performed by sampling 100 points within a range that includes the frequencies of the four modes, with a bias of 3 to capture variations near each peak as accurately as possible. For the simulations in Fig. 4b–d, the critical damping factor is set to 4e-4 to highlight the peak positions, while for the simulation in Fig. 4e, the critical damping factor is set to 0.2.

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files. The underlying code for this study is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author.

References

Qu, Z. et al. High fatigue resistance in a titanium alloy via near-void-free 3D printing. Nature 626, 999–1004 (2024).
Article PubMed CAS Google Scholar
Li, F. et al. 3D printing of inorganic nanomaterials by photochemically bonding colloidal nanocrystals. Science 381, 1468–1474 (2023).
Article PubMed CAS Google Scholar
Larson, N. M. et al. Rotational multimaterial printing of filaments with subvoxel control. Nature 613, 682–688 (2023).
Article PubMed CAS Google Scholar
Milton, G. W. & Sawicki, A. T. Theory of composites. Cambridge monographs on applied and computational mathematics. Appl. Mech. Rev. 56, B27–B28 (2003).
Article Google Scholar
Schaedler, T. A. et al. Designing metallic microlattices for energy absorber applications. Adv. Eng. Mater. 16, 276–283 (2014).
Article CAS Google Scholar
Valdevit, L., Jacobsen, A. J., Greer, J. R. & Carter, W. B. Protocols for the optimal design of multi‐functional cellular structures: from hypersonics to micro‐architected materials. J. Am. Ceram. Soc. 94, s15–s34 (2011).
Article CAS Google Scholar
Rafsanjani, A., Jin, L., Deng, B. & Bertoldi, K. Propagation of pop ups in kirigami shells. Proc. Natl Acad. Sci. 116, 8200–8205 (2019).
Article PubMed PubMed Central CAS Google Scholar
Wang, P., Casadei, F., Shan, S., Weaver, J. C. & Bertoldi, K. Harnessing buckling to design tunable locally resonant acoustic metamaterials. Phys. Rev. Lett. 113, 014301 (2014).
Article PubMed CAS Google Scholar
Hiller, J. & Lipson, H. Design and analysis of digital materials for physical 3D voxel printing. Rapid Prototyp. J. 15, 137–149 (2009).
Article Google Scholar
Zhang, Z. & Gu, G. X. Finite‐element‐based deep‐learning model for deformation behavior of digital materials. Adv. Theory Simul. 3, 2000031 (2020).
Article CAS Google Scholar
Gu, G. X., Chen, C. T. & Buehler, M. J. De novo composite design based on machine learning algorithm. Extreme Mech. Lett. 18, 19–28 (2018).
Article Google Scholar
Luo, Y., Li, M., Yuan, H., Liu, H. & Fang, Y. Predicting lattice thermal conductivity via machine learning: a mini review. Npj Comput. Mater. 9, 4 (2023).
Article Google Scholar
Holzmond, O. & Li, X. In situ real time defect detection of 3D printed parts. Addit. Manuf. 17, 135–142 (2017).
Google Scholar
Guo, R. et al. Deep learning for non-parameterized MEMS structural design. Microsyst. Nanoeng. 8, 91 (2022).
Article PubMed PubMed Central Google Scholar
Brackett, D., Ashcroft, I. & Hague, R. Topology optimization for additive manufacturing. Proc. Solid Free. Fabr. Symp. 348, 362 (2011).
Google Scholar
Deng, C., Wang, Y., Qin, C., Fu, Y. & Lu, W. Self-directed online machine learning for topology optimization. Nat. Commun. 13, 388 (2022).
Article PubMed PubMed Central CAS Google Scholar
Sui, F., Guo, R., Zhang, Z., Gu, G. X. & Lin, L. Deep reinforcement learning for digital materials design. ACS Mater. Lett. 3, 1433–1439 (2021).
Article CAS Google Scholar
Kim, N., Lee, D. & Hong, Y. Data-efficient deep generative model with discrete latent representation for high-fidelity digital materials. ACS Mater. Lett. 5, 730–737 (2023).
Article CAS Google Scholar
Pahlavani, H. et al. Deep learning for size‐agnostic inverse design of random‐network 3D printed mechanical metamaterials. Adv. Mater. 36, 2303481 (2024).
Article CAS Google Scholar
Mao, Y., He, Q. & Zhao, X. Designing complex architectured materials with generative adversarial networks. Sci. Adv. 6, eaaz4169 (2020).
Article PubMed PubMed Central Google Scholar
Sui, F., Guo, R., Yue, W., Behrouzi, K. & Lin, L. Customizing mems designs via conditional generative adversarial networks. Proc. IEEE Int. Conf. Micro Electro Mech. Syst. 35, 450–453 (2022).
Google Scholar
Dhariwal, P. & Nichol, A. Diffusion models beat GANs on image synthesis. Adv. Neural Inf. Process. Syst. 34, 8780–8794 (2021).
Google Scholar
Alverson, M. et al. Generative adversarial networks and diffusion models in material discovery. Digit. Discov. 3, 62–80 (2024).
Article Google Scholar
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. High-resolution image synthesis with latent diffusion models. roc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. 10684-10695 (2022).
Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., & Chen, M. Hierarchical text-conditional image generation with CLIP latents. arXiv arXiv:2204.06125 (2022).
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 33, 6840–6851 (2020).
Google Scholar
Rao, C. et al. Encoding physics to learn reaction-diffusion processes. Nat. Mach. Intell. 5, 765–779 (2023).
Article Google Scholar
Mazé, F. & Ahmed, F. Diffusion models beat gans on topology optimization. Proc. AAAI Conf. Artif. Intell. 37, 9108–9116 (2023).
Google Scholar
Zhang, J. et al. Conditional diffusion models for the inverse design of lattice structures. Struct. Multidiscip. Optim. 68, 1–23 (2025).
Article Google Scholar
Ho, J., & Salimans, T. Classifier-free diffusion guidance. arXiv arXiv:2207.12598 (2022).
Nguyen, C. C. & Howe, R. T. An integrated CMOS micromechanical resonator high-Q oscillator. IEEE J. Solid-State Circuits 34, 440–455 (1999).
Article Google Scholar
Clark, J. R., Hsu, W. T., Abdelmoneum, M. A. & Nguyen, C. C. High-Q UHF micromechanical radial-contour mode disk resonators. J. Microelectromech. Syst. 14, 1298–1310 (2005).
Article Google Scholar
Gu, L., Zhang, W., Lu, H., Wu, Y., & Fan, C. Flower-like disk resonator for gyroscopic application. Rev. Sci. Instrum. 93, 115006 (2022).
Hodjat-Shamami, M. & Ayazi, F. Eigenmode operation of piezoelectric resonant gyroscopes. Microsyst. Nanoeng. 6, 108 (2020).
Article PubMed PubMed Central Google Scholar
Stassi, S. et al. Reaching silicon-based NEMS performances with 3D printed nanomechanical resonators. Nat. Commun. 12, 6080 (2021).
Article PubMed PubMed Central CAS Google Scholar
Wen, X. et al. 3D-printed silica with nanoscale resolution. Nat. Mater. 20, 1506–1511 (2021).
Article PubMed CAS Google Scholar
Um, Y. & Nguyen, C. Wideband dual-bandpass 0.18-µm CMOS SPDT switch utilizing dual-band resonator concept. Microw. Opt. Technol. Lett. 60, 1215–1219 (2018).
Article Google Scholar
Anderson, J., He, Y., Bahr, B. & Weinstein, D. Integrated acoustic resonators in commercial fin field-effect transistor technology. Nat. Electron. 5, 611–619 (2022).
Article CAS Google Scholar
Chen, I. T. et al. Optomechanical ring resonator for efficient microwave-optical frequency conversion. Nat. Commun. 14, 7594 (2023).
Article PubMed PubMed Central CAS Google Scholar
Wang, K., Wang, G., Dai, X., Ding, G. & Zhao, X. Implementation of dual-nonlinearity mechanism for bandwidth extension of MEMS multi-modal energy harvester. J. Microelectromech. Syst. 30, 2–14 (2020).
Article Google Scholar
Feng, H. et al. Multimodal MEMS vibration energy harvester with cascaded flexible and silicon beams for ultralow frequency response. Microsyst. Nanoeng. 9, 33 (2023).
Article PubMed PubMed Central CAS Google Scholar
Zhang, F., Zhao, Y. & Ruan, C. A high-power and broadband G-band extended interaction klystron based on mode overlap. IEEE Trans. Electron Devices 69, 4611–4616 (2022).
Article Google Scholar

Download references

Acknowledgements

The authors acknowledge funding from the Berkeley Sensor & Actuator Center.

Author information

These authors contributed equally: Wei Yue, Yuan Gao.

Authors and Affiliations

Department of Mechanical Engineering, University of California, Berkeley, CA, 94720, USA
Wei Yue, Yuan Gao, Fanping Sui & Liwei Lin
College of Engineering, Purdue University, West Lafayette, IN, 47907, USA
Zhenliang Pan

Authors

Wei Yue
View author publications
Search author on:PubMed Google Scholar
Yuan Gao
View author publications
Search author on:PubMed Google Scholar
Zhenliang Pan
View author publications
Search author on:PubMed Google Scholar
Fanping Sui
View author publications
Search author on:PubMed Google Scholar
Liwei Lin
View author publications
Search author on:PubMed Google Scholar

Contributions

W.Y. proposed mechanism, designed the research, carried out simulations and data analysis. W.Y., Y.G. and F.S. prepared the dataset. W.Y., Z.P. and Y.G. carried out machine learning modeling and data visualization. W.Y., Y.G., Z.P., F.S. and L.L. participated in the manuscript drafting. L.L. and F.S. coordinated and directed this research. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Fanping Sui or Liwei Lin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yue, W., Gao, Y., Pan, Z. et al. Multi-target digital material design via a conditional denoising diffusion probability model. npj Comput Mater 11, 260 (2025). https://doi.org/10.1038/s41524-025-01759-3

Download citation

Received: 14 April 2025
Accepted: 29 July 2025
Published: 11 August 2025
DOI: https://doi.org/10.1038/s41524-025-01759-3