Machine learning surrogate for 3D phase-field modeling of ferroelectric tip-induced electrical switching

Alhada–Lahbabi, Kévin; Deleruyelle, Damien; Gautier, Brice

doi:10.1038/s41524-024-01375-7

Download PDF

Article
Open access
Published: 30 August 2024

Machine learning surrogate for 3D phase-field modeling of ferroelectric tip-induced electrical switching

Kévin Alhada–Lahbabi ORCID: orcid.org/0009-0006-5514-0088¹,
Damien Deleruyelle¹ &
Brice Gautier¹

npj Computational Materials volume 10, Article number: 197 (2024) Cite this article

3777 Accesses
11 Citations
Metrics details

Subjects

Abstract

Phase-field modeling offers a powerful tool for investigating the electrical control of the domain structure in ferroelectrics. However, its broad application is constrained by demanding computational requirements, limiting its utility in inverse design scenarios. Here, we introduce a machine-learning surrogate to accelerate 3D phase-field modeling of tip-induced electrical switching. By dynamically handling the boundary conditions, the surrogate achieves accurate reproduction of switching trajectories under various tip locations and applied voltages. With stable predictions throughout entire morphological evolution pathways and a relative error inferior to 10% compared to direct solvers, the model efficiently emulates intricate switching sequences. By successfully replicating the boundary conditions, the presented framework strides towards a holistic surrogate for the ferroelectric phase field. With up to 2500-fold speed-ups over classical methods, our approach opens the path for the tractable design of the domain structure and the resolution of realistic inverse problems.

Local and correlated studies of humidity-mediated ferroelectric thin film surface charge dynamics

Article Open access 05 October 2021

On-demand nanoengineering of in-plane ferroelectric topologies

Article Open access 26 September 2024

Mapping electric fields and observation of ferroelectric domain switching in hafnia-zirconia devices by electron holography

Article Open access 18 December 2025

Introduction

Ferroelectric thin films hold promise for the future of modern nanoelectronic devices¹. Given their potential applications in nonvolatile memories^2,3, extensive research efforts have been directed towards manipulating the domain structure, employing either electrical^4,5 or mechanical stresses to accomplish domain switching^6,7.

In recent years, the manipulation of ferroelectric domain walls (DWs) has garnered substantial attention, revealing topological entities with distinct properties compared to traditional ferroelectric domains^4,8,9,10. Specifically, the observed electrical conductivity near DWs has prompted the emergence of DW nanoelectronics, enabling information storage in these regions rather than within the domains themselves. However, DW memory devices hinge on strategic wall placement, thereby requiring precise control of domain states. Currently, DW engineering often employs electrode setups and metallic scanning probe tips, strategically triggering electrical switching to design domain structures^4,8. Yet, polarization reversal typically exhibits intricate dynamics^2,3, underscoring the necessity for a comprehensive understanding of ferroelectric switching mechanisms.

Phase-field modeling stands out as a prominent mesoscale computational technique, offering valuable physical insights into ferroelectric materials^11,12,13. Based on energetic considerations, it is commonly employed to elucidate the domain dynamics encountered in experimental scenarios^{10,14,15,16,17}. However, its broader adoption is impeded by the substantial computational cost associated with solving complex partial differential equations (PDEs), underscoring the need for faster alternative methods.

Nowadays, machine-learning surrogate models have garnered significant attention for expediting phase-field simulation, due to their capacity to swiftly infer solutions for complex systems of PDEs^{18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33}. These surrogate models, often designed as explicit time-steppers, forecast the subsequent microstructural state based on information from the current input state.

A common approach involves employing dimensionality reduction techniques, such as principal component analysis (PCA) or autoencoders (AE), thus facilitating more efficient learning of trajectory dynamics^19,24,27,29. For instance, Montes de Oca Zapiain et al. introduced a framework utilizing PCA and recurrent neural networks, demonstrating high accuracy and remarkable speedups in emulating the two-phase mixture problem¹⁹.

Alternatively, some research groups have opted for convolutional neural networks (CNNs) as surrogate models^18,27,29,30. Leveraging the inherent image-based structure of phase-field microstructures, CNNs utilize morphology grid representations directly as input. Recent investigations highlight their potential in successfully inferring ferroelectric microstructural evolutionary pathways¹⁸.

A distinct strategy for developing machine-learning emulators involves physics-informed neural networks (PINNs)^22,34. PINNs incorporate system-specific physical knowledge during training, constructing a physically constrained loss function, and have shown remarkable efficacy in addressing PDE-based problems^35,36. A recent milestone by Lu et al. unveiled Deep Operator Networks (DeepONet), an innovative framework adept at learning the intrinsic nonlinear operator directly from the data²¹. DeepONet-based approaches have successfully been applied to general phase-field problems, where they exploit the free energy as a physics-informed loss function^20,37.

The recent synergy of phase-field modeling and reinforcement learning (RL) has yielded breakthrough results in material inverse design^38,39,40. In this context, the specified microstructure serves as the target state, while an RL agent, able to manipulate boundary conditions, learns and implements an optimal strategy to achieve this configuration. In a recent study, Vasudevan et al. explored the application of RL for microstructure optimization, aiming to uncover the physical mechanisms behind enhanced material properties⁴⁰. Utilizing a 2D phase-field model, RL agents were assigned the task of reaching energetically unfavorable configurations, leading to the development of non-intuitive strategies for material design optimization.

In a notable advancement, Smith et al. employed RL to electrically design domain structures using a piezoresponse force microscopy (PFM) tip in an automated manner⁴¹. By constructing a physical surrogate of domain dynamics based on extensive PFM experiments, they trained an RL agent to optimize tip trajectories to replicate target DW structures. While their experimental surrogate yielded impressive results, employing phase-field modeling as the physical environment for trajectory optimization could extend exploration to more diverse situations and complex phenomena. Unfortunately, traditional phase-field methods are considered prohibitively expensive for such scenarios, given RL’s requirement for thousands to millions of state transitions for meaningful policy learning, as highlighted by the authors. The development of fast surrogate models is therefore crucial to fully leverage RL’s potential and expedite material inverse design tasks.

In a prior study, we introduced a novel CNN-based surrogate to the ferroelectric phase field to efficiently infer the temporal evolution of domain formation in PbZr_xTi_1−xO₃(PZT) in 2D¹⁸. By incorporating physical biases, our model achieves accurate long-term forecasts of morphological trajectories, offering over 600× speedup compared to high-fidelity solvers. Unfortunately, the framework was limited to 2D domain formation with static boundary conditions. To address scenarios involving the electrical design of the domain structure, a 3D surrogate capable of replicating time-evolving boundary conditions becomes necessary.

In this work, we introduce a machine learning approach to significantly accelerate 3D phase-field modeling of tip-induced electrical switching. Our framework incorporates dynamic boundary conditions to accurately capture domain dynamics across diverse morphological evolution pathways under complex electrical switching trajectories. Notably, the model successfully emulates tip-induced switching for various tip locations, applied voltages, and application times. Demonstrating high accuracy, with a relative error below 10% compared to traditional phase-field methods, and achieving an acceleration factor of up to 2500, our model serves as a computationally efficient surrogate for investigating the electrical control of polarization in both direct and inverse problems.

Results

Learning tip-induced electrical switching with machine-learning

In this study, our primary goal is to develop a surrogate capable of accurately reproducing the electrical reversal of polarization induced by an atomic force microscopy (AFM) tip. Moreover, for flexible application across diverse situations, the model must handle arbitrary tip placements on the film surface and a broad spectrum of specified voltages. In this section, we focus on introducing the methodology used to forecast electrical domain switching trajectories using machine learning.

Surrogate model operation

In ferroelectric phase-field modeling, the temporal evolution of the microstructure is governed by the time-dependent Ginzburg–Landau (TDGL) equation¹²

$$\frac{\partial {\mathcal{P}}({\boldsymbol{r}},t)}{\partial t}=-L\frac{\delta \psi }{\delta {\mathcal{P}}({\boldsymbol{r}},t)}$$

(1)

where ${\mathcal{P}}({\boldsymbol{r}},t)$ is the spontaneous polarization, L is the kinetic coefficient, and ψ signifies the total free energy. For a detailed description of the phase-field methodology and incorporation of the tip-induced electrical boundary conditions, please refer to the dedicated in the “Methods” section.

This study presents a surrogate model designed as an explicit time-stepper to replace the TDGL equation. Based on the current state ${X}^{{t}_{k}}$ at time t_k, the model forecasts the subsequent microstructural state ${X}^{{t}_{k+1}}$ at time t_k+1 through the operation:

$${X}^{{t}_{k+1}}={\mathcal{S}}({X}^{{t}_{k}}),$$

(2)

in which ${\mathcal{S}}$ is an operation representing the neural network’s forward pass.

The microstructural morphology can be effectively characterized at any time t_k by the polarization components [${{\mathcal{P}}}_{x}^{{t}_{k}}$, ${{\mathcal{P}}}_{y}^{{t}_{k}}$, ${{\mathcal{P}}}_{z}^{{t}_{k}}$] and the electrostatic potential ${{\mathcal{V}}}^{{t}_{k}}$¹⁸. To accommodate changes in boundary conditions along the domain-switching trajectory, the machine-learning framework also receives the tip-related boundary conditions as inputs at time t_k. Specifically, the tip location $[{y}_{{\rm {tip}}}^{{t}_{k}},{z}_{{\rm {tip}}}^{{t}_{k}}]$, and prescribed voltage ${u}_{{\rm {T}}}^{{t}_{k}}$, are incorporated to succinctly characterize the tip’s action. Thus, the microstructural state representation at time t_k can be expressed as

$${X}^{{t}_{k}}=[{{\mathcal{P}}}_{x}^{{t}_{k}},{{\mathcal{P}}}_{y}^{{t}_{k}},{{\mathcal{P}}}_{z}^{{t}_{k}},{{\mathcal{V}}}^{{t}_{k}},{u}_{{\rm {T}}}^{{t}_{k}},{y}_{{\rm {tip}}}^{{t}_{k}},{z}_{{\rm {tip}}}^{{t}_{k}}]$$

(3)

The surrogate must then adeptly learn to predict the microstructure one time-step Δt ahead:

$$({{\mathcal{P}}}_{x}^{{t}_{k+1}},{{\mathcal{P}}}_{y}^{{t}_{k+1}},{{\mathcal{P}}}_{z}^{{t}_{k+1}},{{\mathcal{V}}}^{{t}_{k+1}})={\mathcal{S}}({X}^{{t}_{k}})$$

(4)

By iteratively using its predictions, the network can generate rollout predictions from the initial state at t₀ to the final time t_N across times t = {t₀, …, t_N}, formally expressed as

$${X}^{{t}_{N}}={{\mathcal{S}}}^{N}(\ldots {\mathcal{S}}({X}^{{t}_{0}}))$$

(5)

To mimic the incremental update of the polarization field governed by the TDGL equation, the polarization output [${{\mathcal{P}}}_{x}^{{t}_{k+1}}$, ${{\mathcal{P}}}_{y}^{{t}_{k+1}}$, ${{\mathcal{P}}}_{z}^{{t}_{k+1}}$] was calculated using the residual learning approach, consistent with a previous study¹⁸.

Surrogate model architecture

The surrogate model employed a 3D CNN based on an encoder–decoder architecture, similar to an anterior work¹⁸. Specifically, we adopted a 3D U-Net with skip connections, a well-established architecture in computer vision⁴².

At each time step, the model receives the current microstructural state as input, denoted ${X}^{{t}_{k}}$. It is important to note that the boundary conditions, represented by ${[{u}_{{\rm {T}}},{y}_{{\rm {tip}}},{z}_{{\rm {tip}}}]}^{{t}_{k}}$ are scalar values, whereas the concatenation ${[{{\mathcal{P}}}_{x},{{\mathcal{P}}}_{y},{{\mathcal{P}}}_{z},{\mathcal{V}}]}^{{t}_{k}}$ follows the grid shape [N_x, N_y, N_z, 4]. During prediction, these scalar boundary conditions are directly integrated into the encoder’s latent space.

Initially, the ${[{{\mathcal{P}}}_{x},{{\mathcal{P}}}_{y},{{\mathcal{P}}}_{z},{\mathcal{V}}]}^{{t}_{k}}$ inputs are fed into the encoder, extracting essential features, and encoding them into a 1D latent vector. At this stage, the scalar boundary conditions are concatenated with the latent encoding. This combined representation is then fed into a multi-layer perceptron (MLP) for further information processing within the latent space. Subsequently, the decoder progressively upsamples the latent information back to the original input shape, ultimately predicting the subsequent state ${X}^{{t}_{k+1}}={[{{\mathcal{P}}}_{x},{{\mathcal{P}}}_{y},{{\mathcal{P}}}_{z},{\mathcal{V}}]}^{{t}_{k+1}}$ at the next time step. Detailed information about the network architecture, including a comprehensive report of the hyperparameters used in different model layers, is provided in Supplementary Note 1.

Training loss error

In this work, the model was trained in a supervised fashion, employing the ${{\mathcal{L}}}_{2}$ error loss function as detailed in the “Methods” section. The training loss is expressed as

$${\mathcal{L}}={{\mathcal{L}}}_{2}({Y}^{{t}_{k+1}},{\mathcal{S}}({X}^{{t}_{k}}))$$

(6)

Here, ${Y}^{{t}_{k+1}}$ represents the microstructure labels obtained from high-fidelity phase-field simulations and ${\mathcal{S}}({X}^{t})$ denotes the model outputs. Specifically, the loss formulation involves a contribution of the output components:

$${\mathcal{L}}={{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{x}}+{{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{y}}+{{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{z}}+{{\mathcal{L}}}_{2}^{{\mathcal{V}}}$$

(7)

where ${{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{x}}$, ${{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{y}}$, ${{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{z}}$ and ${{\mathcal{L}}}_{2}^{{\mathcal{V}}}$ denote the polarization and electrostatic potential components of the total loss, and the subscripts distinguish between the variable components.

Electrical switching prediction of c⁺/c⁻ domains

In this section, we begin by first demonstrating our approach with a PZT thin film vertically oriented along the (001) direction as a representative model, a common system for studying domain switching dynamics^43,44,45. In these structures, the significant lattice compressive mismatch (see the “Methods” section) promotes a domain structure comprised solely of vertical c⁺/c⁻ domains. Consequently, solely the out-of-plane component of the polarization ${{\mathcal{P}}}_{x}$ and the electrostatic potential ${\mathcal{V}}$ are considered as microstructural inputs within this section.

Dataset

This section details the construction of a diverse and representative dataset of tip-induced switching trajectories, crucial to guarantee comprehensive learning of electrical switching dynamics. Aiming to construct a surrogate model for designing electrical domain structures, each trajectory was initiated with a single vertically oriented monodomain. At every grid point, the polarization was uniformly set either to—P_c0 or P_c0, with a randomly assigned direction (Upward or Downward) for each simulation. Importantly, applying uniform electrical poling for a sufficient duration readily achieves these desired states, offering a realistic starting point for designing the domain state.

Each trajectory consisted of 10 distinct switching events, each programmed to last 200Δt, where Δt denotes the time-step employed in the phase-field simulations. Consequently, the total simulation duration spanned 2000Δt. For each event, the tip location (y_tip, z_tip) was randomly chosen on the sample surface, as depicted in Fig. 1a, b. The prescribed voltage u_T was randomly selected from the distribution shown in Fig. 1c. This distribution encompassed a range of voltages leading to electric fields approaching and exceeding the film’s coercive field, thereby ensuring frequent occurrences of electrical switching. Notably, voltages corresponding to electric fields below the coercive field, incapable of inducing domain reversal, were also included to train the model on the nuanced relationship between applied voltage and domain dynamics.

**Fig. 1: Distribution of the training data and parameters in the training dataset.**

Additionally, within each switching event spanning 200Δt, the tip application time t_app is randomly selected from the distribution shown in Fig. 1d. This distribution covers a range of approximately 50Δt–150Δt, representing diverse tip interaction durations. Throughout the remaining timesteps of each event, the domain undergoes relaxation without any applied voltage. This methodology ensures the model learns not only the temporal aspects of electrical switching but also the subsequent domain dynamics, including potential outcomes such as domain nucleation on the black electrode or back-switching to the initial state.

The microstructure state was then recorded at uniformly spaced time intervals of 20Δt, resulting in trajectories comprising 100 frames per simulation ({t₀, …, t₁₀₀}). Here, we conducted 1400 phase-field simulations to model ferroelectric domain switching on a system size of N_x × N_y × N_z = 16 × 32 × 32, producing $[{{\mathcal{P}}}_{x}^{{t}_{k}},{{\mathcal{V}}}^{{t}_{k}}]$ sequences with a shape of (100, 16, 32, 32, 2). The tip-related electrical boundary conditions [y_tip, z_tip, u_T] were recorded at the same intervals along the trajectory as scalar values, resulting in a tensor of shape (100, 3). The dataset was subsequently divided into a training dataset (1000 simulations), a validation dataset (200 simulations), and a test dataset (200 simulations).

Figure 1e, f illustrates the distributions of polarization ${{\mathcal{P}}}_{x}$ and electrostatic potential ${\mathcal{V}}$ within the training dataset. An overview of a typical electrical domain switching trajectory from the training dataset is presented in Fig. 2a, depicting the evolution of the polarization and electrostatic potential variables, through a sequence of 10 tip-induced switching events.

**Fig. 2: Illustration of the machine learning framework in the case of c⁺/c⁻ domains.**

The inherently complex nature of phase-field simulations generates highly intricate and nonlinear trajectories. In this study, we employed principal component analysis (PCA) to facilitate a clear and concise visualization of the switching trajectories^18,19 (Details on PCA are given in Supplementary Note 2). Figure 2b illustrates five arbitrarily chosen training dataset trajectories in the low-dimensional space delineated by the initial three principal components, for the polarization and electrostratic potential. In this representation, each switching event within a trajectory is characterized by abrupt directional changes, facilitating the observation of alterations in the boundary conditions across the simulation. Furthermore, this visualization approach effectively underscores the extensive diversity of scenarios encompassed within the generated datasets. Finally, the surrogate model architecture specifically tailored for the prediction of electrical switching in c⁺/c⁻ domains is presented in Fig. 2c.

The model was trained on the 1200 structures comprising the training/validation dataset over 100 epochs (refer to the “Methods” section for Training details). The training history, illustrated in Supplementary Fig. 1, depicts the evolution of the total ${{\mathcal{L}}}_{2}$ loss, as well as its two components (${{\mathcal{L}}}_{2}^{{{\mathcal{P}}}_{x}}$ and ${{\mathcal{L}}}_{2}^{{\mathcal{V}}}$), during the training process. Following training, the model’s performance was evaluated on the 200 test simulations, assessing the model’s ability to accurately forecast the microstructure using both direct one-step and long-timestep rollout strategies.

Evaluation of one-step prediction

The performance of one-step predictions is quantified using the ${{\mathcal{L}}}_{2}$ mean squared error (MSE) and ${{\mathcal{L}}}_{1}$ mean absolute error (MAE) metrics in Table 1 for the two output fields. Detailed metric computation procedures are described in the “Methods” section.

Table 1 Performance evaluation of the model for one-step prediction

Full size table

The model demonstrates remarkable accuracy in forecasting subsequent microstructural states during electrical switching, achieving a quantitative MAE of 2.86 × 10⁻³ C/m² for the polarization field and 0.11 mV for the electrostatic potential. MSE values are also notably low, at 1.10 × 10⁻⁵ and 7.79 × 10⁻⁵ for ${{\mathcal{P}}}_{x}$ and ${\mathcal{V}}$, respectively, highlighting the model’s ability to capture the influence of boundary condition modifications and anticipate domain state dynamics.

While a model demonstrates accuracy for one-step predictions, it may not necessarily effectively replace high-fidelity methods, especially for longer time frames requiring full simulation unfolding. This can lead to a consistent accumulation of errors over the trajectories, highlighting the crucial need for robust models. Therefore, a thorough assessment is essential to evaluate the model’s ability to sustain error accumulation and ensure stable predictions over long time intervals.

Evaluation of unrolled prediction

For the rollout trajectories, simulations were initiated from diverse initial frames to evaluate the model’s robustness in scenarios with potential error accumulation. Our primary objective is to develop a surrogate that can effectively replace the high-fidelity phase field for a maximized number of frames while forecasting the switching process, leading to significant computational acceleration. As such, the model’s performance was analyzed across a spectrum of initial frames ranging from time t₀ to t₈₀ with the goal of predicting the complete morphological evolutionary pathway up to time t₁₀₀. Therefore, the surrogate unfolds the simulation from 20 (starting from t₈₀) to 100 (starting from t₂₀) timesteps.

For each starting frame in the test dataset, the mean, 25th, and 75th quartiles of the MSE and the macro average relative error (MARE) were calculated over the 200 unrolled test trajectories for the ${{\mathcal{P}}}_{x}$ ferroelectric morphology (Fig. 3). The results highlight that simulations initiated at earlier frames tend to show higher prediction errors. In fact, data-driven surrogates naturally accumulate errors over long-time step inferences, as each prediction builds upon the previous one²⁶.

**Fig. 3: Evaluation in unrolled prediction scenarios in the case of c⁺/c^- tip-induced domain switching.**

Despite the observed sensitivity to initial conditions, the model displayed noteworthy robustness and stability. Even when starting from early frames and predicting all 10 switching events, no significant error accumulation was observed. The MARE stayed consistently below 10% across test samples, even for full simulation unrolling. In particular, the mean MARE hovered around 6% when starting from the t₀ state. Even better accuracy was achieved for predictions starting from slightly later frames, between t₂₀ and t₃₀ (corresponding roughly to 7 switching events). In these cases, the MARE dropped below 5%. Further reduction of the forecasted timesteps significantly enhances accuracy, ultimately yielding a 2% MARE.

Following these guidelines, it becomes feasible to define an error threshold for the surrogate, aligning with the accuracy requirements imposed by the application. This threshold would determine the acceptable number of predictable timesteps by the surrogate. Additionally, a hybrid solver approach could be envisaged, periodically incorporating high-fidelity phase-field iterations to restore microstructure state accuracy. This restored state could then be used for a new surrogate prediction sequence, as demonstrated in comparable literature²⁶.

Interestingly, both the mean and quartile error curves demonstrate consistent oscillations that coincide with individual switching events. This suggests that the model’s performance is sensitive to the specific initial state within a switching event. Notably, the error increases as the initial state approaches the end of a tip application period. This finding implies that the model performs best when starting its prediction at the very beginning of the tip application.

Illustration of forecasted trajectories

Figure 4 depicts the model’s ability to predict domain switching in a complete test simulation, covering the trajectory from t₀ to t₁₀₀ and including 10 switching events. The final outputs of ${{\mathcal{P}}}_{x}$ (Fig. 4a) and ${\mathcal{V}}$ (Fig. 4b) at t₁₀₀ closely match ground truth values. With high accuracy with minimal error accumulation during microstructure evolution, the surrogate proves its efficacy in anticipating domain dynamics during tip-induced electrical switching events. To provide a concise representation of the entire trajectory, both the ground truth and model predictions are depicted in the PCA space in Fig. 4 at each discrete timestep (t₀, …, t₁₀₀). Crucially, the dynamics of the reference solver are faithfully reproduced, capturing the overarching trends in the ${{\mathcal{P}}}_{x}$ and ${\mathcal{V}}$ sequences, respectively. Additional insights into the internal structure of the surrogate predictions are given from the 2D cross-sectional views presented in Supplementary Fig. 2.

**Fig. 4: Illustration of an unrolled model prediction versus the high-fidelity reference solution, initialized at time t₀ for a test trajectory.**

A detailed overview of the domain state evolution during switching forecasting is presented in Fig. 5. Here, the model serves as a complete replacement for the reference phase-field, unfolding a test simulation from t₀ to t₁₀₀. The corresponding domain state evolution is depicted at various time steps across the simulation (t₅, t₂₅, t₅₀, t₇₅, and t₁₀₀) for both the prediction and ground truth. Remarkably, the model closely mimics the true domain dynamics with impressive consistency throughout the simulation. While the model closely tracks the overall domain evolution, minor timing-related discrepancies exist. These mainly appear as slight overestimation of domain shrinkage (e.g., at t₂₅), ultimately having minimal impact on the final state. Conversely, the model occasionally diverges more significantly towards the trajectory’s end, missing a small domain formation (e.g., at t₂₅).

**Fig. 5: Detailed temporal representation of the surrogate prediction for a c⁺/c⁻ tip-induced domain switching test trajectory.**

These findings highlight the model’s overall effectiveness in capturing domain dynamics but also point to areas for further improvement. By successfully replicating entire tip-induced switching sequences with a relative error below 10%, the model demonstrates a remarkable ability to capture the fundamental physical trend governing electrical domain switching. This achievement underscores the model’s potential to serve as a viable alternative to computationally demanding direct numerical solvers, offering a valuable compromise between computational efficiency and accuracy.

Unveiling generalization with unseen domain structure

While the model succeeds at predicting domain switching starting from single-domain states, a critical step for practical use involves its ability to handle unfamiliar initial domain structures. While pooling prior to electrical domain design might be an option in some cases, real-world applications may require operation on randomly configured domains. Therefore, a key question is whether the model can generalize its predictions to arbitrary conditions.

To assess the model’s ability to handle unseen domains, a new test set of 200 simulations (each with 10 switching events) was created. Here, these simulations did not start from single-domain states. Instead, they began with arbitrary domain structures resulting from natural domain formation. For each simulation, prior to tip-induced switching, the polarization was randomly initialized at each grid point, following a uniform distribution between ${{\mathcal{P}}}_{c0}$ and $-{{\mathcal{P}}}_{c0}$. Then, a classical domain formation process was simulated until equilibrium was reached (see the “Methods” section). The final domain state then became the starting point (t₀) for the tip-induced electrical trajectory. Supplementary Fig. 3 showcases representative switching trajectories initiated from diverse, realistic domain configurations. These complex starting states reflect real-world scenarios and lead to more intricate dynamics during tip applications, as seen in the figure.

The model’s performance on unseen initial states was directly assessed for unrolled trajectories. The results, reported in Fig. 6, demonstrate accuracy levels comparable to the single-domain cases, with consistently low MARE even for long-term predictions (100 frames), averaging below 6%. These findings highlight the model’s ability to generalize to unseen domain structures, accurately predicting 10 switching events without compromising accuracy.

**Fig. 6: Evaluation in unrolled prediction scenarios for c⁺/c⁻ domain switching initiated from realistic domain state configurations.**

Finally, Fig. 7 illustrates the model’s predictions initiated from arbitrary domain configurations throughout an entire simulation. The ${{\mathcal{P}}}_{x}$ microstructure inferred by the surrogate is compared with the corresponding ground truth for the final states, along with its representation in the PCA space. Remarkably, even when starting from unseen initial configurations, the dynamical pathways produced by the surrogate exhibit significant agreement with the high-fidelity trajectories. These observations underscore the surrogate’s comprehension of the underlying evolution equation governing domain switching dynamics, thereby exhibiting remarkable generalization to unseen scenarios and enabling exploration of real-world applications.

**Fig. 7: Illustration of an unrolled model prediction versus the high-fidelity solution, initialized at time t₀ from a realistic domain state configuration.**

Electrical switching prediction of a/c domains

In this section, we address the case of electrical switching in a/c ferroelectric domain states, which are commonly examined in the field of domain and DW control engineering^46,47,48. These structures are characterized by mechanical boundary conditions that allow for both in-plane and out-of-plane polarization orientations. When subjected to a tip-induced electric field, such systems have the potential to exhibit both out-of-plane and in-plane domain switching. Thus, all components of the polarization vector (${{\mathcal{P}}}_{x}$, ${{\mathcal{P}}}_{y}$, ${{\mathcal{P}}}_{z}$) were taken into account during the training of the machine learning surrogate specifically developed for predicting a/c domain switching dynamics in this section.