HfO2 has emerged as a promising material due to its high dielectric constant, ferroelectricity, thermal stability, and CMOS compatibility1,2,3. These properties make HfO2 an excellent candidate for various applications, including high-k dielectrics, resistive random-access memory (RRAM), and neuromorphic computing4,5,6,7. In these applications, the amorphous and non-stoichiometric hafnium oxide (a-HfOx) play a pivotal role in optimizing device performance8,9,10,11. For instance, a-HfOx allows for higher energy storage density and enhanced breakdown strength, which are key for advanced dielectric capacitors that require both efficiency and durability9. In the case of RRAM, oxygen vacancies within a-HfOx enable resistive switching driven by external electric fields (\({E}_{{field}}\)), facilitating reliable data storage and retrieval10. Furthermore, in neuromorphic systems, the density and distribution of oxygen vacancies in a-HfOx allow fine-tuning of resistive states under controlled \({E}_{{field}}\), enhancing the efficiency of artificial synapses12. Thus, gaining a comprehensive understanding of the structural dynamics and behaviors of a-HfOx under \({E}_{{field}}\) is critical for advancing the performance of these emerging technologies.

To explore the structural features and behaviors of a-HfOx, density functional theory (DFT)-based ab initio molecular dynamics (AIMD) simulations can be utilized to provide atomic-level insights with quantum mechanical accuracy13,14,15. However, this system exhibits complex atomic structures, necessitating large-scale simulations to fully capture their behaviors. The significant computational costs associated with AIMD, especially for systems with a large number of atoms and long simulation times, make it computationally prohibitive for such large-scale studies16. These constraints limit our ability to deeply understand how atomic interactions govern the behaviors of a-HfOx under various conditions.

One solution to overcome these limitations is to use classical molecular dynamics (CMD) simulations, which rely on empirically parameterized force fields. CMD can handle larger systems and longer simulation times than AIMD, making it cost-effective for studying the behaviors of a-HfOx17. For instance, Urquiza et al. used the reactive force field to simulate resistive switching process in a-HfOx-based RRAM cells under applied voltage, characterizing the migration of oxygen vacancies that facilitate resistive changes during the set and reset operations17,18. However, the empirical nature of the force fields used in CMD limits their accuracy compared to AIMD, particularly in capturing complex atomic interactions in a-HfOx systems19,20.

Machine learning potential (MLP) offers a promising solution by bridging the gap between AIMD’s accuracy and CMD’s efficiency21,22,23,24,25. MLP not only provides quantum-level accuracy but also enables the simulation of larger systems under complex conditions26. Recent studies have applied MLPs to explore the behaviors of a-HfOx27,28. For example, Sivaraman et al. utilized the active learned gaussian approximation potential (GAP) to simulate the atomic behaviors across a wide temperature range in a-HfOx and liquid HfO221,27. Zhang et al. used the neuroevolution potential (NEP) to investigate thermal conductivity changes in a-HfOx at high temperatures22,28. However, while these studies provided valuable insights into fundamental properties, they did not account for the effects of \({E}_{{field}}\), which are crucial for understanding a-HfOx behaviors in advanced electronic applications. Therefore, despite the growing interest in MLPs, a significant challenge remains in adapting MLPs for practical applications such as RRAM and neuromorphic systems.

To address this, we first develop the charge equilibration (QEq) integrated graph neural network (CIGNN) model, which can account for the effects of \({E}_{{field}}\). Using the datasets obtained from AIMD simulations without \({E}_{{field}}\), we train the CIGNN model and develop CIGNN-based MLP (CNMP) for a-HfOx systems. The CNMP achieves quantum mechanical accuracy in predicting atomic charge, energy, and force for these systems. It also accurately predicts structural behaviors and dynamic properties consistent with experimental results, such as oxygen ion diffusivity across various density and temperatures, as well as dielectric breakdown fields in the presence of \({E}_{{field}}\). This indicates that the CNMP effectively bridges the gap between simulations and applications in a-HfOx systems. Building on this capability, we believe that the CNMP can serve as a valuable tool, significantly aiding future research on complex phenomena in advanced electronic applications involving amorphous and non-stoichiometric systems.

Results

CIGNN model architecture and performance

CIGNN model represents the local atomic structures of materials in two ways using nodes and edges of graph representations (Fig. 1a)29. First, we construct an atom-bond graph (GAB), where the 16 closest atoms to a central atom are defined as neighboring atoms. In this graph, atoms act as nodes, and bonds between the central atom and its neighboring atoms act as edges. This graph captures the local bonding environment of each atom, focusing on atomic distances. Second, we define nearest neighboring atoms that are closer than the average distance calculated over all neighboring atoms and their respective center atoms within the structure. To ensure at least 10 nearest neighbors, we iteratively increase the average distance by 0.1 Å until the criterion is met. Using these nearest neighbors, we construct a bond-angle graph (GBA), where bonds are represented as nodes and angles between bonds are represented as edges30. This graph captures the angular relationships between bonds, providing additional geometric information. Through these two ways, we build a CIGNN model capable of simulating amorphous system by considering neighboring atoms and reconsidering nearest neighbors. The CIGNN model consists of two organically connected models: the atomic charge prediction model (CPM) and the energy and force prediction model (EFPM). The CPM predicts atomic charges based on GAB by incorporating the QEq method. The EFPM learns and predicts energy and force based on GAB, GBA, and predicted atomic charges from the CPM (Fig. 1b).

Fig. 1: Overview of the CIGNN model architecture.
Fig. 1: Overview of the CIGNN model architecture.
Full size image

a Graph representations used in the CIGNN model, GAB and GBA. Red represents nodes, and blue represents edges in each graph. b CIGNN model components: CPM and EFPM. Gray, green, red, and sky blue boxes represent the inputs, embedding layers, graph convolution layers, and outputs, respectively, in both CPM and EFPM. The yellow box represents the QEq layer in CPM. The light blue box represents the element scale layer in EFPM.

The CPM is composed of an embedding layer, a graph convolution layer, and a QEq layer. In the embedding layer, atom and bond features are embedded to capture latent representations that signify the hidden physical and chemical states of the atoms and bonds (Supplementary Fig. S1). In the graph convolution layer, with Nth iterations, both atom and bond features are iteratively updated (Supplementary Fig. S2). The iterative process allows the CPM to indirectly consider long-range interactions by continuously refining the features over multiple steps. In the QEq layer, the element-specific QEq method is used (Supplementary Fig. S4). The QEq layer predicts atomic charges (\(Q)\), using Bader charges as the target, by solving a linear system equation according to the QEq method:

$${A}_{{matrix}}Q=-\chi \,$$
(1)

where \({A}_{{matrix}}\) represents the hardness matrix, constructed using updated atom and bond features31. \(\chi\) represents the chemical potential, which is adjusted based on the element-specific electronegativity in the periodic table. When the \({{\rm{E}}}_{{\rm{field}}}\) is considered, \(\chi\) changes to \({\chi }_{{eff}}\)32.

$${\chi }_{{eff}}=\chi -{R}_{v}\cdot {E}_{{field}}\,$$
(2)

where \({\chi }_{{eff}}\) is the modified chemical potential vector that accounts for the influence of the \({E}_{{field}}\). \({R}_{v}\) represents the \(v\)th spatial component of the position vector \(R\) of each atom in the direction of the applied \({E}_{{field}}\). By substituting \({\chi }_{{eff}}\) into Eq. (1), the \(Q\) under the \({E}_{{field}}\) (\({Q}_{{field}}\)) can be predicted.

$${A}_{{matrix}}{Q}_{{field}}=-{\chi }_{{eff}}\,$$
(3)

The EFPM consists of an embedding layer, a graph convolution layer with a readout layer, and an element scale layer. In the embedding layer, similar to the CPM, atom, bond, and angle features are embedded to capture latent representations of these entities. In the graph convolution layer with Nth iterations, atom, bond, and angle features are iteratively updated (Supplementary Fig. S2). The atomic charges predicted by the CPM are incorporated into the updated atom and bond features at each iteration step. Through these update processes, a new representation of atom, bond, and angle features is presented, enhancing the EFPM’s ability to simulate atomic interactions effectively. During each feature update process, two distinct normalization methods are used (Supplementary Fig. S3): the newly developed crystal normalization (C-Norm) for preserving crystal-specific features and preventing data leakage between different crystals after aggregation of features, and the layer normalization (L-Norm) for scaling features before the feature update process33. These normalization techniques help stabilize and optimize the learning process. The effectiveness of using C-Norm with L-Norm in the feature update layer is verified by replacing C-Norm with other normalization methods and comparing their performance (Supplementary Table S1).

Using the updated atom features, we derive the atomic energies in the readout layer. Then, to optimize our energy predictions, the element scale layer adjusts the atomic energies by scaling and shifting them using species-specific parameters derived from the energy and force characteristics of each atomic species. This ensures that species-dependent variations are properly accounted for in the energy predictions. Finally, the atomic energies are aggregated to compute the system’s total energy, and forces are directly derived as the gradient of this total energy with respect to each atom’s positions. When the \({E}_{{field}}\) is applied, there is an additional force contribution (\(\Delta F\)) arising from the interaction between the \({E}_{{field}}\) and the \({Q}_{{field}}\).

$$\Delta F={Q}_{{field}}\cdot {E}_{{field}}$$
(4)

By adding \(\Delta F\), the forces under the \({E}_{{field}}\) (\({F}_{{field}}\)) can be predicted.

$${F}_{{field}}=F+\Delta F$$
(5)

where \(F\) is the force without the \({E}_{{field}}\).

With this CPM and EFPM, the CIGNN model demonstrates competitive performance in predicting energy and forces compared to other MLP models (Supplementary Table S2). Importantly, the CIGNN model offers the added advantage of predicting atomic charges and outperforms other MLP models capable of charge prediction. This underscores the unique capability of the CIGNN model to accurately predict atomic charge, energy and force. Moreover, its charge prediction capability based on the QEq method, enables the CIGNN model to predict atomic charge, energy, and force under \({E}_{{field}}\) without requiring a dedicated \({E}_{{field}}\) dependent training process. This demonstrates the model’s versatility and advanced predictive power.

Development of CNMP

To develop the CNMP, an MLP specifically designed for a-HfOx systems, we construct a dataset through AIMD calculations and Bader charge analysis. This dataset includes structure, charge, energy, and force information for a-HfOx systems. A detailed description of the dataset is provided in the Methods section. Using this dataset, we train the CIGNN model and develop the CNMP. The CNMP achieves a validation MAE of 7.7 me for charge, 1.9 meV/atom for energy, and 78 meV/Å for force predictions. To verify that these low MAE values are not caused by overfitting, we conduct predictive analyses on 3000 new a-HfOx samples not included in the application dataset. As shown in Fig. 2, the CNMP achieves an MAE of 2.8 meV/atom for energy predictions with an R² value of 0.96 when compared to AIMD results. For force predictions, it achieves an MAE of 83 meV/Å and an R² value of 0.99. For atomic charge estimations, it demonstrates an MAE of 17 me for Hf and 7 me for O, with R² values of 0.95 for Hf and 0.92 for O. These results indicate that CNMP achieves quantum mechanical accuracy on the new samples, confirming its generalization ability across diverse configurations on a-HfOx systems. Additionally, we assess the CNMP’s capability in predicting local atomic structures and diffusion dynamics compared to AIMD simulations. Both simulations are conducted under identical conditions: 10 ps at 1100 K within the NVT ensemble, considering a-HfOx structures with zero, single, and double oxygen vacancies. Each structure has stoichiometries of Hf32O64, Hf32O63, and Hf32O62, respectively. We then calculate the total radial distribution function (RDF) and oxygen ion mean square displacements (MSD). For RDF calculations of both O-O and Hf-O pairs, the CNMP results closely match those of AIMD, with an value greater than 0.99. Figure 3a shows the RDF of Hf32O64 (For the RDF of Hf32O63 and Hf32O62, refer to Supplementary Fig. S6). The first-peak position for Hf-O pairs is 2.1 Å, with RDF value of 2.77 for AIMD and 2.81 for CNMP. For O-O pairs, the first-peak position is 2.75 Å, with RDF values of 1.85 and 1.83 for AIMD and CNMP, respectively. These results confirm that CNMP accurately captures the local atomic structures in a-HfOx. In the MSD calculations, both CNMP and AIMD show the same tendency: as the oxygen vacancy increases from Hf32O64 to Hf32O62, the structures exhibit higher oxygen ion diffusivity (Fig. 3b). For AIMD, the diffusivity increases from 6.68 × 10⁸ cm²/s, to 1.57 × 10⁷ cm²/s, and 2.10 × 10⁷ cm²/s for Hf32O64, Hf32O63, and Hf32O62, respectively. Similarly, in the case of CNMP, the diffusivity increases from 6.06 × 10⁸ cm²/s, to 1.49 × 10⁷ cm²/s, and 1.86 × 10⁷ cm²/s for Hf32O64, Hf32O63, and Hf32O62,

Fig. 2: Comparison between CNMP predictions and AIMD values.
Fig. 2: Comparison between CNMP predictions and AIMD values.
Full size image

Plots for (a) energy, (b) force, (c) Hf atomic charge, and (d) O atomic charge. The x-axis represents AIMD values, and the y-axis represents CNMP predictions. R² values are indicated at the top of each panel.

Fig. 3: RDF and MSD comparison between CNMP and AIMD.
Fig. 3: RDF and MSD comparison between CNMP and AIMD.
Full size image

a RDF of Hf32O64 structures. The red line represents the Hf-O pair and the blue line represents the O-O pair. Solid lines indicate CNMP results, and dashed lines indicate AIMD results. b Linear slope of MSD for Hf32O64, Hf32O63, and Hf32O62. Yellow, blue, and red lines represent the MSD results for each structure, respectively. Solid lines indicate CNMP results, and dashed lines indicate AIMD results. Shaded areas show the standard deviation of diffusivity in CNMP.

respectively. These results demonstrate that CNMP successfully replicates the diffusion behavior of oxygen ions, validating its capability in capturing atomic dynamics in a-HfOx systems.

Additionally, to verify that the CNMP can capture the atomic structures and diffusion dynamics for other defect configurations not considered in the training set, we calculate the RDF and MSD for a-HfOx structures with zero (Hf32O64) and single (Hf31O64) hafnium vacancies. These simulations are conducted for 10 ps at 300 K within the NVT ensemble. For the RDF calculations of both O-O and Hf-O pairs, the CNMP results closely match those of AIMD, with an \({R}^{2}\) value greater than 0.99. In the MSD calculations, both CNMP and AIMD show the same tendency: as the number of hafnium vacancies increases from zero to one, the structures exhibit higher oxygen ion diffusivity (For the detailed RDF and MSD results, refer to Supplementary Fig. S7).

Evaluating the performance of CNMP

To extend the utility of CNMP, we verify whether it can accurately represent the diffusivity of oxygen ions under various conditions. We first perform CNMP-MD simulations at different temperatures on a-HfOx (Hf32O63) structures and compare the results with previous studies. Simulations are conducted at 300 K, 600 K, 900 K, 1100 K, and 1500 K, each lasting 80 ps. The oxygen ion diffusivity gradually increases from \(1.37\times {10}^{-12}\) cm²/s to \({1.16\times 10}^{-6}\) cm²/s as the temperature rises from 300 K to 1500 K, resulting in an Arrhenius slope of _2.1 K (Fig. 4a). A recent experimental study has reported that the diffusivity of m-HfOx, which exhibits similar short-range order to that of a-HfOx, increases from \(2.89\times {10}^{-9}\) cm²/s to \(1.22\times {10}^{-7}\) cm²/s as the temperature increases from 570 K to 966 K, with a Arrhenius slope of -2.26 K9,34,35,36,37,38. Additional experimental and computational studies have demonstrated similar trends in increasing oxygen ion diffusivity with increasing temperature, as shown in Fig. 4a. Therefore, these results demonstrate the consistency of CNMP with previous studies and confirm its validity and reliability34,39,40,41. It is important to note that the purpose of this comparison is to demonstrate that CNMP predictions exhibit a physically consistent Arrhenius trend, rather than to serve as a direct quantitative benchmark against previous studies performed under different simulation or experimental conditions. Next, we conduct CNMP-MD simulations on a-HfOx structures with varying densities and compare the results to experimental trends. Lee et al. reported that as the density of a-HfOx films decreases in the order of 9.22 g/cm³, 8.01 g/cm³, and 5.89 g/cm³, oxygen ion diffusivity increases, enhancing device performance42. We used Hf238O478, Hf207O415, and Hf152O307 structures with these respective densities. Five MD simulations were performed using the NVT ensemble, yielding median diffusivity values of 0.94 × 10⁶ cm²/s, 1.99 × 10⁶ cm /s, and 6.98 × 10⁶ cm²/s for the structures with densities of 9.22 g/cm³, 8.01 g/cm³, and 5.89 g/cm³, respectively (Fig. 4b). These trends align with experimental observation, confirming that CNMP accurately predicts the atomic behaviors and dynamics of a-HfOx systems under various conditions.

Fig. 4: Oxygen ion diffusivity trends under various temperatures and densities.
Fig. 4: Oxygen ion diffusivity trends under various temperatures and densities.
Full size image

a Temperature dependence of oxygen ion diffusivity for m-HfO2 and a-HfOx systems. The red stars represent the CNMP results for a-HfOx. Squares and circles in the legend represent data points from previous studies on m-HfO2 and a-HfOx, respectively. The thick, semi-transparent red line shows the best fit to the CNMP results, illustrating the temperature dependency. b Oxygen ion diffusivity for a-HfOx systems with varying densities. Each density is denoted on the x-axis along with its corresponding stoichiometry. The median is represented by the inner line of each box plot, with 25th and 75th percentiles as the lower and upper edges, respectively. Outliers are denoted by dots.

Exploring the capability of our CNMP to describe practical scenarios, we focus on its application in RRAM. In RRAM applications, both theoretical and experimental studies suggest that the (soft) dielectric breakdown voltage, which corresponds to the forming voltage, decreases as the oxygen vacancy concentration (nvacancy) increases. This trend has been observed in m-HfOx systems, which exhibit similar short-range order to a-HfOx systems9,35,36,37,38. For example, Schmidt et al. reported that the m-HfO2 sample exhibited the breakdown voltage of 7.0 V, while the HfO1.7 sample showed a reduced breakdown voltage of 1.9 V38.

To evaluate whether CNMP can capture the dielectric breakdown trend, we perform CNMP-MD simulations under varying \({E}_{{field}}\) conditions. Since the CNMP-MD simulation assumes a linear response to a homogeneous \({E}_{{field}}\), it is important to first quantify the valid range of this assumption. Therefore, we analyze the average atomic charge as a function of the \({E}_{{field}}\) to determine this linear window. We define the linear response limit, \({E}_{\mathrm{lim}}\), as the \({E}_{{field}}\) at which the average atomic charges begin to deviate from their initial linear trend. The CNMP-MD simulations are conducted on m-HfOx systems with a thickness of 15.8 Å. The systems have stoichiometries of HfO2 (Hf36O72), HfO1.9 (Hf36O68), HfO1.8 (Hf36O64), and HfO1.7 (Hf36O62), and nvacancy values of 0.0, 5.6, 11.1, and 13.9 cm−3, respectively. Figure 5 shows the average atomic charge for Hf and O as a function of \({E}_{{field}}\). It reveals that as nvacancy increases, \({E}_{\mathrm{lim}}\) decreases. For the less defective systems (HfO2.0 and HfO1.9), the systems deviate from linearity at 0.35 V/Å. For the highly defective systems, HfO1.8 and HfO1.7, the \({E}_{\mathrm{lim}}\) decreases to 0.3 V/Å and 0.275 V/Å, respectively. Additionally, in the linear response regime of each m-HfOx system, the average Hf charge increases linearly, and the average O charge decreases linearly with \({E}_{{field}}\). Then, at the same \({E}_{{field}}\), systems with lower nvacancy exhibit a larger charge response. This is physically consistent, as a higher concentration of polarizable O atoms allows for greater field-induced charge transfer.

Fig. 5: Average atomic charge and fdisorder as a function of the Efield for different m-HfOx systems.
Fig. 5: Average atomic charge and fdisorder as a function of the Efield for different m-HfOx systems.
Full size image

Average (a), Hf charge and (b), O charge plotted against the \({E}_{{field}}\). The solid lines are linear fits to the data within the linear response regime. The vertical dashed lines, color-coded according to the legend, represent the \({E}_{\mathrm{lim}}\) for each system. The lines for HfO1.9 and HfO2.0 overlap. (c), fdisorder plotted against the \({E}_{{field}}\). The vertical dashed lines represent the \({E}_{{bd}}\) for each system. The data points for HfO1.9 and HfO2.0 overlap.

With this linear range established, we then evaluated the dielectric breakdown trend. We define the fraction of disordered atoms (fdisorder) as the percentage of atoms with normalized bond orientational order (BOO) parameters below 0.5 (detailed description is provided in the “Methods” section)43. This concept can capture the behavior of disruption of atomic structure, so we have used the point where fdisorder increases from 0 as the starting point of dielectric breakdown. Thus, the dielectric breakdown field (\({E}_{{bd}}\)) is defined as the point where fdisorder becomes greater than zero. As shown in Fig. 5c, for the highly defective systems the \({E}_{{bd}}\) coincides with the \({E}_{\mathrm{lim}}\), at 0.3 V/Å for HfO1.8 and for 0.275 V/Å for HfO1.7. It means that for these systems, the onset of electronic non-linearity is tightly coupled with structural collapse. In contrast, for the more structurally stable systems (HfO2.0 and HfO1.9), the system remains stable even after \({E}_{{field}}\) reaches \({E}_{\mathrm{lim}}\) (Supplementary Fig. S10). This may be physically reasonable, since the structural stability of less defective systems may require additional forces to make the system physically unstable and lead to the dielectric breakdown. Thus, while both \({E}_{\mathrm{lim}}\) and \({E}_{{bd}}\) decrease as nvacancy increases, the relationship between \({E}_{\mathrm{lim}}\) and \({E}_{{bd}}\) is dependent on the defect concentration. In addition to this, the \({E}_{{bd}}\) determined in this study fall within a similar range as the experimentally observed breakdown voltages, from 7.0 V for HfO2 to 1.9 V for HfO1.740. These results are consistent with experimental observations and confirm that CNMP can effectively describe device-level phenomena under \({E}_{{field}}\) conditions.

Discussion

In this study, we develop a CIGNN model that integrates the QEq method within a GNN framework, utilizing both GAB and GBA. The integration of the QEq method enables the CIGNN model to account for the effects of \({E}_{{field}}\). Using the CIGNN model, we develop a CNMP optimized for a-HfOx systems. Compared to AIMD results, the CNMP accurately predicts atomic charge, energy, and force. It also successfully captures local atomic structures and replicates the diffusion behavior of oxygen ions. With comparison to previous studies, the CNMP coherently describes trends in oxygen ion diffusivity. In the context of RRAM applications under \({E}_{{field}}\) conditions, the CNMP reproduces the experimentally observed trend that the \({E}_{{bd}}\) decreases as oxygen vacancy concentration increases. Furthermore, the CNMP-MD simulation shows the relationship between \({E}_{\mathrm{lim}}\) and \({E}_{{bd}}\) depends on the structural stability associated with the nvacancy.

Accurately describing the behavior of complex systems is a critical step toward bridging atomic-level simulations with advanced electronic applications. This task becomes even more challenging under \({E}_{{field}}\) conditions due to the increased complexity of simulating atomic-scale phenomena. The CNMP offers several distinct advantages over existing MLP approaches that incorporate \({E}_{{field}}\), such as FieldSchNet, SCFNN, FIREANN, PNNP, and CACE-LR44,45,46,47,48,49,50,51. First, CNMP significantly reduces the computational cost of generating training datasets. While FieldSchNet, SCFNN, and FIREANN require training datasets generated under applied \({E}_{{field}}\), CNMP completely avoids this requirement by training exclusively on datasets obtained from AIMD simulations without \({E}_{{field}}\). Subsequent CNMP-MD simulations are then conducted under \({E}_{{field}}\) conditions, as described in Eqs. (2)–(5). This strategy eliminates the need for costly data generation under \({E}_{{field}}\) conditions, providing a more efficient path toward \({E}_{{field}}\)-capable MLP development. Although PNNP, is also trained on datasets obtained without \({E}_{{field}}\), it depends on the atomic polar tensor (APT) to characterize the system’s response to \({E}_{{field}}\). Computing APTs can be extremely demanding, particularly for systems with complex atomic structures such as a-HfOx. Similarly, the CACE-LR, which introduces the Latent Ewald Summation (LES) framework, has been shown to effectively describe not only long-range interactions but also the response to \({E}_{{field}}\). However, a key aspect of the LES formulation for incorporating an \({E}_{{field}}\) is its reliance on the high-frequency permittivity as an external parameter. In contrast, our QEq method uses Bader charge, which has been demonstrated to sufficiently capture \({E}_{{field}}\) induced charge redistribution, while drastically reducing the computational overhead associated with dataset preparation and eliminating the need for an external parameter52,53. Second, CIGNN model of the CNMP offers a physically coherent integration of the CPM and EFPM, which provides a distinct advantage over existing approaches. In the case of PNNP, the energy and force prediction model and the APT prediction model are trained separately and only coupled during PNNP-MD simulations. In contrast, CIGNN model integrates CPM and EFPM during the training phase, enabling CNMP to operate within a physically unified framework under \({E}_{{field}}\) conditions. Specifically, the \({Q}_{{field}}\), influenced by the applied \({E}_{{field}}\) as described in Eq. (3), are directly fed into the EFPM. This allows the model to consistently incorporate the indirect response, i.e., the internal electronic redistribution induced by the \({E}_{{field}}\). Additionally, the \(\Delta F\) in Eq. (4) is explicitly incorporated to account for the direct response. Therefore, CNMP enables simulation of both direct and indirect responses under \({E}_{{field}}\) conditions within a unified physical framework.

Nonetheless, the current implementation of CNMP exhibits several limitations that must be addressed to broaden its applicability. Specifically, the framework assumes a linear response regime and is currently restricted to simulations under spatially homogeneous \({E}_{{field}}\). These limitations hinder its ability to capture higher-order nonlinear effects that emerge under strong \({E}_{{field}}\) conditions, as well as to simulate \({E}_{{field}}\) gradient driven phenomena. A promising solution to both challenges involves extending the QEq method, currently based on Bader charge, to incorporate Born effective charge tensors. This extension could enable the CNMP to capture higher-order electronic responses and nonlinear effects. Furthermore, as discussed in the PNNP, \({E}_{{field}}\) gradients interact with quadrupole moments, which can be computed from Born effective charge tensors. Including quadrupolar terms would thus allow CNMP to capture \({E}_{{field}}\) gradients driven effects. However, the practical implementation of this approach is nontrivial. The computation of Born effective charges and construction of a corresponding training dataset remains computationally expensive. While we are actively exploring efficient data-generation strategies to mitigate this cost, a robust and scalable solution has yet to be fully realized. Additionally, CNMP does not explicitly account for local field enhancement, where the \({E}_{{field}}\) can be significantly intensified in the vicinity of defects. Overcoming these challenges remains a key direction for future development.

Despite these limitations, our study demonstrates that the CNMP can effectively describe the behavior of complex a-HfOx systems under \({E}_{{field}}\) conditions. Given its capabilities, CNMP has the potential to become a foundational tool in materials science, enhancing our understanding of field-induced phenomena and driving innovation in electronic applications. By closely aligning simulation predictions with experimental results, the CNMP paves the way for optimizing material properties and designing next-generation electronic devices.

Methods

Datasets

The datasets used in this study were generated through DFT and AIMD simulations without \({E}_{{field}}\). Initially, to generate a-HfOx structures m-HfOx structures were used as the initial configurations. For m-HfOx with double oxygen vacancies, initial structures with both clustered vacancies and vacancies separated the furthest apart were considered. For m-HfOx with triple oxygen vacancies, initial structures with all vacancies close together, vacancies clustered in pairs with the third vacancy separated, and all three vacancies furthest apart were considered to include a variety of interactions between multiple vacancies. Each of these m-HfOx structures was optimized using DFT calculations. Using these optimized m-HfOx structures, a-HfOx structures were formed by the melt-quenching method under the NVT ensemble of AIMD simulations. This process involved melting m-HfOx structures at 5000 K for 5 ps, followed by gradual quenching from 5000 K to 300 K over more than 5 ps.

To construct the dataset, additional AIMD simulations were performed on the a-HfOx structures. A total of 555,674 snapshots were collected from these AIMD trajectories. From these snapshots, 55,567 data points, including structure, energy, and force information, were extracted at uniform 10 fs intervals, corresponding to a uniform-in-time selection strategy. Atomic charge data were obtained by performing Bader charge analysis on the extracted data points. It is important to note that all data points were considered only for the neutral systems.

As a result, datasets containing structure, energy, force, and atomic charge data were constructed.

Hyperparameters of CIGNN model

Both CMP and EFPM of the CIGNN model were implemented using PyTorch54 and trained on a single NVIDIA A100 GPU. The models were optimized using the Adam optimizer with an initial learning rate of 0.001, momentum of 0.9, and no weight decay55. The learning rate was adjusted using the ReduceLROnPlateau scheduler, triggered based on the validation loss, with a patience of 10 epochs, a threshold of 0.00001 in relative mode, a cooldown period of 10 epochs, a minimum learning rate of 1.0 × 10⁷, and an epsilon of 1e-8. The root mean squared error (RMSE) loss was used, and the best model was selected based on the lowest validation mean absolute error (MAE). Detailed parameters used in this study are available at https://github.com/CNMD-POSTECH/CIGNN.

CPM parameters

Embedding dimensions were set at 64 for both atoms and bonds. Radial features were generated using 5 Bessel basis functions with a polynomial envelope (p = 5)56. The convolution layer iterated 3 times. Training spanned 300 epochs with a batch size of 8, incorporating an exponential moving average (EMA) for evaluation. The charge loss function is defined as:

$${L}_{Q,{RMSE}}=\sqrt{\frac{1}{B\cdot N}\mathop{\sum }\limits_{i=1}^{B\cdot N}{\left({Q}_{{pred},i}-{Q}_{{target},i}\right)}^{2}}$$
(6)

where\(\,{L}_{Q,{RMSE}}\) represents charge loss. \(B\) is the batch size, \(N\) is the number of atoms in a batch, \(i\) represents an atom. The total loss is calculated as:

$${L}_{{total}}={\lambda }_{Q}\cdot {L}_{Q,{RMSE}}$$
(7)

With \({\lambda }_{Q}\) representing the charge weight loss set to 100.

EFPM parameters

Embedding dimensions were set at 64 for atoms, 64 for bonds, and 96 for angles. Radial features were generated using 5 Bessel basis functions with a polynomial envelope (p = 5), and angular features were created using 7 spherical harmonics56. The convolution layer, followed by readout layers, iterated 3 times. Training spanned 1000 epochs with a batch size of 8, incorporating an exponential moving average (EMA) for evaluation. The energy and force loss functions are defined as:

$${L}_{E,{RMSE}}=\sqrt{\frac{1}{B}\mathop{\sum }\limits_{b}^{B}{\left({E}_{{pred},b}-{E}_{{target},b}\right)}^{2}}$$
(8)
$${L}_{F,{RMSE}}=\sqrt{\frac{1}{B\cdot N\cdot 3}\mathop{\sum }\limits_{i=1}^{B\cdot N}\mathop{\sum }\limits_{v=1}^{3}{\left({F}_{{pred},i,v}-{F}_{{target},i,v}\right)}^{2}}$$
(9)

Here, \({L}_{E,{RMSE}}\) represents energy loss, and\(\,{L}_{F,{RMSE}}\) represents force loss. In these equations, \(B\) is the batch size, \(N\) is the number of atoms in a batch, \(v\) represents the direction of force component, \(i\) represents an atom. The total loss is calculated as:

$${L}_{{total}}={{\lambda }_{E}\cdot {L}_{E,{RMSE}}+\,{\lambda }_{F}\cdot L}_{F,{RMSE}}$$
(10)

with \({\lambda }_{E}\) and \({\lambda }_{F}\) representing the energy weight losses set to 40 and 100, respectively, to balance the contribution of energy and force terms in the loss function.

CNMP-MD simulations

CNMP-MD simulations were conducted using the large-scale atomic/molecular massively parallel simulator (LAMMPS) with the compiled CNMP57. Simulations of a-HfOx were performed in the NVT ensemble using a Nosé-Hoover thermostat58,59. For CNMP-MD simulations with \({E}_{{field}}\), the m-HfOx structures were minimized under the \({E}_{{field}}\) condition along the z-axis. The criteria for minimization were set to 1.0 × 10⁸ for energy (eV/atom) and 1.0 × 10−9 for force (eV/Å), with a maximum of 5000 iterations, using the fire minimization algorithm60.

Local amorphous environments

The fdisorder was quantified using bond orientational order (BOO) parameters, specifically \({Q}_{4}\) and \({Q}_{6}\). These parameters describe the local atomic environments based on spherical harmonics \({Y}_{{lm}}\), which were computed for each atom61,62,63. The calculated \({Q}_{4}\) and \({Q}_{6}\) values were normalized (normalized BOO) against the reference values for an ideal face-centered cubic (fcc) structure, using the following equation:

$$\frac{\sqrt{{Q}_{4}^{2}+{Q}_{6}^{2}}}{\sqrt{{Q}_{4,{fcc}}^{2}+{Q}_{6,{fcc}}^{2}}}$$
(11)

Atoms with normalized BOO values greater than or equal to 0.5 were classified as part of the crystalline phase, while those with value below 0.5 were classified as part of the amorphous phase. This method provided a clear distinction between crystalline and amorphous regions, enabling the tracking of the evolution of local amorphous environments. To quantify the fdisorder, we identified atoms with BOO values below 0.5 and calculated their percentage within the system43.

Density functional theory (DFT)

First-principles density functional theory (DFT) calculations with Bader charge analysis were performed to construct datasets for a-HfOx systems52. Vienna Ab-initio Simulation Package (VASP) code was used to implement DFT calculations64,65. The Perdew-Burke-Ernzerhof (PBE) generalized gradient approximation (GGA) with the projector augmented wave (PAW) method for the exchange and correlation potentials was used66,67,68. The Monkhorst-Pack k-point sampling of a 1 × 1 × 1 grid with a 500 eV energy cutoff was applied69. The convergence criteria for force and energy for the fully relaxed structures were established at 0.01 eV/Å and 10−6 eV, respectively.