Predicting symmetric structures of large crystals with GPU-based Ising machines

Liang, Chen; Das, Diptesh; Guo, Jiang; Tamura, Ryo; Mao, Zetian; Tsuda, Koji

doi:10.1038/s42005-025-02380-y

Download PDF

Article
Open access
Published: 26 November 2025

Predicting symmetric structures of large crystals with GPU-based Ising machines

Communications Physics volume 8, Article number: 477 (2025) Cite this article

2865 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Solving black-box optimization problems with Ising machines is increasingly common in materials science. However, their application to crystal structure prediction (CSP) is still ineffective due to symmetry agnostic encoding of atomic coordinates. We introduce CRYSIM, an algorithm that encodes the space group, the Wyckoff positions combination, and coordinates of independent atomic sites as separate variables. This encoding reduces the search space substantially by exploiting the symmetry in space groups. When CRYSIM is interfaced to Fixstars Amplify, a GPU-based Ising machine, its prediction performance is competitive with CALYPSO and Bayesian optimization for crystals containing more than 150 atoms in a unit cell. Although it is not realistic to interface CRYSIM to current small-scale quantum devices, it has the potential to become the standard CSP algorithm in the coming quantum age.

Efficient crystal structure prediction based on the symmetry principle

Article 27 February 2025

Pressure-induced superconductivity in the nonsymmorphic topological insulator KHgAs

Article Open access 06 October 2023

Deep Bayesian local crystallography

Article Open access 10 November 2021

Introduction

The advancement of various significant technology fields relies on discovery of innovative materials with desired chemical or physical properties¹, and obtaining correct structures of materials, arrangement of atoms in the unit cell, is the prerequisite. To achieve the goal, crystal structure prediction (CSP)^2,3, in which the most stable crystal structure is inferred only from its chemical composition, has been widely adopted. The vast configuration space and the richness of local minima on potential energy surfaces (PESs) renders CSP a challenging task⁴. Optimization algorithms, such as genetic algorithms^5,6,7,8,9, particle-swarm optimization (PSO)^10,11, Bayesian optimization (BO)^12,13,14, are proposed and successfully applied in practice. Typically, they create roughly-shaped initial structures and the final optimization is done by a geometric relaxation software either based on first-principles calculation or pretrained universal neural network potentials (NNPs). Nevertheless, these methods generally require a great number of iterations. In recent years, deep learning-based crystal generative models^{15,16,17,18,19,20,21} are developing fast, but they might find problems in extrapolation outside their training datasets. Therefore, as an example, due to the scarcity of corresponding data, CSP on 2D materials^22,23 and nanoclusters^24,25,26 generally relies on optimization methods. Besides, in both categories, most of the methods work well for crystals containing less than 60 atoms¹⁸ in a unit cell, but are still not ideal for larger crystals. For example, the training data of CDVAE¹⁵ and MatterGen²¹ does not include large crystals with more than 20 atoms in a unit cell. GNoME¹⁶ successfully explores larger ones approximating 100 atoms, but considerable computational cost is required.

Ising machines^27,28 are hardware-assisted discrete optimizers that solve a quadratic unconstrained binary optimization (QUBO) problem,

$${{{{\bf{x}}}}}{\ast }={{{{\rm{argmin}}}}}_{{{{\bf{x}}}}\in {\{0,1\}}^{M}}\left({\sum }_{i=1}^{M}{h}_{i}{x}_{i}+{\sum }_{i=1}^{M}{\sum }_{j=i+1}^{M}{J}_{ij}{x}_{i}{x}_{j}\right),$$

(1)

where x is an M-dimensional bit vector and h_i and J_ij are real-valued parameters. CSP can be represented as a QUBO problem, either by simplifying the energy function^29,30,31 or the use of a surrogate machine learning model^32,33. Among the existing studies, Gusev et al.²⁹, Ichikawa et al.³⁰, and Xu et al.³² provided QUBO formulations of CSP under a fixed space group. Couzinié et al.³¹ and Couzinié et al.³³ disregarded the space group and employed a grid-based representation of atomic coordinates in their QUBO formulation. Notably, they lack a feature of dynamically adjusting the space group, which is common in state-of-the-art CSP algorithms such as CALYPSO^10,11, USPEX^7,9,34, and CRYSPY^13,14. Furthermore, these algorithms have not been tested on complex problems, mainly due to the scale restriction of D-Wave³⁵ quantum annealer.

In this work, we develop a method named CRYSIM (CRYstal structure prediction with Symmetry-encoded Ising Machine). Our bit vector represents the lattice parameters, the symmetry information including the crystal system (CS), the space group (SG) and the Wyckoff positions combination (WPC), and coordinates of independent sites. This bit vector is translated to a crystal structure, and M3GNet³⁶ provides its potential energy. Our goal is to find the optimal bit vector that gives the lowest potential energy. To enable the search with an Ising machine, a 2-order factorization machine (FM)^37,38,39,40 is trained with available pairs of bit vectors and corresponding energies with an active learning workflow⁴¹. Since the prediction function is quadratic, the optimal bit vector that minimizes the FM-approximated potential energy can be found with an Ising machine. It does not always coincide with the real optimal solution, but one can expect that the error decreases as the amount of training data increases during the search process. Our information-rich bit vector inevitably inhibits the use of D-Wave quantum annealers. Instead, we employ a GPU-based Ising machine, Fixstars Amplify⁴², to solve a problem with over several thousand bits. It is based on simulated annealing and uses multi-level parallel processing on multiple GPUs to find optimal solutions. Fixstars Amplify relies on conventional semiconductor technologies, but can handle large-scale problems up to 130,000 bits with full connectivity. It has been employed in molecular generation⁴³, materials design^44,45,46, and various engineering fields^47,48.

Our method outperforms BO¹² and CALYPSO^10,11 on three small crystals as well as large ones containing more than 150 atoms in unit cells. Specifically, CRYSIM successfully generates the ground truth Ca₂₄Al₁₆(SiO₄)₂₄ structure, containing 160 atoms in the unit cell, within 300 relaxations in 4 out of the total 5 trials. In this work, GPUs are adopted, but CRYSIM can leverage any Ising machines, including rapidly developing quantum devices.

Results

Bit vector encoding

The binary representation in CRYSIM consists of the following three parts: lattice parameters, symmetry information, and 3D coordinates of independent sites. In the first part, the six-dimensional lattice parameters are individually discretized and summarized into a bit vector with one-hot encoding. The second part includes a crystal system (CS), a space group (SG), a group of Wyckoff position combinations (WPCs). The sizes of each vector segment depend on the set of all possible space groups compatible with the given chemical composition, which is determined by whether there exists at least one WPC for achieving symmetry of the SG. Similarly, only compatible CSs are included in the embeddings. Accordingly, if m crystal systems are involved, each of which has s₁, …, s_m compatible space groups, the CS part has m bits to represent the CS and the SG part has ${\max }_{i=1,\ldots ,m}{s}_{i}$ bits to represent the SG. If the crystal structure has the i-th CS and j-th SG, the corresponding bits are set as 1, and the remaining are 0s.

Given an SG, we engineered the WPC generator in GN-OA package⁴⁹ to compute the list of plausible WPCs according to the input chemical composition^50,51. In the process, WPCs for each element species are calculated recursively based on its frequency in the unit cell. They are sorted in a descending order according to the maximum multiplicity of involved WPs, so that more plausible combinations are prioritized, considering the fact that most crystals in nature tend to occupy WPCs with general Wyckoff positions^34,52. Then, the Cartesian product among the WPCs of elements is conducted to derive WPCs for the whole system, which are appended in the list until the maximum number, 30,000, is reached. Finally, they are divided into 300 groups of size 100, which is encoded in the WPC segment with a 300-dimensional one-hot vector for specifying the group.

The third part consists of k copies of a g³-dimensional bit vector in order to represent a crystal containing k element species, in which g denotes lattice discretization resolution (LDR). A 3D g × g × g grid is assumed within the unit cell. If an independent site of the atom species exists near a grid point, the corresponding bit is set to 1. In decoding, 100 structures are generated corresponding to all WPCs in the specified WPC group. Among them, the one with the largest minimum interatomic distance (MID) is selected to increase the possibility of deriving stable states⁴. Details of the encoding and decoding procedures are provided in the Method. Besides, Supplementary Note 1 and 2 present a detailed explanation about WPCs generation and application.

CRYSIM Workflow

The workflow of CRYSIM is depicted in Fig. 1. First, 1000 initial structures are obtained by random generation (RG) developed in this work (see Method for details) with the given chemical composition, and converted to bit vectors x_l. Their potential energies y_l are estimated using M3GNet without structure relaxation, leading to a training dataset described as the pairs of bit vectors and energies, i.e., D = {(x_l, y_l)∣l = 1, 2, …, 1000}. The dataset is then used to train an FM model ^37,38, simulating the M3GNet-estimated potential energy surface of the materials family. The functional form of FM is described as

$$y=b+{\sum }_{i=1}^{M}{h}_{i}{x}_{i}+\mathop{\sum }_{i=1}^{M}{\sum }_{j=i+1}^{M}{\sum }_{k = 1}^{K}{w}_{ki}{w}_{kj}{x}_{i}{x}_{j},$$

(2)

where b, h_i and w_ki are real-valued parameters, and x_i is the i-th element of a vector x. It is similar to QUBO, but the weight matrix of quadratic terms is a low-rank matrix parameterized by w_ki and w_kj. Then, Fixstars Amplify⁴² is used to optimize the bit vector to minimize the energy approximated by FM. PyTorch⁵³ is leveraged to implement FM and the training workflow, with detailed training settings and hyperparameters of Amplify provided in Supplementary Note 3.

Fig. 1: The workflow of CRYSIM that contains T iterations, using Si4O8 as an illustration. — **Fig. 1: The workflow of CRYSIM that contains T iterations, using Si₄O₈ as an illustration.**

Based on the solution, 100 structures with the same SG but different WPCs from the solved WPCs group are translated back to crystal structures, and the one with the largest MID is selected and relaxed using M3GNet. After relaxation, we sample 30 structure frames from the relaxation trajectory. Among the samples, if a structure has an MID lower than 0.5 Å but is still assigned a negative energy, the energy is adjusted to a high positive value to mitigate negative impact due to inaccuracy of NNP. Then the data points are added to D and FM is retrained. The above procedure is repeated T = 300 times and the most stable structure is recorded as the final result.

Recovering benchmark materials

We begin with relatively simple benchmark crystal tasks to demonstrate CRYSIM’s ability to address CSP. Wei et al.⁵⁴ developed a series of quantity measurements for evaluating CSP algorithms, and selected five crystals, including ScBe₅, Ca₄S₄, Ba₃Na₃Bi₃, Li₄Zr₄O₈ and Li₃Ti₃Se₆O₃, as examples to conduct tests. Ground states of all compounds are determined based on the Materials Project (MP) database⁵⁵, i.e., mp-11277, mp-1672, mp-31235, mp-755253 and mp-1211008, respectively. Classical algorithms considered for comparison include CALYPSO^10,11 and simple BO that directly optimizes lattice parameters, fractional coordinates, the SG number and the WPC index of crystals, implemented based on the hyperopt package⁵⁶ following the scheme of GN-OA⁴⁹, denoted as “Crystal param. + Hyperopt BO”. All methods are limited to perform 300 times of structure relaxation during one run to make a fair comparison. Accordingly, CALYPSO is leveraged for 30 generations, with the population size per iteration set as 10. Since the number of atoms are not optimized in CRYSIM, we treat the structure indicated by the input chemical formula as conventional cells, i.e., symmetry is calculated within the given lattice, instead of a combination of several cells. Accordingly, the NumberOfFormula parameter of CALYPSO, representing the smallest and largest times of the input chemical formula of considered structures in one optimization process, is set as “1 1” in all tests. Besides, in all experiments in this article, structures containing interatomic distances smaller than 1.0 Å are excluded from the statistics, to ensure that all crystals remain physically valid. Tests of each method repeat three times with different seeds. Hyperparameters of classical algorithms are reported in Supplementary Note 4.

We use StructureMatcher function from the pymatgen package⁵⁷ to determine structural similarity between predicted and known ground truth materials. The function can compute minimum average pair-wise displacement between two corresponding atoms in two configurations among all permutations. The predicted structure successfully matches the ground truth as long as the displacement is computable, which suggests that StructureMatcher is able to distinguish corresponding atoms between them. Details of criterion of matching is provided in Method section.

When assessing results, we only include structures reaching the lowest energy (${E}_{\min }$) among all obtained ones in 300 iterations, i.e., $\{E| E={E}_{\min }\}$, and compare them against the ground truth. Several major metrics are defined for this task: (1) I_M,0 denotes the first iteration at which the ground truth is identified; (2) N_E denotes the number of iterations reaching the lowest energy, i.e., ${N}_{E}=| \{E| E={E}_{\min }\}|$; (3) N_M denotes the number of successfully matched ones among all considered structures. We provide a further illustration on evaluation of CSP algorithms in Discussion section. Figure 2 summarizes results of two representative crystals, and comprehensive information is presented in Supplementary Tables 1–5. Other metrics, such as displacement calculated by StructureMatcher of the structure in iteration I_M,0, denoted as D_M,0, as well as the minimum displacement ${D}_{M,\min }$ and corresponding iteration ${I}_{M,\min }$, are also reported. Predicted configurations with the lowest estimated relaxed energies in the three trials are shown in Fig. 3.

**Fig. 2: Optimization performance comparison between CRYSIM and two classical crystal structure prediction (CSP) algorithms, CALYPSO and Bayesian optimization (BO), on two benchmark materials.**

Fig. 3: Comparison between the most stable benchmark configurations discovered by three crystal structure prediction (CSP) algorithms, CALYPSO, Bayesian optimization (BO) and CRYSIM, and the ground truth structure recorded in Materials Project (MP).

Ground states of ScBe₅ and Ca₄S₄ can be readily discovered by all three methods, but CRYSIM generates significantly more stable states than the two classical methods. Besides, smaller I_M,0 of CRYSIM optimizers indicate that FM can quickly and effectively characterize the PES by learning from initial datasets. For the more complicated Ba₃Na₃Bi₃ system, CRYSIM is the only method that successfully discovers the stable state with correct estimated energies.

All methods fail on Li₄Zr₄O₈ and Li₃Ti₃Se₆O₃ structure prediction if only the crystals of ${E}_{\min }$ are counted, which, however, reach an even lower estimated relaxed energy than the ground states. This may be attributed to two main reasons. First, the selected benchmark structures may not represent the ground states of corresponding chemical compositions, suggested by positive energies above hull. Especially, Li₃Ti₃Se₆O₃ (mp-1211008) exhibits a substantial energy above hull, 0.617 eV/atom according to MP, indicating a potential to transform into alternative phases predicted by the optimization methods. Second, the pretrained NNP leveraged in this study is not sufficiently accurate, and a 0.01 eV variation in potential energy can affect the behavior of optimization algorithms. As an instance, we further introduce predicting results on Li₈Zr₄O₁₂ (mp-4156), a stable structure of Li-Zr-O family that has been observed in experiments, in Supplementary Fig. 1 and Supplementary Table 6. In all configurations with the lowest energies, hexa-atomic rings absent in mp-4156 can be found, which may suggest a systematic energy underestimation on near-equilibrium structures, as is discussed in other related works⁵⁸. On the other hand, CRYSIM is a flexible tool, which can be incorporated with any energy estimators. For small systems, first-principles calculation software^59,60 is available. We also expect that the continuous improvement of NNP technology can mitigate the problem.

Large crystal structures prediction

This section introduces experiment results on three large material systems, including Y₆Co₅₁, Ca₂₄Al₁₆(SiO₄)₂₄ and (SiO₂)₉₆, to demonstrate capability of CRYSIM. The crystal Y₂Co₁₇ has been a classical benchmark for assessing CSP algorithms^12,61,62, but the two stable structures in this material family recorded in MP can only be achieved with unit cells Y₄Co₃₄ (mp-570718) and Y₆Co₅₁ (mp-1106140). Since the number of atoms in unit cells are not optimized in CRYSIM, we start directly with Y₆Co₅₁. Ca₂₄Al₁₆(SiO₄)₂₄ (mp-6008)^29,63 and (SiO₂)₃₂⁶³ have also been discussed in previous works as examples of CSP on large crystals. Here, (SiO₂)₉₆ (mp-1200292) is chosen since it is the largest SiO₂ crystal in MP that has been observed in experiments.

Apart from CALYPSO and BO introduced earlier, simple RG, which is employed to generate initial training set for CRYSIM, and PyXtal⁵²-based RG in CRYSPY¹³, denoted as “CRYSPY RG”, are additionally included as baseline CSP methods. 300 times of structure relaxation, i.e., 300 iterations, are conducted in one run, and tests of each method repeat five times with different seeds. Fig. 4a-c presents averaged accumulated lowest energies of crystals during generation in the 300 cycles. Optimal materials found by each algorithm are visualized in Fig. 4d. Corresponding data is summarized in Supplementary Table 7, as well as Supplementary Tables 8–10 for metrics defined in the last section.

**Fig. 4: Optimization performance comparison between CRYSIM and classical crystal structure prediction (CSP) algorithms on large crystals.**

On Y₆Co₅₁, BO is the only method that discovers the ground state with a −7.127 eV/atom energy, the same as corresponding relaxed energy of mp-1106140. However, computational complexity of BO scales with the total number of atoms^64,65,66, leading to a significantly reduced performance on Ca₂₄Al₁₆(SiO₄)₂₄ and (SiO₂)₉₆, even falling below CRYSPY RG. CALYPSO exhibits a higher stability than BO for large systems, and the implementation of pair-wise distance consideration in input renders it unaffected when screening out configurations with small MIDs, as is shown in Supplementary Table 11. However, this feature accelerates the optimization of energies only in the first tens of iterations in Fig. 4b-c, and then the algorithm is surpassed by CRYSIM methods.

On the other hand, length of CRYSIM embeddings is determined solely by the number of elements given the LDR, making it advantageous over other algorithms especially on large crystals. On the Ca₂₄Al₁₆(SiO₄)₂₄ system, which contains 160 atoms in the unit cell, CRYSIM successfully finds the ground truth structure in four out of five trials. For (SiO₂)₉₆, CRYSIM identifies a configuration with a relaxed energy (−7.890 eV/atom) close to the stable one (−7.891 eV/atom), significantly lower than the ones found by other methods. CSP for large crystals has been a long-standing challenging task. Energy distribution of the configuration space tends to concentrate on unstable states as the system size grows, which means that the difficulty of finding the ground state via RG exponentially increases^63,67,68. Though CRYSIM does not outperform in all systems, the superiority on large crystals establishes it as a promising approach for CSP.

Effects of Processing Techniques

Lattice Discretization Resolution

When converting 3D structures into binary vectors, a higher LDR can reduce information loss, nevertheless, leading to exponentially increasing solving difficulty. We investigate the influence of LDR on CRYSIM optimization performance by testing g ∈ {5, 7, 9, 12, 15} across the three large crystals considered in this study, namely Y₆Co₅₁, Ca₂₄Al₁₆(SiO₄)₂₄ and (SiO₂)₉₆. Table 1 summarizes lowest energies with corresponding average accumulated energy curves recorded in Supplementary Fig. 2, where CRYSIM with different LDRs are denoted as CRYSIM-g, such as “CRYSIM-5” for g = 5. Each value is the mean of three trials with different random seeds.

Table 1 Influence of lattice discretization resolutions (LDRs) and minimum interatomic distance-related (MID-related) procedures on optimization performance

Full size table

Y₆Co₅₁ and Ca₂₄Al₁₆(SiO₄)₂₄ achieve the best CSP results at LDR 12, while (SiO₂)₉₆ performs the best at 15. This phenomenon stems from the implementation of CRYSIM, where each atom species is encoded using a g × g × g grid of bits, with 0 or 1 indicating the presence of an atom at each discretized unit. Accordingly, a higher LDR is advantageous when the number of atoms per element increases, rather than the total number of atoms. An LDR of 12 appears to strike a balance between representability and optimization difficulty for Y₆Co₅₁ and Ca₂₄Al₁₆(SiO₄)₂₄, where the maximum number of atoms of a specific element is 51 and 96, respectively. In contrast, (SiO₂)₉₆, which contains 192 oxygen atoms, may require a higher LDR to better capture interatomic spatial relationship in CRYSIM. According to the results, as a guidance on selecting LDRs for other tests, it is recommended to use a 15 LDR when the maximum frequency of an element species in the unit cell apparently exceeds 100, otherwise 12. Numbers of bits for representing each parameter for the three systems are further reported in Supplementary Table 12. We also provide time costs in the Ising solving process by Amplify and spatial resolutions of each block for different LDR in Supplementary Table 13 and Supplementary Table 14, respectively. We refer the readers to the Method section for details of calculation.

Factorization machine

In this work, FM is employed as the regressor in CRYSIM to build Ising objective functions, and here the fitting accuracy is investigated. Supplementary Fig. 3a–e shows distributions of predicted versus calculated energies of one of the initial Ca₂₄Al₁₆(SiO₄)₂₄ datasets derived by RG, the system requiring the largest number of bits to represent due to its chemical composition. Learnable parameters of FM are decided upon metrics on the validation set, comprised of 10% of the dataset (see Supplementary Note 3 for details). Changes of Pearson correlation coefficients (PCCs) and mean absolute errors (MAEs) during training are further provided in Supplementary Fig. 3f. The consistently high PCC values indicate effective optimization toward the global optimum, despite of fluctuations due to out-of-distribution energies. Similar trends are observed across all other systems and random seeds.

Additionally, a comparison between FM and full-rank quadratic regression (QR), in which quadratic terms in regression functions are independently learned instead of multiplications between linear terms, is exhibited in Supplementary Fig. 4. These experiments are conducted with CRYSIM-5 representation of Y₆Co₅₁ system, containing 801 bits in the embeddings. Under these conditions, QR involves more than 600,000 trainable parameters, whereas FM requires only 13,617 ones. For reference, optimization results of BO and CALYPSO, previously shown in Supplementary Table 7 are also included as baselines, with three trials performed for each method. On this system, CRYSIM-QR achieves a lower average accumulated energy than CRYSIM-FM, indicating superior optimization performance. However, for systems represented with more than 1,000 bits, QR requires millions of trainable parameters, making FM a more practical option for these tasks in terms of computational efficiency.

Processing with minimum interatomic distance

Inaccuracy of NNPs on configurations with extremely small MIDs renders negative impact on regression models. To mitigate the effect, procedures related to MIDs are designed and integrated in the workflow, including selecting the structure with the largest MID from one solution vector in Fig. 1f, and adjusting unphysical energy estimations in Fig. 1g. Effectiveness of the procedures is demonstrated in Table 1 by comparing CRYSIM of all considered LDRs with and without including these steps during optimization. Tests of each method repeat three times with different seeds. The corresponding accumulated energy curves are provided in Supplementary Fig. 2. Optimizers equipped with the modules achieve a widespread performance enhancement, particularly for larger systems with higher LDRs. Besides, inclusion of MID processing enables structures exploration with high LDRs. For Y₆Co₅₁ and Ca₂₄Al₁₆(SiO₄)₂₄, CRYSIM-12 performs the best with MID-related procedures, but it cannot realize the full potential and is outperformed by CRYSIM-9 without them. The advantage is attributed to improved efficiency in obtaining valid crystals, indicated by a reduction of filtered-out configurations reported in Supplementary Table 15.

Discussion

In this section, we provide a detailed interpretation covering: (i) symmetry representation, (ii) atomic position representation, (iii) MID-related procedures in the workflow, (iv) evaluation on end-to-end CSP algorithms, and (v) the use of Ising solvers.

First of all, incorporating symmetry representations is a fundamental requirement in developing algorithms for crystal generation. Most stable crystal structures in the nature have symmetry^69,70, making it necessary to consider symmetry constraints in the CSP task. Classical optimization methods, including evolutionary algorithms^34,71, PSO^10,11, and BO^12,13,14,49, rely on RG algorithms^50,51,52 to sample symmetric configurations as starting points. Following discussions proposed in RandSpg⁵¹, two strategies of organizing and utilizing WPCs lists are summarized in Supplementary Note 1. CRYSIM differs from the above strategies since the search is conducted within a space of symmetric structures, in which the SG number is optimized directly. Other methods, including USPEX³⁴ and CALYPSO^10,11, only guarantee the symmetry requirement in initialization.

Recently, symmetry of crystals is also emphasized in generative models to increase the possibility of deriving reasonable configurations. As one of the earliest trials in this field, DiffCSP++¹⁷ does not represent symmetry of crystals explicitly, in which networks generate configurations given a specific pre-set WPC constraints. WyCryst²⁰, SymmCD⁷² and WyckoffDiff⁷³ successfully explore symmetry representations with one-hot labelling, operation matrices and graphs, respectively. However, current generative model-based solutions are still not perfect from the perspective of end-to-end CSP. The related works, including the ones trained with an auto-regression strategy, such as CrystalFormer⁷⁴ and WyFormer⁷⁵, and diffusion models, such as SymmCD and WyckoffDiff, rely on a prior distribution of SG from the training dataset for conditional generation. WyCryst is among the methods that predict both SG and WPC for constructing structures, but it requires PyXtal as an external tool to provide atomic positions and fill in WPCs, similar to WyFormer and WyckoffDiff. In comparison, CRYSIM treats the CSP problem in an exact end-to-end manner, in which SG, WPC index and atomic positions are all optimized and do not condition on any given information, except for the input chemical formula.

Second, an accurate description on the CSP problem relies on an effective representation of atomic coordinates in the lattices. There are two main strategies for representing fractional coordinates as variables to be optimized. The first is to treat each coordinate (x_i, y_i and z_i for the i-th atom) as independent variables^11,49, and the second is to split the whole crystal lattice and use the derived discrete blocks in the 3D space to encode positions^29,30,31,32. Most optimization methods based on Ising models adopt the second approach, as it aligns with the goal of achieving guaranteed optimal solutions through quantum annealing by fitting the system’s PES. By representing atomic positions via lattice splitting, an Ising model can encode physical interactions: first-order terms capture the energy contribution of a single atom due to external fields, while second-order terms describe pairwise atomic interactions. This allows the Ising model to approximate the interatomic potential in a quadratic form.

Nevertheless, in practical implementations that account for symmetry, such as CRYSIM and other works²⁹, the solved atomic positions are not external coordinates for building structures, but internal or independent sites to insert in WPs. As a result, the learned Ising model does not fully reflect an actual interatomic potential. One possible solution is to first estimate or sample an SG and WPC, derive corresponding constraints on lattice parameters and coordinates, and then optimize the two parts. Accordingly, by adding penalty terms, Ising solver can optimize directly on external coordinates and preserving the symmetry simultaneously. However, the order of constraints on coordinates would be the same as the multiplicity of corresponding WPs, making it challenging to implement for current solvers.

From a practical perspective, another advantage of the second strategy over the first one is that it requires less bits to encode coordinates in large systems, especially those with few atomic species. This is because the second strategy scales linearly with the number of atomic species and remains constant with respect to the number of atoms, whereas the first strategy scales linearly with the number of atoms within the cell. The scaling with the number of atoms is generally more computationally intensive in large systems. Taking the (SiO₂)₉₆ system in Results as an example, suppose the lattice is split into 15 * 15 * 15 blocks. Following the first strategy, each coordinates require 15 bits to represent, leading to 15 * 3 * (96 + 192) = 12960 bits in total, but only 15 * 15 * 15 * 2 = 6750 are needed based on the second one.

Third, the MID-related procedures employed in this study are necessary for preserving effectiveness of the algorithm and enhancing optimization efficiency practically. Ideally, a pretrained NNP should assign high energies to unstable structures, allowing the objective functions to reflect an accurate structure-energy relationship through active learning for diversion structural configurations. Accordingly, CSP optimizers, designed to identify low-energy solutions, can correctly discover the ground states. However, training sets for state-of-the-art NNPs generally lack out-of-distribution structures in the configuration space, especially for crystals containing extremely close atom pairs, rendering their estimated energies unreasonably low. As presented in Supplementary Table 15, many CRYSIM optimizers without MID processing are encouraged to generate abnormal structures, due to their low estimated relaxed energies, which are, however, ineffective from a practical perspective. Directly replacing pretrained NNP with first-principles calculation software^59,60 can circumvent the problem, but it is still unrealistic for large crystals considering current computational power.

According to an observation that most abnormal relaxed structures originate from abnormal decoded unrelaxed ones, we design MID processing techniques on generated configurations (Fig. 1f, g) to improve the efficiency of CRYSIM optimizers. Besides, previous works⁴ also suggest that atoms in stable structures tend to uniformly distribute, instead of clustering in a small space. However, for some material systems, strategies aimed at controlling MIDs of generated materials (e.g., CRYSPY RG and CALYPSO) or attempting to obtain materials with larger MIDs (e.g., CRYSIM) may hinder discovering of ground states, as evidenced by the Y₆Co₅₁ system in Supplementary Table 7. Although many structures derived by RG and BO implemented in this work are screened out due to very small MIDs, as reported in Supplementary Table 11, CSP of Y₆Co₅₁ is finally accomplished after structure relaxation on crystals that may not have high MIDs. We expect that when a more accurate pretrained NNP is proposed, the MID-related procedures can be discarded.

Forth, this work adopts a simple but rigorous criterion to evaluate end-to-end CSP optimizers. The primary objective of our optimization-based CSP algorithm, CRYSIM, is precisely to locate the global minimum on the PES, representing the most thermodynamically stable structure according to the chosen energy model. To rigorously assess this specific capability during benchmarking, we consider the number of “successful match” (N_M) only for structures with the lowest relaxed energies out of the 300 total structures instead of all of them. This prevents crediting success to fortuitous sampling into higher-energy local minima, thereby isolating the evaluation of optimization performance.

More critically, this methodology also reflects the practicability in CSP tasks. When the target structure is unknown, researchers inevitably rely on the calculated energy ranking, treating the lowest-energy prediction as the most likely candidate for experimental synthesis or validation. Higher-energy predictions, even if potentially correct, cannot be identified as such without prior knowledge of the ground truth. Thus, by focusing our evaluation on the lowest-energy structure, our metric is not only relevant to CRYSIM’s optimization objective but also aligned with the practical interpretation and utility of CSP results.

Finally, as a solver for addressing combinatorial optimization problems, quantum annealing (QA)^{76,77,78,79,80,81} has gained attention due to its theoretical ability to escape from local minima, and D-Wave system³⁵ is among the most widely used implementations of QA^29,31,33. However, the maximum number of variables for the D-Wave system is limited to 124 bits⁴³, severely restricting its application. In this work, we integrate Amplify⁴², a GPU-based Ising solver, into CRYSIM as a substitute of quantum annealers. Nevertheless, we claim that the present implementation can be applied directly on quantum annealers without adjustment as quantum computers continue to develop.

Conclusion

In conclusion, CRYSIM, a CSP optimizer based on symmetry-encoded Ising models, is proposed and tested across various CSP tasks. To the best of our knowledge, it is the first Ising machine-based optimizer for CSP that dynamically optimizes symmetry. CRYSIM outperforms CRYSPY RG, BO, and CALYPSO on most systems, showcasing its strong optimization capabilities not only for small benchmark crystals but also for larger ones, including Ca₂₄Al₁₆(SiO₄)₂₄ and (SiO₂)₉₆. The predicting accuracy of FM in CRYSIM is also discussed, highlighting its expressivity in CSP tasks. CRYSIM offers a promising Ising machine-based optimization tool for CSP that could potentially be applied to quantum annealers in the future.

Methods

Random generation of crystal structures

Random generation (RG) constitutes the foundation of CSP^11,13,68, in which atomic positions are randomly sampled according to the given chemical composition. In this work, a simple crystal RG tool is implemented as the CSP baseline, as well as preparing training data for Ising models.

In RG, six lattice parameters (lattice lengths a, b, c, and lattice angles α, β, γ) and fractional coordinates are sampled independently from uniform distributions. To determine the default lower and upper bounds of the distributions for lattice lengths, we perform statistical analysis on materials in MP. Let M be the set of all materials in the MP database. Define a function $| m| :M\to {\mathbb{N}}$ that maps each material m to the number of atoms in its unit cell. We then partition M into five categories, {M₁, M₂, M₃, M₄, M₅}, such that for each material m ∈ M, it belongs to category M_i if it satisfies

$$\left\{\begin{array}{l}{M}_{1}=\{m| | m| \le 20\},\hfill\\ {M}_{2}=\{m| 20 < | m| \le 50\},\\ {M}_{3}=\{m| 50 < | m| \le 80\},\\ {M}_{4}=\{m| 80 < | m| \le 100\},\\ {M}_{5}=\{m| | m| > 100\}.\hfill\end{array}\right.$$

(3)

Next, the averages of a, b, and c for materials in each category M_i are computed, denoted as ${a}_{{M}_{i}}$, ${b}_{{M}_{i}}$, and ${c}_{{M}_{i}}$, respectively. For a specific material system m₀ to be generated, if the number of atoms ∣m₀∣ falls within one of the ranges, the lower and upper bounds are determined as follows:

$$\left\{\begin{array}{l}{l}_{{m}_{0}}=0.8\times ({a}_{{M}_{i}}+{b}_{{M}_{i}}+{c}_{{M}_{i}}),\\ {u}_{{m}_{0}}=2\times ({a}_{{M}_{i}}+{b}_{{M}_{i}}+{c}_{{M}_{i}}).\end{array}\right.$$

(4)

These bounds are the same for the three lattice lengths. The lower and upper bounds for the lattice angles are set to 50^∘ and 130^∘, respectively.

Then, space group (SG) and corresponding Wyckoff positions combination (WPC) are derived for building symmetry. Given the input chemical composition, let ${{\mathbb{S}}}_{+}$ be the set of SGs that are compatible with the stoichiometry. For each $S\in {{\mathbb{S}}}_{+}$, let ${{\mathbb{W}}}_{S}$ be the set of corresponding WPCs (see Supplementary Note 1). Let $| {{\mathbb{W}}}_{S}|$ denote the number of distinct compatible WPCs for SG S. The process involves sampling an SG and then a WPC. An SG number is sampled from all compatible ones (${{\mathbb{S}}}_{+}$) uniformly, i.e., an SG S_l is selected by sampling its identifier ${i}_{{S}_{l}}$ uniformly from the set of identifiers for SGs in ${{\mathbb{S}}}_{+}$:

$${i}_{{S}_{l}} \sim {{{\mathscr{U}}}}(\{{{{\rm{id}}}}(S)| S\in {{\mathbb{S}}}_{+}\}),\quad {S}_{l}={{{\rm{SG}}}}({i}_{{S}_{l}}).$$

(5)

Based on S_l, a WPC is subsequently sampled. Define the maximum WPC count ${W}_{\max }={\max }_{S\in {{\mathbb{S}}}_{+}}| {{\mathbb{W}}}_{S}|$. Sample an integer ${i}_{{W}_{l}}$ uniformly from $\{0,1,\ldots ,{W}_{\max }-1\}$:

$${i}_{{W}_{l}} \sim {{{\mathscr{U}}}}(\{0,1,\ldots ,{W}_{\max }-1\}).$$

(6)

The WPC W_l is derived from the WPCs set corresponding to the chosen SG S_l based on the sampled identifier:

$${W}_{l}={{{{\rm{WPC}}}}}_{{S}_{l}}\left(\left\lfloor {i}_{{W}_{l}}\cdot \frac{| {{\mathbb{W}}}_{{S}_{l}}| }{{W}_{\max }}\right\rfloor \right),$$

(7)

where ⌊x⌋ is the floor function, which returns the greatest integer less than or equal to x.

Finally, fractional coordinates are uniformly sampled from the interval [0, 1). These coordinates are treated as independent sites and placed into the Wyckoff positions W_l, where they are transformed into external coordinates to satisfy symmetry constraints. Similarly, the generated lattice parameters are assigned to variables defined by the crystal system (CS) C_l associated with the sampled SG S_l. Since WPCs impose dependencies among coordinates, reducing the degrees of freedom, only the earliest generated coordinates are used. The same approach applies to lattice parameters constrained by a CS.

When generating datasets for training, structures containing atom pairs with distances smaller than 1.5 Å are removed to ensure that most generated structures have a reasonable estimated energy, which is essential for training an accurate objective function. Distance filtering is not involved when evaluating the performance of the RG baseline. A much refined RG process is implemented by PyXtal⁵², which has been tested as the CRYSPY RG baseline in this work.

Details of symmetry-informed integer encoding in CRYSIM

Integer encoding can be interpreted as follows. Suppose a binary vector segment containing N_v bits is leveraged for representing parameter v. For a continuous parameter $v\in [{v}_{\min },{v}_{\max })$, the i_v-th bit of the vector segment will be assigned as 1 and other elements are 0s with

$$\begin{array}{r}{i}_{v}=\left\lfloor \frac{v-{v}_{\min }}{{u}_{v}}\right\rfloor ,\end{array}$$

(8)

where ${u}_{v}=({v}_{\max }-{v}_{\min })/{N}_{v}$ is the unit or interval of the representation. For a discrete parameter, i_v = 1 if the parameter v of the system corresponds to the i-th category.

Lattice parameters encoding

In the workflow of CRYSIM, integer encoding is initially performed on training sets generated by RG. The upper and lower bound of lattice length encoding, simultaneously the highest and lowest lattice length of decoded materials, are calculated based on data points in the sets. Let $l{l}_{0,\max }$ and $l{l}_{0,\min }$ represent the maximum and minimum lattice lengths (for a, b, and c) among all structures in the initial training set. Then, the upper and lower bounds for lattice length encoding are defined as

$$\left\{\begin{array}{l}l{l}_{\max }=1.1* l{l}_{0,\max },\\ l{l}_{\min }=l{l}_{0,\min }.\hfill\end{array}\right.$$

(9)

The number of bits for representing lattice lengths N_a = N_b = N_c = N_ll is dependent on divisions of the lattice when encoding atomic coordinates, which is calculated by

$${N}_{ll}={C}_{ll}* g* \frac{l{l}_{\max }-l{l}_{\min }}{l{l}_{\max }},$$

(10)

in which g denotes the current LDR, and C_ll = 10 by default. For lattice angles (α, β and γ), the lower and upper bound are 50^∘ and 130^∘, the same as RG. The unit for encoding angles is 2^°, so that one lattice angle is encoded using

$${N}_{lg}=(130-50)/2=40$$

(11)

bits. The number of bits for encoding lattice parameters would be 3*N_ll + 3*N_lg.

Fractional coordinates encoding

Atomic configurations are represented by discretizing the unit cell into a g × g × g voxel grid, where g is the LDR. A 3D binary matrix X ∈ {0, 1}^g×g×g is constructed, where element X_l,m,n corresponds to the voxel region R_l,m,n defined by fractional coordinates y = (y₁, y₂, y₃):

$${R}_{l,m,n}=\left\{{{{\bf{y}}}}| \frac{l}{g} < {y}_{1} < \frac{l+1}{g},\ \frac{m}{g} < {y}_{2} < \frac{m+1}{g},\ \frac{n}{g} < {y}_{3} < \frac{n+1}{g}\right\}$$

(12)

for l, m, n ∈ {0, 1, …, g − 1}. X_l,m,n is set to 1 if an atom’s fractional coordinates fall within R_l,m,n, and 0 otherwise. For crystals containing multiple element species, a separate flattened matrix is constructed for each of them. After concatenation, encoded information of each element is stored in separate regions of the final embedding.

We note that the optimized coordinates are internal ones, which will be placed into WPCs to satisfy symmetry constraints. Besides, similar to RG, the derived bits might be redundant. In implementation, 1-bits in the leftmost positions in the vector segment for each element are used, until all variables in the solved WPC are decided. Special positions are not involved in optimization, therefore not encoded in the discretized space. Once a specific WPC is selected/decoded, the special positions participate in constructing structures automatically. For instance, when filling the optimized position (0.2, 0.4, 0.8) into a WP:

$$\left\{\begin{array}{l}{A}_{1}:(1/2,1/2,1/2),\\ {A}_{2}:(x,1/2,1/2),\hfill\\ {A}_{3}:(-x,1/2,1/2),\\ {A}_{4}:(x,y,z),\hfill\\ {A}_{5}:(-x,-y,-z),\hfill\end{array}\right.$$

(13)

the result is:

$$\left\{\begin{array}{l}{A}_{1}:(1/2,1/2,1/2),\hfill\\ {A}_{2}:(0.2,1/2,1/2),\hfill\\ {A}_{3}:(-0.2,1/2,1/2),\hfill\\ {A}_{4}:(0.2,0.4,0.8),\hfill\\ {A}_{5}:(-0.2,-0.4,-0.8).\end{array}\right.$$

(14)

For experiments on benchmark crystals, the Y₆Co₅₁ and Ca₂₄Al₁₆(SiO₄)₂₄ system in this study, LDRs of CRYSIM are set to 12, with (SiO₂)₉₆ being 15.

Symmetry information representation

Symmetry information involves the CS, SG and WPC, which define the crystal’s symmetry. Numbers of bits for the three parts, i.e., N_C, N_S and N_W, are dependent on the WPCs list calculated based on stoichiometry of the system. To be specific, only compatible SGs and CSs are encoded, and whether an SG and CS is compatible or not is determined by the existence of WPCs that can be used to build the corresponding symmetry, as is illustrated in Supplementary Note 1.

Details of calculating N_C, N_S and N_W are presented as follows. Let ${{\mathbb{C}}}_{+}$ and ${{\mathbb{S}}}_{+}$ denote all compatible CSs and SGs, respectively, and ${{\mathbb{S}}}_{C}$ denotes the set of SGs associated with a CS C. We then define the set of compatible SGs for C as ${{\mathbb{S}}}_{C,+}={{\mathbb{S}}}_{C}\cap {{\mathbb{S}}}_{+}$. The numbers of bits for representing CSs and SGs can be decided as

$$\left\{\begin{array}{l}{N}_{C}=| {{\mathbb{C}}}_{+}| ,\\ {N}_{S}=\mathop{\max }_{C\in {{\mathbb{C}}}_{+}}| {{\mathbb{S}}}_{C,+}| .\end{array}\right.$$

(15)

N_W can be independently set. In many cases, the number of distinct compatible WPCs for an SG S, denoted as $| {{\mathbb{W}}}_{S}|$, is so large that encoding all WPCs within a binary segment becomes impractical. To address this, only WPCs with indices within the set $\{\lfloor \frac{i\cdot | {{\mathbb{W}}}_{S}| }{{N}_{W}}\rfloor | i=0,1,\ldots ,{N}_{W}-1\}$ are included. In this work, N_W is set to 300 by default. However, during decoding, each bit i corresponds to a group of 100 WPCs with indices $\{\lfloor \frac{i\cdot | {{\mathbb{W}}}_{S}| }{{N}_{W}}\rfloor +j| j=0,1,\ldots ,99\}$.

When encoding symmetry information of a crystal having CS C_l, SG S_l and WPC W_l, first of all, the corresponding bit representing the CS is assigned as 1. The bit for S_l is derived by

$${i}_{{S}_{l}}=\lfloor {{{{\rm{id}}}}}_{{C}_{l},+}({S}_{l})* \frac{{N}_{S}}{| {{\mathbb{S}}}_{{C}_{l},+}| }\rfloor ,$$

(16)

in which ${{{{\rm{id}}}}}_{{C}_{l},+}({S}_{l})$ is the identifier of S_l in the ascending ordered sequence with respect to the set ${{\mathbb{S}}}_{{C}_{l},+}$. For instance, in the SG set of the cubic CS, i.e., $\{P23,F23,...,Ia\overline{3}d\}$, the identifier of SG P23 is 0. Similarly, the bit for W_l is calculated by

$${i}_{{W}_{l}}=\lfloor {{{{\rm{id}}}}}_{{S}_{l}}({W}_{l})* \frac{{N}_{W}}{| {{\mathbb{W}}}_{{S}_{l}}| }\rfloor .$$

(17)

where ${{{{\rm{id}}}}}_{{S}_{l}}({W}_{l})$ represents the identifier of W_l in ${{\mathbb{W}}}_{{S}_{l}}$.

During decoding processes, the following relationships are adopted

$$\left\{\begin{array}{l}{{{{\rm{id}}}}}_{{C}_{l},+}({S}_{l})=\lfloor {i}_{{S}_{l}}* \frac{| {{\mathbb{S}}}_{C,+}| }{{N}_{S}}+0.5\rfloor ,\quad {S}_{l}={{{{\rm{SG}}}}}_{{C}_{l},+}({{{{\rm{id}}}}}_{{C}_{l},+}({S}_{l})),\\ {{{{\rm{id}}}}}_{{S}_{l}}({W}_{l})=\lfloor {i}_{{W}_{l}}* \frac{| {{\mathbb{W}}}_{S}| }{{N}_{W}}+0.5\rfloor ,\quad {W}_{l}={{{{\rm{WPC}}}}}_{{S}_{l}}({{{{\rm{id}}}}}_{{S}_{l}}({W}_{l})),\end{array}\right.$$

(18)

to make the encoding-decoding procedures stable and reversible.

We note that according to the integer encoding strategy implemented in CRYSIM, only one bit should be assigned as 1 in a vector segment for one parameter, so that decoding from the segment to real values can be performed directly. Amplify provides options to add constraints as penalty terms to the objective function to encourage generation of solutions fulfilling specific requirements. In summary, the number of bits for symmetry encoding would be N_C + N_S + N_W.

An example on symmetry information representation

We further provide an illustrative example on encoding and decoding symmetry information. For the A₄B₄ system, there are 2 SGs available for the triclinic CS, 12 for monoclinic, 56 for orthorhombic, 66 for tetragonal, 12 for trigonal, 19 for hexagonal, and 17 for cubic. Since all CSs are compatible with the system, N_C = 7. The maximum number of compatible SGs across all CSs is 66, and therefore N_S = 66. N_W can be independently set as 300 by default. Accordingly, the total number of bits for encoding symmetry information is 7 + 66 + 300. When encoding SGs, as an example, P4₁, the No.76 SG, is the second compatible SG belonging to tetragonal CS, thus the 4_th bit for CS and the 2_nd bit for SG are set to 1. For P23 (SG No.195) and F23 (SG No. 196), the first and second compatible SG in the cubic category, the corresponding bit for SG is calculated following

$${i}_{P23}=\left\lfloor 0* \frac{66}{17}\right\rfloor =0,$$

(19)

and

$${i}_{F23}=\left\lfloor 1* \frac{66}{17}\right\rfloor =3,$$

(20)

respectively. On the other hand, in the decoding process, the bit 2, 3 and 4 will correspond to

$$\left\{\begin{array}{l}\lfloor 2* \frac{17}{66}+0.5\rfloor =0\to P23,\\ \lfloor 3* \frac{17}{66}+0.5\rfloor =1\to F23,\\ \lfloor 4* \frac{17}{66}+0.5\rfloor =1\to F23.\end{array}\right.$$

(21)

In actual implementation, since it is impossible to determine WPC index from an already generated structure, we only curate training sets obtained from our RG algorithm, in which crystals are constructed based on an already chosen WPC.

Criterion for matching structures

StructureMatcher function in pymatgen package⁵⁷ is used to compare configurations, in which parameters are set as stol=0.5, ltol=0.3, angle_tol=10.0, consistent with other related works^82,83. This function will calculate the minimum normalized average root mean square pair-wise displacement between two input structures among all atom permutations. But if corresponding atoms in the two structures are not detected, which means that the function cannot identify any similarity between them, the calculation will not be proceeded. Accordingly, a structure is recognized to be accordant with the ground truth if having a computable displacement with it, and a model successfully finds the ground truth in one run if there is at least one such structure being generated.

Factorization machine for quadratic regression

Factorization machine (FM) is a type of regression model proposed as a substitute of Support Vector Machine to address its failure on sparse data³⁷. FM creates a mapping between a vector ${{{\bf{x}}}}\in {{\mathbb{R}}}^{M}$ and real value y by equation (2), where b, h_i, w_k,i are coefficients for bias, linear and quadratic interactions, respectively. In principle, FM can be extended to model n-interaction terms, but we restrict it to quadratic terms since current combinatorial optimizers are efficient only for solving quadratic objective functions. In that case, FM can be reformulated as

$$y=b+\mathop{\sum }_{i=1}^{M}{h}_{i}{x}_{i}+\frac{1}{2}{\sum }_{k=1}^{K}\left({\left({\sum }_{i = 1}^{M}{w}_{ki}{x}_{i}\right)}^{2}-{\sum }_{i=1}^{M}{w}_{ki}^{2}{x}_{i}^{2}\right),$$

(22)

reducing computational complexity from O(KM) to O(2K)³⁷. One of advantages of FM in this work is that it requires less fitting parameters, enabling a quadratic regression on binary vectors containing thousands of bits. Taking a vector of 2000 bits as an example, a full-rank quadratic regressor requires 2000 × 1999 terms for interactions, while FM only needs 2000 × K terms, in which K is usually smaller than 30. In this work, we implement FM with PyTorch⁵³ based on equation (22) to the accelerate learning process.

Data availability

Ground state configurations considered in this study can be downloaded from the MP database⁵⁵. Initial datasets for training FM are generated using RG implemented in CRYSIM, and no external data is included. All relevant data is available from the authors upon request. The original data for the figures in this paper and Supplementary Information is stored in Supplementary Data.

Code availability

Implementation of CRYSIM is available at https://github.com/tsudalab/CRYSIM. As of March 2025, Fixstars Amplify is available via Python API free of charge, which can be accessed at https://amplify.fixstars.com/en/.

References

de Pablo, J. J. et al. New frontiers for the materials genome initiative. npj Comput. Mater. 5, 41 (2019).
Article ADS Google Scholar
Woodley, S. M. & Catlow, R. Crystal structure prediction from first principles. Nat. Mater. 7, 937–946 (2008).
Article ADS Google Scholar
Oganov, A. R., Pickard, C. J., Zhu, Q. & Needs, R. J. Structure prediction drives materials discovery. Nat. Rev. Mater. 4, 331–348 (2019).
Article ADS Google Scholar
Wang, Y., Lv, J., Gao, P. & Ma, Y. Crystal structure prediction via efficient sampling of the potential energy surface. Acc. Chem. Res. 55, 2068–2076 (2022).
Article Google Scholar
Bush, T. S., Catlow, C. R. A. & Battle, P. D. Evolutionary programming techniques for predicting inorganic crystal structures. J. Mater. Chem. 5, 1269–1272 (1995).
Article Google Scholar
M. Woodley, S., D. Battle, P., D. Gale, J. & Richard A. Catlow, C. The prediction of inorganic crystal structures using a genetic algorithm and energy minimisation. Phys. Chem. Chem. Phys. 1, 2535–2542 (1999).
Article Google Scholar
Glass, C. W., Oganov, A. R. & Hansen, N. Uspex—evolutionary crystal structure prediction. Computer Phys. Commun. 175, 713–720 (2006).
Article ADS Google Scholar
Lonie, D. C. & Zurek, E. Xtalopt: An open-source evolutionary algorithm for crystal structure prediction. Comput. Phys. Commun. 182, 372–387 (2011).
Article ADS Google Scholar
Oganov, A. R., Lyakhov, A. O. & Valle, M. How evolutionary crystal structure prediction works—and why. Acc. Chem. Res. 44, 227–237 (2011).
Article Google Scholar
Wang, Y., Lv, J., Zhu, L. & Ma, Y. Crystal structure prediction via particle-swarm optimization. Phys. Rev. B 82, 094116 (2010).
Article ADS Google Scholar
Wang, Y., Lv, J., Zhu, L. & Ma, Y. Calypso: A method for crystal structure prediction. Comput. Phys. Commun. 183, 2063–2070 (2012).
Article ADS Google Scholar
Yamashita, T. et al. Crystal structure prediction accelerated by bayesian optimization. Phys. Rev. Mater. 2, 013803 (2018).
Article Google Scholar
Yamashita, T. et al. Cryspy: a crystal structure prediction tool accelerated by machine learning. Sci. Technol. Adv. Mater. 1, 87–97 (2021).
Google Scholar
Yamashita, T., Kino, H., Tsuda, K., Miyake, T. & Oguchi, T. Hybrid algorithm of bayesian optimization and evolutionary algorithm in crystal structure prediction. Sci. Technol. Adv. Mater.: Methods 2, 67–74 (2022).
Google Scholar
Xie, T., Fu, X., Ganea, O.-E., Barzilay, R. & Jaakkola, T. S. Crystal diffusion variational autoencoder for periodic material generation. In International Conference on Learning Representations (2022).
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Article ADS Google Scholar
Jiao, R., Huang, W., Liu, Y., Zhao, D. & Liu, Y. Space group constrained crystal generation. In The Twelfth International Conference on Learning Representations (2024).
Luo, X. et al. Deep learning generative model for crystal structure prediction. npj Comput. Mater. 10, 254 (2024).
Article ADS Google Scholar
Chang, L. et al. Shotgun crystal structure prediction using machine-learned formation energies. npj Comput. Mater. 10, 298 (2024).
Article ADS Google Scholar
Zhu, R., Nong, W., Yamazaki, S. & Hippalgaonkar, K. Wycryst: Wyckoff inorganic crystal generator framework. Matter 7, 3469–3488 (2024).
Article Google Scholar
Zeni, C. et al. A generative model for inorganic materials design. Nature, 639, 624–632. https://doi.org/10.1038/s41586-025-08628-5 (2025).
Singh, A. K. et al. Genetic algorithm prediction of two-dimensional group-iv dioxides for dielectrics. Phys. Rev. B 95, 155426 (2017).
Article ADS Google Scholar
M. Dieb, T., Hou, Z. & Tsuda, K. Structure prediction of boron-doped graphene by machine learning. J. Chem. Phys. 148, 241716 (2018).
Article ADS Google Scholar
Walsh, A. & Woodley, S. M. Evolutionary structure prediction and electronic properties of indium oxide nanoclusters. Phys. Chem. Chem. Phys. 12, 8446–8453 (2010).
Article Google Scholar
Wang, Y. et al. Accelerated prediction of atomically precise cluster structures using on-the-fly machine learning. npj Comput. Mater. 8, 173 (2022).
Article ADS Google Scholar
Wang, H. et al. Seeded growth of single-crystal black phosphorus nanoribbons. Nat. Mater. 23, 470–478 (2024).
ADS Google Scholar
Tanaka, S., Tamura, R. & Chakrabarti, B. K.Quantum spin glasses, annealing and computation (Cambridge University Press, 2017).
Mohseni, N., McMahon, P. L. & Byrnes, T. Ising machines as hardware solvers of combinatorial optimization problems. Nat. Rev. Phys. 4, 363–379 (2022).
Article Google Scholar
Gusev, V. V. et al. Optimality guarantees for crystal structure prediction. Nature 619, 68–72 (2023).
Article ADS Google Scholar
Ichikawa, K., Ohuchi, S., Ueno, K. & Yokoyama, T. Accelerating optimal elemental configuration search in crystal using Ising machine. Phys. Rev. Res. 6, 033321 (2024).
Article Google Scholar
Couzinié, Y. et al. Annealing for prediction of grand canonical crystal structures: Implementation of n-body atomic interactions. Phys. Rev. A 109, 032416 (2024).
Article ADS MathSciNet Google Scholar
Xu, Z., Shang, W., Kim, S., Lee, E. & Luo, T. Quantum annealing-assisted lattice optimization. npj Comput. Mater. 11, 4 (2025).
Article ADS Google Scholar
Couzinié, Y. et al. Machine learning supported annealing for the prediction of grand canonical crystal structures. J. Phys. Soc. Jpn. 94, 044802 (2025).
Article ADS Google Scholar
Lyakhov, A. O., Oganov, A. R., Stokes, H. T. & Zhu, Q. New developments in evolutionary structure prediction algorithm uspex. Computer Phys. Commun. 184, 1172–1182 (2013).
Article ADS Google Scholar
McGeoch, C. C., Harris, R., Reinhardt, S. P. & Bunyk, P. I. Practical annealing-based quantum computing. Computer 52, 38–46 (2019).
Article Google Scholar
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).
Article Google Scholar
Rendle, S. Factorization machines. In 2010 IEEE International Conference on Data Mining, 995–1000, https://doi.org/10.1109/ICDM.2010.127 (2010).
Kitai, K. et al. Designing metamaterials with quantum annealing and factorization machines. Phys. Rev. Res. 2, 013319 (2020).
Article Google Scholar
Guo, J., Kitai, K., Jippo, H. & Shiomi, J. Boosting the quality factor of tamm structures to millions by quantum-inspired classical annealer with factorization machine 2408.05799 (2024).
Xu, Z. et al. Quantum-inspired genetic algorithm for designing planar multilayer photonic structure. npj Comput. Mater. 10, 257 (2024).
Article ADS Google Scholar
Podryabinkin, E. V., Tikhonov, E. V., Shapeev, A. V. & Oganov, A. R. Accelerating crystal structure prediction by machine-learning interatomic potentials with active learning. Phys. Rev. B 99, 064114 (2019).
Article ADS Google Scholar
Fixstars amplify. https://amplify.fixstars.com/en/. Accessed: 2024-11-26.
Mao, Z., Matsuda, Y., Tamura, R. & Tsuda, K. Chemical design with GPU-based Ising machines. Digital Discov. 2, 1098–1103 (2023).
Article Google Scholar
Endo, K., Matsuda, Y., Tanaka, S. & Muramatsu, M. A phase-field model by an Ising machine and its application to the phase-separation structure of a diblock polymer. Sci. Rep. 12, 10794 (2022).
Article ADS Google Scholar
Urushihara, M., Karube, M., Yamaguchi, K. & Tamura, R. Optimization of core-shell nanoparticles using a combination of machine learning and Ising machine. Adv. Photonics Res. 4, 2300226 (2023).
Article Google Scholar
Tamura, R. et al. Machine learning prediction of the mechanical properties of injection-molded polypropylene through X-ray diffraction analysis. Sci. Technol. Adv. Mater. 25, 2388016 (2024).
Article Google Scholar
Kagemoto, H. Possible application of quantum computing in the field of ocean engineering: optimization of an offshore wind farm layout with the Ising model. J. Ocean Eng. Mar. Energy 10, 773–782 (2024).
Article Google Scholar
Sukulthanasorn, N. et al. A novel design update framework for topology optimization with quantum annealing: Application to truss and continuum structures. Computer Methods Appl. Mech. Eng. 437, 117746 (2025).
Article MathSciNet Google Scholar
Cheng, G., Gong, X.-G. & Yin, W.-J. Crystal structure prediction by combining graph network and optimization algorithm. Nat. Commun. 13, 1492 (2022).
Article ADS Google Scholar
Deng, X. & Dong, C. SMEPOC – a computer program for the automatic generation of trial structural models for inorganic compounds with symmetry restriction. J. Appl. Crystallogr. 42, 953–958 (2009).
Article ADS Google Scholar
Avery, P. & Zurek, E. Randspg: An open-source program for generating atomistic crystal structures with specific space groups. Computer Phys. Commun. 213, 208–216 (2017).
Article ADS Google Scholar
Fredericks, S., Parrish, K., Sayre, D. & Zhu, Q. Pyxtal: A Python library for crystal structure generation and symmetry analysis. Comput. Phys. Commun. 261, 107810 (2021).
Article MathSciNet Google Scholar
Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library. In Wallach, H. et al. (eds.) Advances in Neural Information Processing Systems, vol. 32 (Curran Associates, Inc., 2019).
Wei, L., Li, Q., Omee, S. S. & Hu, J. Towards quantitative evaluation of crystal structure prediction performance. Comput. Materials Science 235, https://doi.org/10.1016/j.commatsci.2024.112802 (2024).
Jain, A. et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Article ADS Google Scholar
Bergstra, J., Komer, B., Eliasmith, C., Yamins, D. & Cox, D. D. Hyperopt: a Python library for model selection and hyperparameter optimization. Comput. Sci. Discov. 8, 014008 (2015).
Article Google Scholar
Ong, S. P. et al. Python materials genomics (pymatgen): A robust, open-source Python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article Google Scholar
Deng, B. et al. Systematic softening in universal machine learning interatomic potentials. npj Comput. Mater. 11, 9 (2025).
Article ADS Google Scholar
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Article Google Scholar
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Article ADS Google Scholar
Terayama, K., Yamashita, T., Oguchi, T. & Tsuda, K. Fine-grained optimization method for crystal structure prediction. npj Comput. Mater. 4, 32 (2018).
Article ADS Google Scholar
Ishikawa, T., Fukazawa, T., Xing, G., Tadano, T. & Miyake, T. Evolutionary search for cobalt-rich compounds in the yttrium-cobalt-boron system. Phys. Rev. Mater. 5, 054408 (2021).
Article Google Scholar
Lyakhov, A. O., Oganov, A. R. & Valle, M. How to predict very large and complex crystal structures. Computer Phys. Commun. 181, 1623–1632 (2010).
Article ADS Google Scholar
Frazier, P. I. A tutorial on Bayesian optimization 1807.02811 (2018).
Moriconi, R., Deisenroth, M. P. & Sesh Kumar, K. S. High-dimensional Bayesian optimization using low-dimensional feature spaces. Mach. Learn. 109, 1925–1943 (2020).
Article MathSciNet Google Scholar
Eriksson, D. & Jankowiak, M. High-dimensional Bayesian optimization with sparse axis-aligned subspaces. In de Campos, C. & Maathuis, M. H. (eds.) Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, vol. 161 of Proceedings of Machine Learning Research, 493–503 (PMLR, 2021).
Oganov, A. R. & Valle, M. How to quantify energy landscapes of solids. J. Chem. Phys. 130, 104504 (2009).
Article ADS Google Scholar
Pickard, C. J. & Needs, R. J. Ab initio random structure searching. J. Phys. Condens. Matter 23, 053201 (2011).
Article ADS Google Scholar
Wales, D. J. Symmetry, near-symmetry and energetics. Chem. Phys. Lett. 285, 330–336 (1998).
Article ADS Google Scholar
Urusov, V. S. & Nadezhina, T. N. Frequency distribution and selection of space groups in inorganic crystal chemistry. J. Struct. Chem. 50, 22–37 (2009).
Article Google Scholar
Avery, P., Toher, C., Curtarolo, S. & Zurek, E. Xtalopt version r12: An open-source evolutionary algorithm for crystal structure prediction. Computer Phys. Commun. 237, 274–275 (2019).
Article ADS Google Scholar
Levy, D. et al. SymmCD: Symmetry-preserving crystal generation with diffusion models. In The Thirteenth International Conference on Learning Representations (2025).
Kelvinius, F. E. et al. Wyckoffdiff – a generative diffusion model for crystal symmetry. In Forty-second International Conference on Machine Learning (2025).
Cao, Z., Luo, X., Lv, J. & Wang, L. Space group informed transformer for crystalline materials generation. Science Bulletin https://doi.org/10.1016/j.scib.2025.09.035 (2025).
Kazeev, N. et al. Wyckoff transformer: Generation of symmetric crystals. In Forty-second International Conference on Machine Learning (2025).
Finnila, A. B., Gomez, M. A., Sebenik, C., Stenson, C. & Doll, J. D. Quantum annealing: A new method for minimizing multidimensional functions. Chem. Phys. Lett. 219, 343–348 (1994).
Article ADS Google Scholar
Kadowaki, T. & Nishimori, H. Quantum annealing in the transverse Ising model. Phys. Rev. E 58, 5355–5363 (1998).
Article ADS Google Scholar
Johnson, M. W. et al. Quantum annealing with manufactured spins. Nature 473, 194–198 (2011).
Article ADS Google Scholar
Albash, T. & Lidar, D. A. Adiabatic quantum computation. Rev. Modern Phys. 90, https://doi.org/10.1103/RevModPhys.90.015002 (2018).
Hauke, P., Katzgraber, H. G., Lechner, W., Nishimori, H. & Oliver, W. D. Perspectives of quantum annealing: methods and implementations. Rep. Prog. Phys. 83, 054401 (2020).
Article ADS Google Scholar
King, A. D. et al. Coherent quantum annealing in a programmable 2000 qubit Ising chain. Nat. Phys. 18, 1324–1328 (2022).
Article Google Scholar
Jiao, R. et al. Crystal structure prediction by joint equivariant diffusion. In Thirty-seventh Conference on Neural Information Processing Systems (2023).
Sriram, A., Miller, B. K., Chen, R. T. Q. & Wood, B. M. FlowLLM: Flow matching for material generation with large language models as base distributions. In The Thirty-eighth Annual Conference on Neural Information Processing Systems (2024).
Momma, K. & Izumi, F. VESTA3 for three-dimensional visualization of crystal, volumetric, and morphology data. J. Appl. Crystallogr. 44, 1272–1276 (2011).
Article ADS Google Scholar

Download references

Acknowledgements

This work is supported by JST ERATO JPMJER1903, JST CREST JPMJCR21O2, and JSPS KAKENHI Young Scientist (23K16942). C.L. would like to gratefully acknowledge the financial support from the China Scholarship Council (CSC No. 202306210120). The authors thank Yaotang Zhang for discussions.

Author information

Authors and Affiliations

Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Japan
Chen Liang, Diptesh Das, Jiang Guo, Ryo Tamura, Zetian Mao & Koji Tsuda
Center for Basic Research on Materials, National Institute for Materials Science, Ibaraki, Japan
Ryo Tamura & Koji Tsuda
RIKEN Center for Advanced Intelligence Project, Tokyo, Japan
Koji Tsuda

Authors

Chen Liang
View author publications
Search author on:PubMed Google Scholar
Diptesh Das
View author publications
Search author on:PubMed Google Scholar
Jiang Guo
View author publications
Search author on:PubMed Google Scholar
Ryo Tamura
View author publications
Search author on:PubMed Google Scholar
Zetian Mao
View author publications
Search author on:PubMed Google Scholar
Koji Tsuda
View author publications
Search author on:PubMed Google Scholar

Contributions

C.L. implemented the CRYSIM package and conducted all experiments. D.D. contributed to insights into the design of Ising models and constraints. Z.M. contributed to implementation of the training framework of FM. J.G. and R.T. contributed to analysis about the usage of Amplify and other Ising solvers. K.T. proposed the idea of the work. Z.M. and K.T. provided guidance on experiments design and results analysis. All authors reviewed and contributed to the writing of the manuscript.

Corresponding authors

Correspondence to Zetian Mao or Koji Tsuda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Physics thanks Wei Nong, Kedar Hippalgaonkar and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Description of Additional Supplementary Files (download DOCX )

Supplementary Data (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Liang, C., Das, D., Guo, J. et al. Predicting symmetric structures of large crystals with GPU-based Ising machines. Commun Phys 8, 477 (2025). https://doi.org/10.1038/s42005-025-02380-y

Download citation

Received: 13 April 2025
Accepted: 19 October 2025
Published: 26 November 2025
Version of record: 26 November 2025
DOI: https://doi.org/10.1038/s42005-025-02380-y