Accelerating hybrid XOR–CNF Boolean satisfiability problems natively with in-memory computing

Im, Haesol; Böhm, Fabian; Pedretti, Giacomo; Kushida, Noriyuki; Noori, Moslem; Valiante, Elisabetta; Zhang, Xiangyi; Yang, Chan-Woo; Bhattacharya, Tinish; Sheng, Xia; Ignowski, Jim; Heittmann, Arne; Strachan, John Paul; Mohseni, Masoud; Beausoleil, Raymond; Vaerenbergh, Thomas Van; Rozada, Ignacio

doi:10.1038/s41467-026-69465-2

Download PDF

Article
Open access
Published: 19 February 2026

Accelerating hybrid XOR–CNF Boolean satisfiability problems natively with in-memory computing

Nature Communications volume 17, Article number: 2922 (2026) Cite this article

4400 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The Boolean satisfiability (SAT) problem is a computationally challenging decision problem central to many industrial applications. For SAT problems in cryptanalysis, circuit design, and telecommunication, solutions can often be found more efficiently by representing them with a combination of exclusive OR (XOR) and conjunctive normal form (CNF) clauses. We propose a hardware accelerator architecture that natively embeds and solves such hybrid XOR–CNF problems using in-memory computing hardware. To achieve this, we introduce an algorithm and demonstrate, both experimentally and through simulations, how it can be efficiently implemented with memristor crossbar arrays. Compared to the conventional approaches that translate XOR–CNF problems to pure CNF problems, our simulations show that the accelerator improves computation speed, energy efficiency, and chip area utilization of in-memory accelerators by ~ 10 × for a set of hard cryptographic benchmarking problems. Moreover, the accelerator achieves a ~ 10 × speedup and a ~ 1000 × gain in energy efficiency over state-of-the-art SAT solvers running on CPUs.

Memristor-based hardware accelerators for artificial intelligence

Article 23 April 2024

Efficient combinatorial optimization by quantum-inspired parallel annealing in analogue memristor crossbar

Article Open access 22 September 2023

Strategies of high-accuracy memristor-based analogue computing in memory for artificial intelligence

Article 11 May 2026

Introduction

The Boolean satisfiability (SAT) problem is a fundamental decision problem that was the first problem to be proven NP-complete^1,2. Solving a SAT problem involves determining whether there is an assignment of Boolean variables satisfying a given propositional logic formula. Many problems in engineering and computer science reduce to SAT problems with a polynomial-time overhead, which then can be tackled with SAT solvers employing local search heuristics or exhaustive search. SAT solvers are thus widely employed in many industry-relevant applications, such as scheduling, planning, cryptanalysis, and integrated circuit design^3,4, as well as being used as the engine for more-general constrained optimization solvers⁵. Yet, due to the computational complexity of SAT problems, the cost of finding solutions could, in the worst case, scale exponentially with the number of variables.

Due to the ubiquity of SAT problems in industrial optimization applications, there is an ongoing effort to improve algorithms for SAT solvers, as well as to develop dedicated hardware accelerators^{6,7,8,9,10,11,12,13,14} that can find solutions faster and more energy efficiently. A promising line of research has been the study of SAT solvers in hybrid problem formulations^15,16,17. SAT problems are typically formulated in conjunctive normal form (CNF), where a set of clauses containing Boolean variables are connected by logical OR operations. However, many applications naturally involve clauses linked by exclusive-OR (XOR) operations, such as channel decoding in wireless receivers¹⁸, model counting¹⁵, circuit fault testing³, and cryptographic decoding attacks¹⁹. These problems can be formulated natively as hybrid XOR–CNF SAT problems containing both CNF and XOR clauses. Although XOR clauses can be reduced to CNF clauses using Tseitin transformations²⁰, doing so introduces a significant performance overhead as it increases the number of variables and clauses in the problem. Hybrid XOR–CNF SAT solvers that support both CNF and XOR clauses have therefore been found to considerably outperform pure CNF SAT solvers^17,21.

While hybrid XOR–CNF solvers have predominantly been implemented as software solutions running on digital computers^16,22, there is potential in harnessing the benefits of native XOR–CNF problem formulations using in-memory hardware accelerators. In-memory computing (IMC), leveraging analog crossbar arrays for low latency and parallel linear algebra computations, is a promising technology for building hardware accelerators²³. IMC accelerators have already demonstrated their ability to enhance both speed and energy efficiency for SAT solvers in the case of pure CNF SAT problems, outperforming conventional CPUs^8,9,24. Combining the advantages of a hybrid XOR–CNF formulation with IMC hardware could offer considerable advantages in tackling computationally challenging SAT problems with inherent XOR clauses. However, compared to pure CNF problems, evaluating XOR clauses requires more complex and energy-intensive circuits that can potentially offset the efficiency and latency advantages of IMC hardware. Moreover, XOR clauses can contain many literals, whereas SAT hardware accelerators can often support only a few literals per clause. For IMC hardware, a large number of literals can also make it more challenging to retain low error rates during computation, as the corresponding analog signals exhibit an increased dynamic range.

Therefore, in this work, we set to address the open question of whether IMC is suitable for accelerating the solving of hybrid XOR–CNF problems efficiently. We present an IMC accelerator architecture that can be used to natively implement and solve hybrid XOR–CNF problems. As part of this architecture, we propose WalkSAT-XNF, an XOR-native implementation of the WalkSAT stochastic local search (SLS) heuristic, where all variables within unsatisfied clauses are candidates for being flipped. We propose an efficient method for XOR–CNF clause evaluation and gradient computation using analog crossbar arrays. To demonstrate feasibility on hardware, we experimentally implement WalkSAT-XNF on crossbar arrays based on TaO_x memristors for a small-scale minimal disagreement parity (MDP) problem. Additionally, we simulate a memristor-based accelerator architecture in a 28 nm complementary metal-oxide-semiconductor (CMOS) process and evaluate the computation speed and energy consumption on benchmarking problems from cryptographic applications including the McEliece–Niederreiter cryptosystem^25,26 and the Advanced Encryption Standard (AES)^27,28. Compared to solving problems in their CNF representation with an IMC accelerator, our approach achieves an order-of-magnitude improvement in computation speed and energy consumption, within a 10 × smaller chip area, by employing hybrid XOR–CNF representations. Furthermore, compared to state-of-the-art SAT solvers running on CPUs, our accelerator solves benchmarking problems with up to 300 variables and 1016 clauses ~ 10 × faster while consuming ~ 1000 × less energy. Our results highlight the potential of IMC accelerators for efficiently implementing hybrid XOR–CNF SAT solvers, enabling native problem representations for solving a variety of complex industry-relevant problems.

Results

Mapping and benchmarking advantages of hybrid XOR–CNF SAT problems over CNF

A SAT problem for a set of Boolean variables x_i ∈ {0, 1} and clauses C_i is given by the conjunction (∧)

$${{\mathcal{F}}}({x}_{1},\ldots,{x}_{n})={C}_{1}\wedge {C}_{2}\wedge \cdots \wedge {C}_{i}.$$

(1)

The problem is said to be satisfiable if an assignment of the Boolean variables exists where all clauses C_j are true. In a CNF representation, each C_j is a clause formed from a disjunction (∨) of literals l_k as C_CNF,j = l_k ∨ ⋯ ∨ l_m, where the literals l_k are either propositions (x_k) or their negations ($\overline{{x}_{k}}$) of the Boolean variables. XORSAT problems, on the other hand, are SAT problems where clauses are formed using XOR operations (⊕) between literals:

$${C}_{{{\rm{XOR}}},j}={l}_{k}\oplus \cdots \oplus {l}_{m}.$$

Problems formulated in XOR-and-OR normal form (XNF) are then hybrid XOR–CNF SAT problems, where the propositional logic formula (1) contains both CNF and XOR clauses. Figure 1a illustrates an XNF instance with three CNF and two XOR clauses. Here, the variable assignment x₁ = 1, x₂ = 0, x₃ = 0, x₄ = 1 guarantees satisfiability. In general, an XOR clause with k literals x₁, …, x_k can be equivalently represented using 2^k−1 CNF clauses, each containing k literals. These clauses represent all possible combinations of an even number of negated variables

$${C}_{{{\rm{X}}}{{\rm{O}}}{{\rm{R}}},j}={\bigwedge }_{{{\rm{e}}}{{\rm{v}}}{{\rm{e}}}{{\rm{n}}}\,{{\rm{n}}}{{\rm{u}}}{{\rm{m}}}{{\rm{b}}}{{\rm{e}}}{{\rm{r}}}\,{{\rm{o}}}{{\rm{f}}}\,{{\rm{\neg }}}} \pm {x}_{1} \vee \cdots \vee \pm {x}_{k},$$

(2)

where ± denotes the possible permutations for propositions (+) of literals or their negations (−). For instance, the first XOR clause in Fig. 1a has the equivalent CNF representation $(\overline{{x}_{1}}\vee \overline{{x}_{2}}\vee {x}_{3})\wedge (\overline{{x}_{1}}\vee {x}_{2}\vee \overline{{x}_{3}})\wedge ({x}_{1}\vee \overline{{x}_{2}}\vee \overline{{x}_{3}})\wedge ({x}_{1}\vee {x}_{2}\vee {x}_{3})$. Translating XOR clauses into CNF clauses incurs an exponential increase in the number of additional clauses, hence making clause evaluation computationally more expensive.

**Fig. 1: Mapping advantages of hybrid XOR–CNF problems over pure CNF problems.**

In practice, this exponential overhead can partly be mitigated by employing the Tseitin transformation²⁰, yet this method provides a clear trade-off between the reduction of overall clauses and the number of additional variables that need to be considered¹⁶. Conversely, translating a SAT problem in CNF representation into an XORSAT problem is generally impossible, though many key SAT applications, such as integer factorization, circuit fault testing⁴, and cryptographic decoding attacks¹⁹, originate from XOR-based logic. In these cases, XOR clauses can be reconstructed from the CNF clauses by reversing the transformation in Eq. (2), typically reducing both clause and variable counts.

We demonstrate the differences between CNF and XNF formulations in Fig. 1 for SAT problems from cryptographic attacks on the McEliece–Niederreiter and AES cryptosystems, as well as instances generated from the minimal disagreement parity (MDP) problem (details of the instances are provided in the Methods section). All instances inherit native XOR clauses but are initially provided with CNF clauses only. We explore two methods of generating hybrid XOR–CNF instances from these original problems. First, we convert directly the CNF instances to the XNF representation employing the cnf2xnf tool within the xnfSAT solver¹⁶. The final representation of this process is denoted by XNF in Fig. 1b. After this conversion, the resulting problems contain 2–43% XOR clauses. Additionally, we employ a SAT preprocessing (PP) tool²⁹ to the CNF instances (generating new instance denoted by CNF-PP in Fig. 1b) before applying the conversion tool to generate XNF instances. The final representation of this process is denoted by XNF-PP in Fig. 1b. Such preprocessing techniques are widely used to compress CNF problem size and to enhance solver performance. Details of the preprocessing procedure and the per-instance preprocessing runtime are reported in Section “Methods”. Figure 1c shows the compression ratio for the number of variables in relation to the original CNF representation. Direct XNF conversion reduces the number of variables by (2.0 ± 0.5)× on average. When applying preprocessing, the average number of variables initially remains almost unchanged ((1.1 ± 0.1)×) but is considerably reduced once the problem has been converted to an XNF representation. The preprocessing followed by XNF conversion achieves a compression ratio of (4.6 ± 1.0)×, on average. We also analyze the compression ratio for the number of clauses in relation to the CNF representation. With direct XNF conversion, we find that the number of clauses is reduced by (3.7 ± 1.2)×, on average. When applying preprocessing to the CNF representation, we again observe a small initial reduction in the number of clauses by (2.0 ± 0.9)×, while conversion of the preprocessed instances to an XNF representation reduces the number of clauses by (5.4 ± 1.8)×, on average, compared to the CNF representation.

These results show the advantages of mapping problems to an XNF representation, with the greatest benefits often observed when combining preprocessing with XNF conversion. Compared to using a pure CNF representation, the resulting reduction in the problem size can enhance SAT solver performance and significantly lowers compute resource requirements^17,21. Moreover, for SAT hardware accelerators, the comparatively smaller XNF instances enable reduced chip sizes and energy consumption. Therefore, these results serve as a strong motivation to develop hardware accelerators capable of supporting both CNF and XOR clauses simultaneously.

WalkSAT-XNF: an XNF-native SAT heuristic compatible with in-memory computing hardware

To leverage the described mapping advantages, we propose a heuristic called WalkSAT-XNF, designed to solve XNF problems in their native form. We then show how this algorithm can be realized efficiently in an accelerator using IMC. WalkSAT-XNF employs a local search heuristic and is inspired by prior work on IMC accelerators for CNF SAT problems⁸. Similar to the widely used WalkSAT solvers^30,31, WalkSAT-XNF computes gradients based on ‘make’ and ‘break’ values. The make value counts the number of violated clauses that become satisfied, while the break value counts the number of satisfied clauses that become violated when flipping a variable. WalkSAT-XNF then flips a variable found in violated clauses that maximizes the value obtained by subtracting the break value from the make value. In contrast to the standard WalkSAT heuristic, WalkSAT-XNF performs a full-neighborhood evaluation, where gradients for all variables present in unsatisfied clauses are considered, as opposed to evaluating only the variables in a randomly chosen violated clause.

Table 1 shows the pseudocode of the WalkSAT-XNF heuristic. The algorithm starts with an initial variable configuration and iteratively searches the space until it finds a solution or reaches the iteration limit. Each iteration computes gradients based on make and break values for all variables by evaluating the clauses in which they appear. A CNF clause is satisfied if at least one literal is true. Hence, the make value is the number of violated clauses containing the variable, as flipping it would satisfy them. The break value, on the other hand, corresponds to the number of satisfied clauses, where the variable is the only true literal, as flipping it would break clause satisfaction. For an XOR clause to be satisfied, an odd number of true literals is required. Thus, the make value corresponds to the number of violated clauses containing the variable, as flipping it would satisfy them. Similarly, break values are equal to the number of satisfied clauses containing the variable. The break value subtracted from the make value yields the gain value, or gradient. After computing the full gradient, Gaussian noise with a standard deviation σ is added to help escape local minima or avoid cycles. The variable with the highest noise-adjusted gain value is then flipped, and the process repeats.

Table 1 WalkSAT-XNF heuristic

Full size table

Figure 1d shows the algorithmic efficiency of WalkSAT-XNF when solving the McEliece, MDP, and AES benchmarking instances using CNF-PP, XNF, and XNF-PP compared to the CNF formulation. We quantify the performance with the iterations-to-solution (ITS₉₉) metric³², defined as

$${{{\rm{ITS}}}}_{99}({{\rm{iter}}}):=\frac{{{\rm{iter}}}\cdot \log 0.01}{\log (1-\theta ({{\rm{iter}}}))}\,,$$

(3)

where θ(iter) is the success probability of solving the problem as a function of iterations. The ITS₉₉ metric estimates the iterations required to observe at least one successful trial with a probability of 99%. Since WalkSAT-XNF stops once a solution is found, an optimized ITS_99opt metric can be obtained by evaluating ITS₉₉ at solution-finding trial lengths within reasonable error bounds. Compared to the CNF formulation, WalkSAT-XNF solves problems using fewer iterations, achieving a median improvement of ~23× (CNF-PP), ~10× (XNF), and ~68× (XNF-PP). The greatest performance gains are observed for preprocessed instances.

In what follows, we thus solely focus on the preprocessed instances for CNF and XNF problems, referring to them simply as CNF and XNF for brevity. Complete benchmarking results for all problem representations are available in Supplementary Note 1.

An in-memory computing accelerator architecture for WalkSAT-XNF

To realize WalkSAT-XNF with IMC hardware, we propose the accelerator architecture depicted in Fig. 2, which shows the steps performed in each iteration of the heuristic (i.e., clause evaluation, make and break value computations, and a variable update) using seven distinct hardware blocks.

**Fig. 2: Hardware architecture for an in-memory XOR–CNF solver accelerator.**

The Boolean variable configuration is initially stored in a register ((1) in Fig. 2). The variables and their respective conjugates are then provided as an input signal to a crossbar array to evaluate violation of the individual CNF and XOR clauses (2). For problems with N variables and C clauses, the crossbar has 2N columns and C rows. The input to the crossbar is applied as binary voltage signals at the columns. Each variable x_j and its negation $\overline{{x}_{j}}$ are mapped to the column pairs {2j, 2j + 1}, while clauses correspond to the rows of the crossbar. Each literal is represented by a binary-valued crossbar connection b_ij ∈ {0, 1} that allows current to flow from a column to a row. Here, positive literals x_j connect rows to columns with even indices 2j, while negative literals $\overline{{x}_{j}}$ connect to columns with odd indices 2j + 1. These connections are facilitated by memory devices at each crossbar that can be switched between an ON and an OFF state, such as resistive random-access memory (RRAM)⁸, static random-access memory (SRAM), or embedded Flash memory cells³³. This crossbar array functions as a C-by-2N matrix, with entries of 1 where literals appear and 0 elsewhere. The output current at each row is then equivalent to a matrix–vector multiplication between the input signal and the array. Using the matrix encoding of the clauses described above, the output signals of the crossbar rows are proportional to the number of true literals in the clauses for the current assignment of variables.

Depending on the clause type, the output signals from the crossbar array are evaluated by the circuits (3) of Fig. 2. These circuits indicate whether a clause is violated and provide the input signals for the subsequent make and break value computations. For XOR clauses, a low-resolution analog-to-digital converter (ADC) with ${\log }_{2}(k)$ bits, where k is the maximum number of literals, performs a parity check using the least-significant bit (LSB). The LSB is provided as input for the break value computation, as it indicates whether the clause is currently satisfied and can be broken by flipping one of its member variables. Conversely, an inversion of the LSB is given as input for the make value computation. For CNF clause evaluations, two comparators⁸ determine if the number of true literals is 0 (for the make value) or 1 (for the break value). The outputs of these comparators are used as input for make and break computations.

The make and break values are computed via a crossbar array (4) that is the transpose of (2). After applying the input signals to the rows, the output signals from related pairs of columns are added to derive the make and break values for each variable. To calculate the break values for CNF clauses, the column outputs are additionally multiplied with the variable configuration using pass transistors to identify true literals. Adding the make and break values from XOR and CNF clauses provides the input signals for the subsequent gradient computation (5). Here, a Gaussian white noise signal σ generated by a pseudo-random number generator (PRNG) in conjunction with an array of digital-to-analog converters (DAC) is added to the make value, and the break values are subtracted from the make values using differential amplifiers to calculate the gradient for each variable. Finally, a winner-takes-all (WTA) circuit identifies the variable with the highest gradient (6) and the output signal is used to update the register state using XOR gates (7).

Crucially, the relative simplicity of WalkSAT-XNF enables us to map every computational step to an equivalent analog circuit, enabling rapid continuous computation. As with other IMC concepts^8,34, the crossbar arrays in Fig. 2 enable parallel gradient computations for both the CNF and XOR clauses within a single clock cycle. Performing an entire operation of WalkSAT-XNF is achieved within just three clock cycles, without the need for a complex control system, while also circumventing frequent time-intensive communication with external co-processors or memory systems. Both XOR and CNF clauses can be evaluated using the same array, allowing for an area-efficient design. Moreover, the crossbar array can implement a number of literals per clause that is equal to the number of variables, hence supporting highly complex clauses common in industry workloads.

Experimental demonstration using RRAM crossbar arrays

As with other mixed-signal computing systems, realizing WalkSAT-XNF in hardware requires it to be sufficiently resilient against hardware non-idealities in the analog circuits. Studies have identified variations in the RRAM cells and noise in the crossbar array’s analog readout circuit as the dominant non-idealities that can result in a deterioration in performance³⁵. To evaluate the feasibility of realizing WalkSAT-XNF in hardware, we implement a hybrid version of the architecture in Fig. 2 on an RRAM crossbar array chip. We experimentally validate the analog computation of clause evaluation and make/break value computation using an RRAM crossbar array chip, while the register, the circuits for checking clause satisfaction, the WTA circuit, and the Gaussian noise injection are emulated on a digital computer. The RRAM chip is a custom CMOS circuit in a 180 nm technology node with back-end-of-the-line (BEOL) monolithically integrated TaO_x 1T1M RRAM cells^36,37. For the experiment, we use the XNF instance derived from the par-8-1-c MDP problem³⁸, consisting of 13 variables and 42 clauses, including one XOR clause. To implement the crossbar’s ON and OFF states b_ij, the RRAM cells are programmed to either a high-resistance state (HRS, or OFF state) or a low-resistance state (LRS, or ON state). Figure 3a shows the conductance values of the RRAM cells after programming. Here, the LRS is set to 100 μS and the HRS is set to 1 μS. Two separate arrays are used for the clause evaluation (array 1) and the make and break value computations (array 2). Figure 3b shows a histogram of the memristor conductances of array 1. The memristors exhibit typical device-to-device variations during programming³⁹, where the LRS and HRS are programmed to have a tolerance of ±10 μS. While further optimization is possible⁴⁰, we find that this accuracy is sufficient for our purposes.

Fig. 3: Experimental demonstration of WalkSAT-XNF on TaOx memristor crossbar arrays. — **Fig. 3: Experimental demonstration of WalkSAT-XNF on TaO_x memristor crossbar arrays.**

To evaluate the capability of this crossbar array to perform clause evaluation (array 1 in Fig. 3a), we supply 400 random variable configurations as input signals and record the output current from the array. Figure 3c shows a histogram of the results, with distributions color-coded by the expected number of satisfied literals (H), showing a clear separation. It is thus possible to infer the number of satisfied literals directly from the array’s analog output signal using the threshold levels indicated by the dotted lines in Fig. 3c with an average error of ~1%. The second array can be used similarly to evaluate the make and break values. We perform the make and break value computations sequentially here, but a parallel, pipelined evaluation is possible by employing two separate crossbar arrays. We then employ the gradient computation as part of the full WalkSAT-XNF heuristic. Figure 3d shows the cumulative success rate for solving par-8-1-c problem instance. We have performed 500 repeats at a noise level of σ = 2.5, where the solver runs for a maximum of 2000 iterations per repeat. The solver consistently finds a satisfying solution within this limit and experimental results align well with ideal (i.e., variation-free and noiseless) simulations despite hardware non-idealities.

We also compare experiments and simulations by varying the noise level σ. To quantify differences in the cumulative success rate, we analyze the iterations-to-solution (ITS_99opt). In Fig. 3e, we show ITS_99opt for different noise levels and compare it against simulation-based results. Our results agree well with the experimental results, within the margin of error of the simulations. Overall, our results demonstrate that WalkSAT-XNF can be implemented using RRAM-based analog IMC hardware. The agreement between experiments and simulations highlights the robustness of the WalkSAT-XNF heuristic to hardware non-idealities, making it well-suited for implementation in custom CMOS circuits. This observation is also supported by a simulation-based sensitivity study, the results of which are presented in Supplementary Note 5. We believe this robustness to be due to the fact that the weights and the input states in our architecture are binary. The results of the crossbar array’s operations are discrete integer values, thereby providing additional robustness against noise, compared to, for example, floating point operations.

Simulation-based benchmarking for a 28 nm RRAM architecture

To evaluate our accelerator architecture illustrated in Fig. 2, we designed and simulated an architecture implementation using TaO_x RRAM crossbar arrays realized in a 28 nm CMOS process. For the simulations, we have derived latency and energy models from detailed circuit simulations and have evaluated them using activity simulations for the different SAT instances in Fig. 1. As our architecture supports both XOR and CNF clauses, we compare the CNF and XNF representations for the same problems on the same accelerator architecture to highlight the advantages for IMC accelerators of converting CNF instances to XNF instances. Figure 4a shows the average area advantage of XNF representations over CNF representations. We define the area advantage as A_XNF/A_CNF, where A is the number of memory cells in the crossbar arrays required for a given benchmarking instance. We find that XNF representations provide a (12.2 ± 4.7)× average area advantage for the crossbar arrays due to there being a reduced number of variables and clauses. This significantly reduces the footprint, thereby enhancing the cost-effectiveness, scalability, and energy efficiency of the accelerator. Figure 4b shows the average energy per iteration of the WalkSAT-XNF heuristic. The median energy uptake for the XNF representation is 36 pJ (interquartile range (IQR): 47 pJ) compared to 107 pJ (IQR: 119 pJ) for the CNF representation, thereby achieving a ~3× improvement in energy efficiency. Figure 4c provides a breakdown of energy consumption across hardware components for a McEliece instance. For the CNF representation with 174 variables and 623 clauses, the average energy per iteration is ~90 pJ. Here, the majority of energy is consumed by the circuits responsible for generating the Gaussian noise signal (PRNG, ~80%), while the second-largest contributor (the clause evaluation array) accounts for only ~9% of the energy uptake. The make and break computation array, the evaluation circuits, and the WTA circuit combined contribute to ~10% of the energy consumption. For the XNF representation with 32 variables and 96 clauses (13 of which are XOR clauses), energy consumption drops to ~33 pJ, that is, only a third of the CNF instance. Moreover, we find that the relative energy contributions between the two representations are notably different as approximately a third of the energy consumption of the XNF representation is dedicated to the clause evaluation circuits. The XOR clause evaluation is energetically more expensive, which accounts for 93% of the energy uptake of the evaluation circuits. Figure 4d shows a comparison of this breakdown for a 16-bit MDP instance. The XNF representation shows lower relative energy consumption by the evaluation circuits compared to Fig. 4c, due to a lower XOR-to-CNF clause ratio (7% in the MDP instance versus 23% in the McEliece instance). Overall, while an XNF representation significantly reduces energy consumption, it introduces a trade-off: problem size reduction increases the number of XOR clauses which are more energy-intensive to evaluate.

**Fig. 4: Energy and area advantages of hybrid XOR–CNF formulations for in-memory hardware accelerators.**

Figure 5 a shows the relative advantage of the time-to-solution (TTS) for the CNF and XNF representations. Here, the TTS is attained by multiplying ITS_99opt with the latency of performing one iteration. We find that, in all instances, the TTS for the XNF instances is improved over the CNF representation with a median advantage of 3.7× (IQR: 22.2). Separated by instance classes, MDP instances show the greatest improvement (546×, IQR: 27,496.2), followed by McEliece (3.7×, IQR: 0.8) and AES (1.7×, IQR: 0.2). A further comparison between the CPU and hardware implementations of WalkSAT-XNF is provided in Supplementary Note 1, highlighting the additional speedups gained through IMC hardware acceleration.

**Fig. 5: Comparison of energy-to-solution and time-to-solution for hybrid XOR–CNF and pure CNF problems.**

To analyze the energy consumption of the accelerator architecture for the different problem representations, we consider the energy-to-solution (ETS). The ETS is calculated by multiplying ITS_99opt with the average energy consumed per iteration. Figure 5b shows the relative ETS advantage of the XNF representation over the CNF representation. We find that energy consumption is improved over CNF with a median of 11.4× (IQR: 65.4). Separated by instance classes, we again observe that the MDP instances benefit most (1644.1×, IQR: 83540.7), followed by McEliece (11.4×, IQR: 3.4) and AES (3.9×, IQR: 0.6).

Beyond this comparison of different problem representations for IMC hardware accelerators, we benchmark our accelerator against SAT solvers running on a CPU. For our benchmarking, the ETS and TTS were measured when running solvers on a 2.6 GHz Xeon CPU, and compared to the results for the XNF instances in Fig. 5. The TTS of the benchmarking solvers is directly derived from the CPU runtime. For the SAT solvers, we consider the SLS-solvers xnfSAT¹⁶ and WalkSAT-SKC³⁰, alongside the conflict-driven clause learning (CDCL) solvers CryptoMiniSat²² and Kissat⁴¹. The xnfSAT and CryptoMiniSat solvers are capable of solving problems in XNF representation and are therefore evaluated with XNF instances (see Supplementary Note 2 for more details). For xnfSAT, we initially noted that performance for preprocessed XNF instances is considerably worse compared to unprocessed XNF instances. To provide the fairest comparison, we therefore decided to evaluate the performance of xnfSAT using the unprocessed XNF instances, while WalkSAT-XNF and CryptoMiniSat were evaluated using the XNF-PP instances. WalkSAT-SKC and Kissat on the other hand support only CNF clauses and were therefore evaluated using the CNF representation of the benchmarking instances.

Figure 6 presents correlation plots comparing TTS and ETS for XNF-native solvers (a) and CNF-native solvers (b) against our WalkSAT-XNF accelerator. Table 2 summarizes the median relative performance. Compared to the best-performing software solver CryptoMiniSat, WalkSAT-XNF improves the median TTS by 9.1× and the ETS by 2.3 ⋅ 10³×. Notably, while our accelerator outperforms CryptoMiniSat for the McEliece instances, most MDP and AES problems are solved faster by CryptoMiniSat. This indicates that the structure of such problems may be more favorable to CDCL-type solvers compared to the SLS heuristic employed in WalkSAT-XNF. However, WalkSAT-XNF demonstrates a smaller ETS in most instances compared to the CDCL-type solvers. We also note that, while WalkSAT-XNF is always able to find a solution, the SLS solvers xnfSAT and WalkSAT-SKC are unable to solve a portion of the MDP instances. Moreover, xnfSAT exhibits a large variance, while WalkSAT-XNF forms distinct clusters for similar class and size instances. This clustering pattern allows for a more stable prediction of performance of similar instances and can likely be attributed to the full-neighborhood evaluation, compared to xnfSAT’s individual clause evaluation.

**Fig. 6: Benchmark of energy-to-solution and time-to-solution against state-of-the-art SAT solvers.**

Table 2 Performance comparison of XOR–CNF and CNF solvers relative to WalkSAT-XNF

Full size table

Discussion

Our results show that IMC hardware accelerators for SAT problems can be enhanced to solve problems in a hybrid XOR–CNF representation, which is the native representation of several industrial optimization problems. By performing parallel gradient computation of XOR and CNF clauses on the same crossbar arrays, our approach enables a fast and energy-efficient hardware implementation of our WalkSAT-XNF heuristic. This allows us to combine the algorithmic advantages of mapping problems to a hybrid XOR–CNF representation with the inherent parallelism and efficiency of IMC hardware.

For SAT problems that can be natively expressed as hybrid XOR–CNF problems, we find that this can reduce the chip area and energy consumption, while also improving the computation speed compared to mapping them to a pure CNF representation. This presents an advantage over existing SAT hardware accelerators, which can solve problems only in pure CNF formulation. When tackling pure CNF problems, the IMC architecture in Fig. 2 has previously demonstrated that it can outperform comparable SAT accelerators (see Supplementary Note 3). As shown in our comparison in Fig. 4, the ability to implement XOR clauses can provide an additional order-of-magnitude improvement in computation speed and energy efficiency.

Moreover, the crossbar array embedding depicted in Fig. 2 can, in principle, support dense XOR and CNF clauses with as many literals as there are variables. Our experimental proof of concept successfully demonstrates this for a hybrid XOR–CNF problem with up to five literals per clause, which can be extended to even more complex clauses. This allows our architecture to additionally leverage the advantages of SAT preprocessing techniques, which tend to trade increased algorithmic efficiency with a higher density of literals per clause (see Table 3). By combining these advantages, we find that our proposed accelerator can outperform state-of-the-art SAT solvers running on digital computers in terms of computation speed and energy consumption.

Table 3 Clause densities across problem representations

Full size table

As energy efficiency becomes an increasing concern in high-performance computing systems for resource-intensive applications such as optimization and artificial intelligence, hybrid XOR–CNF IMC accelerators can reduce operational costs and mitigate environmental impacts. In edge-computing applications, such as channel decoding in wireless receivers or AI route planning in autonomous vehicles, constraints on energy consumption and latency for computing hardware can benefit from fast and energy-efficient SAT accelerators to improve performance while enabling new use cases. Because XOR clauses are native to a wide variety of industry-relevant applications, such as hardware design, cryptanalysis, and telecommunications, we expect that a hybrid XOR–CNF SAT accelerator can provide considerable advantages when solving hard SAT problems.

While CNF and hybrid XOR–CNF instances have been identified as promising use cases for the IMC accelerator, there are also important industrial applications that rely on pure XORSAT problems. Although finding satisfying assignments to XORSAT problems is polynomial in problem complexity and thereby performed efficiently with linear system solvers on digital computers⁶, there is a variety of hard industry-relevant XORSAT problems where the state-of-the-art heuristics rely on XORSAT evaluations, such as error correction¹⁸ or efficiently attacking the McEliece cryptosystem⁴². For such problems, spin glass hardware accelerators have previously been demonstrated that scale exponentially in compute time^6,7 and it is likely that a native XOR–CNF accelerator can improve performance over existing techniques⁴³.

An interesting outcome of our research has been the insight that our proposed WalkSAT-XNF heuristic can benefit considerably from fast preprocessing techniques present in common SAT software libraries. By applying preprocessing to CNF instances before converting them to XNF instances, we have observed significant overall improvements in the number of iterations required to find a solution compared to XNF instances without preprocessing. While the hybrid XOR–CNF solver xnfSAT does not appear to benefit from preprocessing for the benchmarking instances we have studied, WalkSAT-XNF can improve the median TTS and ETS by an order of magnitude.

Although our results show there are clear advantages in using hybrid IMC XOR–CNF SAT accelerators, we envision possible improvements that could further enhance computational performance and relevance to industrial use cases. Our analysis of their energy consumption has identified the generation of noise signals and the evaluation of XOR clauses as targets for improvements. Enhancing the energy efficiency of noise signal generation would be possible by optimizing the PRNG design or by using analog noise sources⁴⁴. Similarly, the circuit used for conducting parity checks could likely be improved, given that only the LSB is needed or that, alternatively, trees of XOR gates can be employed. As we show in Supplementary Note 4, additional energy savings can also be achieved by reducing the resolution of the ADC.

One challenge in realizing performance enhancements for industrial applications pertains to the scalability of IMC hardware. Crossbar arrays are limited in size, for example, by parasitic effects, signal drop-off, and non-idealities, to a few hundred rows and columns. Current IMC hardware capable of dense matrix–vector operations could support the computations in our architecture for SAT problems with up to ~250 variables and ~500 clauses within a single array⁴⁵. To overcome this limitation and increase the capacity for solving larger and more-complex SAT problems, one potential strategy would be to distribute the computational load by partitioning the variables and clauses across multiple crossbar arrays³³. Exploring the implementation of such a multi-array architecture is an essential step in enhancing the scalability and applicability of our solver, opening up the possibility of solving larger and more-complex SAT instances.

The WalkSAT-XNF heuristic is an evolution of the CNF-specific WalkSAT heuristic and does not differentiate between XOR and CNF clauses for the purpose of variable selection. Based on the insights from this work, it could be possible to use IMC hardware for accelerating algorithmically efficient heuristics that include more sophisticated clause differentiation (e.g., by pre-solving the XOR clauses using Gauss–Jordan elimination⁴⁶). Further enhancements can be achieved by combining it with the parallel tempering framework, which has recently been shown to provide performance improvements for IMC architectures with minimal overhead⁴⁷. Finally, high-performance SAT solvers often combine CDCL and SLS heuristics, including XOR subroutines^48,49; our IMC approach could similarly be adapted to accelerate other types of heuristics, including CDCL SAT solvers⁵⁰.

Methods

Benchmarking instances

McEliece–Niederreiter cryptosystem

The McEliece instances are derived from cryptographic attacks^25,51 on the McEliece–Niederreiter cryptosystem^52,53. This cryptosystem was proposed as the first code-based public-key cryptosystem in the 1970s and has been elected by the National Institute of Standards and Technology (NIST) as a quantum-resistant public-key cryptographic algorithm for evaluating post-quantum cybersecurity⁵⁴.

For the encryption and decryption of a cipher, the receiver generates three matrices: the n-by-k generator matrix G typically using Goppa codes; an n-by-n permutation matrix P; and a random k-by-k invertible matrix S. The receiver publishes a public key ${G}^{{\prime} }:=SGP$. The message sender prepares a plaintext message m and creates the ciphertext $y={m}^{T}{G}^{{\prime} }+e$, where e is an error vector with a Hamming weight of t. The receiver then uses an error-correction algorithm⁵⁵ to identify the error vector e and obtains m via G, P, and S. A potential attack on the McEliece cryptosystem involves identifying the error vector e. In particular, the authors in ref. ²⁵ interpret the problem as finding the minimum-weight codeword. Let H be an (n − k) × n matrix, with H_i,j being the (i, j)-th element of the matrix H. The linear system Hc = 0 is then written over the binary field with the XOR logical operator ⊕ . For instance, the i-th equality of Hc = 0 is

$$\begin{array}{lllllllll}{H}_{i,1}{c}_{1} & \oplus & {H}_{i,2}{c}_{2} & \oplus & \cdots & \oplus & {H}_{i,n}{c}_{n} &=& 0.\end{array}$$

(4)

A decoding attack on the system involves finding a solution c to Hc = 0 having the desired Hamming weight.

Based on this attack, the McEliece instances are generated via the PySA package⁵⁶ (further details can be found in refs. ^26,57). Each instance is first generated as a set of XOR equations as shown in Eq. (4). The XOR equations are then translated to CNF clauses, and the Hamming weight of the desired solution c is incorporated using additional CNF clauses. We use 10 CNF instances with a code length equal to 16. We label these instances from McE-i, where i ∈ {0, …, 9}. The numbers of variables and clauses range from 171 to 183, and 611 to 659, respectively.

Minimal disagreement parity problem

The MDP instances are generated from the minimal disagreement parity problem described in ref. ³⁸. Given an m-by-n binary matrix X, a binary vector y of length m, and an integer k, the MDP problem seeks to find a binary vector a ∈ {0, 1}ⁿ satisfying

$$\mathop {\sum }\limits_{i=1}^{m}\left(\left( \mathop {\sum }\limits_{j=1}^{n}{X}_{i,j}{a}_{j}\right)\oplus {y}_{i}\right)\le k.$$

(5)

The difficulty in solving the MDP problem has been explored in the literature, and an algorithm for solving the inequality (5), relying on XOR clauses only, was suggested in ref. ⁵⁸. A total of 15 MDP instances were proposed by Crawford³⁸ and added to the DIMACS library⁵⁹, with the instances translated to a CNF representation. We selected 10 instances, par-8-i-c and par-16-i-c, i ∈ {1, …, 5}, from the DIMACS library⁵⁹, and they can be accessed from ref. ⁵⁷. We labeled these instances p-8-i, p-16-i, where i ∈ {1, …, 5}. The numbers of variables and clauses lie in the ranges [64, 74] and [254, 298] for the par-8-i-c family, and [317, 349] and [1264, 1392] for the par-16-i-c family.

Advanced encryption standard

The Advanced Encryption Standard (AES)^27,28 is a symmetric key encryption algorithm selected by the National Institute of Standards and Technology (NIST). It was developed to replace an older data encryption standard (DES) that was shown to be vulnerable to decryption attacks, particularly with the advent of stronger computational resources. Applications of AES include securing communications for online financial transactions and encrypting data in a database⁶⁰. XOR operations are one of the key components of the encryption process that utilizes the so-called round keys, which are inherent to AES and finding them is indicative of a successful cryptographic attack. Instances pertaining to AES are available in the dataset from the 2012 SAT competition⁶¹, and they can be accessed from ref. ⁵⁷. Solving these problem instances is viewed as a successful cryptographic attack to AES. As mentioned in ref. ⁶¹, these instances inherit XOR operations, but are translated into a CNF representation, making it possible to utilize SAT solvers that operate only CNF clauses. We use instances called aes_32_1_keyfind_i, where i = 1, 2 and label them AES-1 and AES-2 in the benchmarking experiment below. The numbers of variables and clauses are 300 and 1056, respectively.

XNF problem conversion

We provide the details on the conversion process for generating the formulation classes CNF-PP, XNF, and XNF-PP, which illustrated in Fig. 1b. We incorporated CNF preprocessing using PySAT⁶², a Python library designed to work with SAT instances with CNF clauses only. We use PySAT to access the CaDiCaL solver’s preprocessor²⁹. To produce preprocessed CNF instances (denoted by CNF-PP in the figure), the parameter named ‘rounds’ was set to 3, indicating the number of preprocessing rounds. PySAT supports a variety of preprocessing techniques, including blocked clause elimination, covered clause elimination, globally blocked clause elimination, equivalent literal substitution, bounded variable elimination, failed literal probing, hyper binary resolution, clause subsumption, and clause vivification. Details on each technique can be found in ref. ²⁹. All available preprocessing techniques supported by the package were employed, provided by the following parameters: block, cover, condition, decompose, elim, probe, probehbr, subsume, and vivify. The time to process a CNF instance to its preprocessed counterpart CNF-PP ranges approximately from 2 ⋅ 10⁻³ to 1 ⋅ 10⁻² s, with an average time of around 7 ⋅ 10⁻³ s.

To convert an instance in CNF representation into XNF form, we employed the cnf2xnf tool, which is a utility present in the xnfSAT solver¹⁶. The cnf2xnf tool is designed to transform CNF instances by identifying and extracting XOR clauses from given CNF clauses. The resulting hybrid representation retains the structure of the original CNF instance while introducing XOR clauses, making the clauses more compact. The processing time to convert a CNF instance to an XNF instance ranges from ~3 ⋅ 10⁻³ to 3 ⋅ 10⁻² s, with an average time of around 4 ⋅ 10⁻³ s. For converting a CNF instance to XNF form, the processing time ranges from 2 ⋅ 10⁻³ to 4 ⋅ 10⁻³ s, with an average time of around 3 ⋅ 10⁻³ s.

Table 3 presents the average of clause densities of each instance class, where the density is calculated by summing the number of literals in each clause and dividing by the total number of variables. We present further observations regarding the literals per clause densities d_CNF and d_XOR of the XNF and XNF-PP formulation classes in Supplementary Note 1.

Benchmarking of SAT solvers on CPUs

The TTS and ETS of xnfSAT, CryptoMiniSat, WalkSAT-SKC, and Kissat were calculated using an Intel Xeon CPU running at 2.60 GHz with 512 GB of system memory and 128 virtual cores. For the ITS and TTS estimations, the number of trials was set to 1000 by all algorithms and instances in order to obtain a reliable success probability θ⁶³. For CryptoMiniSat, the parameter named ‘maxsol’ was set to 1, quantifying the number of targeted solutions found by the algorithm. The maximum allowed runtime for Kissat was set to 300 s. For WalkSAT-XNF, WalkSAT-SKC, and xnfSAT, each trial was capped at 10⁹ maximum allowed bit flips. The noise parameters used for WalkSAT-XNF were optimized in a grid search for the different problem classes. The optimized parameters are displayed in Table 4a. The computation of the ETS for each solver is outlined in Table 4b.

Table 4 Solver parameters and energy-to-solution (ETS) estimation methodology

Full size table

To estimate the energy consumption of solvers that solely depend on software, 1.5 joules per second (i.e., 1.5 watts) was used. We benchmarked several instances using CryptoMiniSat on an AMD Epyc server while tracking the energy usage using the Powertop package⁶⁴. In all cases, we observed 1.5 watts, which we used as the baseline energy usage for all CPU-based solvers. Of note, the full benchmarking experiments were performed on Intel Xeon CPUs running at 2.80 GHz with 90 GB of RAM and 64 logical cores on the Google Cloud Platform (GCP), on which it is not possible to measure the energy directly. We believe our estimate of 1.5 watts is conservative, as the per-core thermal design can have a higher power ceiling.

Hardware accelerator energy modeling

The components of the hardware architecture in Fig. 2 have been designed, validated, and modeled in a TSMC 28 nm technology node. The crossbar array is modeled for a BEOL integrated RRAM device using TaO_x memristors based on data from previously fabricated test chips³⁷. The output currents at the bit lines are detected and processed using transimpedance amplifiers with active common-drain feedback. For CNF clauses, output signals are evaluated with comparators based on a StrongARM latch architecture. For the XOR clause evaluation, we model the energy consumption of the ADCs based on a regression analysis of the ADC survey data in refs. ^65,66. Based on the maximum number of literals for the benchmarking problems (see Table 3), we assume an ADC bit resolution of 4 bits, which can support clauses with up to 15 literals. For an ADC with a sampling rate of 900 million samples per second and a bit resolution of 4 bits, we estimate an energy consumption per operation of 0.718 pJ and an area of 3.9 ⋅ 10⁻³ mm².

The Gaussian noise signal is generated from an XORSHIFT-64 PRNG using the Alias method. The normal-distributed random number sequence generated by the PRNG is converted to analog signals using R2R ladder DACs at each bit line of the gradient evaluation crossbar ((4) in Fig. 2). The WTA circuit is realized using voltage-controlled delay lines, whose output is evaluated using merger trees and arbiters. The one-hot encoded output of the WTA circuit is fed into an array of XOR gates, whose other input is the current variable configuration stored in the register. The output is used to set the new state of the register.

The circuit is driven and synchronized by a central clock signal, where the signal provided by the register sequentially progresses through the individual circuit blocks shown in Fig. 2. A single iteration of WalkSAT-XNF is performed in three clock cycles. During the first clock cycle, the signals are applied to the first crossbar array and the output signals are analyzed using the readout circuit. During the second clock cycle, the second crossbar array is operated in the same way. During the third clock cycle, the WTA operation is performed, and the register state is updated. The combined latency of these components per iteration of WalkSAT-XNF was modeled as taking t_iter = 6 ns. Once the register is initialized, the entire circuit will continuously repeat this flow until a predefined number of iterations is reached or until a satisfying solution has been identified. Additional details about the circuit designs and the hardware parameters can be found in ref. ⁸.

From these modeling results, a semi-analytical model has been derived, which evaluates the energy consumption of the individual components based on average signal levels and activity patterns. For the benchmarking, we have built a custom cycle-accurate simulator that derives instance-specific activity patterns and signal levels when running the WalkSAT-XNF heuristic. Using the semi-analytical model, we derive the mean energy consumption for each instance without the need for extensive SPICE-like simulations, which would be intractable. We derived the mean energy consumption per iteration of the WalkSAT-XNF heuristic E_mean/iter for each instance and calculated the energy to solution as ETS = E_mean/iter ⋅ ITS.

Experimental validation of the WalkSAT-XNF heuristic on memristor crossbar arrays

The experimental setup used to realize our IMC architecture comprises a custom chip fabricated in a TSMC 180 nm technology node and houses three 64-by-64 memristor crossbar arrays. The 1T1M cells are based on Ta/TaO_x/Pt RRAM that was monolithically integrated in-house in a BEOL process. To perform in-memory computations, the chip contains digital control and analog sensing circuits. Input signals to each array’s word line are applied digitally and the analog output is reconstructed using the ‘shift and add’ method⁶⁷. To convert and measure the signals from the array’s bit lines, transimpedance amplifiers and sample-and-hold circuits are employed that rapidly convert the output currents to voltage signals and sample them. The signals are then converted to digital signals using ADCs. The chip is hosted on a custom-printed circuit board, which facilitates the voltage supply to the chip and provides a digital interface to access, control, and program the individual crossbar arrays. Additional details about the layout and the fabrication of the chip may be found in ref. ³⁶. For the implementation of the WalkSAT-XNF heuristic, a custom Python program was written that performs the matrix operations in Fig. 2 on the crossbar arrays. Here, the matrices in Fig. 3a were programmed into two of the chip’s arrays. During the matrix operations, the binary input signals are communicated to the chip and the output signals are measured and returned via the digital interface. For the clause evaluation, the number of true literals is inferred from the output signal using equidistant quantization levels. These levels have been optimized to yield the lowest error rate.

Data availability

The benchmarking instances used in this study are available in ref. ⁵⁷.

Code availability

The simulator used for the heuristic simulation and energy modeling is open-sourced and available at https://github.com/HewlettPackard/CountryCrab.

References

Cook, S. A. The complexity of theorem proving procedures. In Proceedings of the Third Annual ACM Symposium, 151–158 (ACM, 1971).
Levin, L. A. Universal sequential search problems. Probl. Peredachi Inf. 9, 115–116 (1973).
Google Scholar
Larrabee, T. Test pattern generation using boolean satisfiability. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 11, 4–15 (1992).
Article ADS Google Scholar
Knuth, D. E. The Art of Computer Programming, Volume 4, Fascicle 6: Satisfiability (Addison-Wesley Professional, 2015).
Perron, L. & Didier, F. CP-SAT. https://developers.google.com/optimization/cp/cp_solver (2024).
Kowalsky, M., Albash, T., Hen, I. & Lidar, D. A. 3-regular three-xorsat planted solutions benchmark of classical and quantum heuristic optimizers. Quantum Sci. Technol. 7, 025008 (2022).
Article ADS Google Scholar
Nikhar, S., Kannan, S., Aadit, N. A., Chowdhury, S. & Camsari, K. Y. All-to-all reconfigurability with sparse and higher-order ising machines. Nat. Commun. 15, 8977 (2024).
Article CAS PubMed PubMed Central ADS Google Scholar
Pedretti, G. et al. Solving boolean satisfiability problems with resistive content addressable memories. npj Unconv. Comput. 2, 7 (2025).
Article Google Scholar
Sharma, A., Burns, M., Hahn, A. & Huang, M. Augmenting an electronic ising machine to effectively solve boolean satisfiability. Sci. Rep. 13, 22858 (2023).
Article CAS PubMed PubMed Central ADS Google Scholar
Zhang, Q. et al. A stochastic analog sat solver in 65nm CMOS achieving 6.6μs average solution time with 100% solvability for hard 3-sat problems. In 2024 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits) (IEEE, 2024).
Shim, C., Bae, J. & Kim, B. 30.3 VIP-Sat: a Boolean satisfiability solver featuring 5 × 12 variable in-memory processing elements with 98% solvability for 50-variables 218-clauses 3-SAT problems. In 2024 IEEE International Solid-State Circuits Conference (ISSCC), 486–488 (IEEE, 2024).
Xie, S. et al. 29.2 Snap-SAT: a one-shot energy-performance-aware all-digital compute-in-memory solver for large-scale hard boolean satisfiability problems. In 2023 IEEE International Solid- State Circuits Conference (ISSCC), 420–422 (IEEE, 2023).
Kim, D., Rahman, N. M. & Mukhopadhyay, S. PRESTO: a processing-in-memory-based k -SAT solver using recurrent stochastic neural network with unsupervised learning. IEEE J. Solid State Circuits 59, 2310–2320 (2024).
Article ADS Google Scholar
Bhattacharya, T. et al. A fully integrated mixed-signal compute-in-memory accelerator for solving arbitrary order boolean satisfiability problems. In 2024 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits) (IEEE, 2025).
Soos, M., Gocht, S. & Meel, K. S. Tinted, detached, and lazy cnf-xor solving and its applications to counting and sampling. In International Conference on Computer Aided Verification, 463–484 (Springer, 2020).
Nawrocki, W., Liu, Z., Fröhlich, A., Heule, M. J. H. & Biere, A. Xor local search for boolean brent equations. In SAT, vol. 12831 of Lecture Notes in Computer Science, (eds, Li, C.-M. & Manyá, F.) 417–435 (Springer, 2021).
Andraschko, B., Danner, J. & Kreuzer, M. Sat solving using xor-or-and normal forms. Math. Comput. Sci. 18, 1–26 (2024).
Article MathSciNet Google Scholar
Nandi, A., Chakrabartty, S. & Thakur, C. S. Margin propagation based xor-sat solvers for decoding of ldpc codes. In IEEE Transactions on Communications (IEEE, 2024).
Bellini, E. et al. New records of pre-image search of reduced sha-1 using sat solvers. In Proceedings of the Seventh International Conference on Mathematics and Computing: ICMC 2021, 141–151 (Springer, 2022).
Tseitin, G. S. On the Complexity of Derivation in Propositional Calculus, 466–483 (Springer Berlin Heidelberg, 1983).
Nawrocki, W., Liu, Z., Fröhlich, A., Heule, M. J. & Biere, A. XOR local search for boolean brent equations. In Theory and Applications of Satisfiability Testing–SAT 2021: 24th International Conference, Barcelona, Spain, July 5-9, 2021, Proceedings 24, 417–435 (Springer, 2021).
Soos, M., Nohl, K. & Castelluccia, C. Extending SAT solvers to cryptographic problems. In Theory and Applications of Satisfiability Testing - SAT 2009, 12th International Conference, SAT 2009, Swansea, UK, June 30 - July 3, 2009. Proceedings, vol. 5584 of Lecture Notes in Computer Science (ed. Kullmann, O.) 244–257 (Springer, 2009).
Sebastian, A., Le Gallo, M., Riduan, K.-A. & Evangelos, E. Memroy devices and applications for in-memory computing. Nat. Nanotechnol. 15, 529–544 (2020).
Article CAS PubMed ADS Google Scholar
Zhu, C., Rucker, A. C., Wang, Y. & Dally, W. J. SatIn: Hardware for boolean satisfiability inference. Preprint at https://arxiv.org/abs/2303.02588 (2023).
Canteaut, A. & Chabaud, F. A new algorithm for finding minimum-weight words in a linear code: application to mceliece’s cryptosystem and to narrow-sense bch codes of length 511. IEEE Trans. Inf. Theory 44, 367–378 (1998).
Article MathSciNet ADS Google Scholar
Mandrà, S., Munoz-Bauza, H., Mossi, G. & Rieffel, E. G. Generating hard ising instances with planted solutions using post-quantum cryptographic protocols. Fut. Gener. Comput. Syst. 166, 107721 (2025).
Daemen, J. & Rijmen, V. The Design of Rijndael : AES - The Advanced Encryption Standard, 1st edn. Information Security and Cryptography (Springer Berlin Heidelberg, 2002).
Kamal, A. A. & Youssef, A. M. Applications of SAT solvers to AES key recovery from decayed key schedule images. In 2010 Fourth International Conference on Emerging Security Information, Systems and Technologies, 216–220 (IEEE, 2010).
Biere, A. et al. CaDiCaL 2.0. In Gurfinkel, A. & Ganesh, V. (eds.) Computer Aided Verification - 36th International Conference, CAV 2024, Montreal, QC, Canada, July 24-27, 2024, Proceedings, Part I, vol. 14681 of Lecture Notes in Computer Science, 133–152 (Springer, 2024).
Selman, B., Kautz, H. & Cohen, B. Noise strategies for improving local search. Proceedings of the National Conference on Artificial Intelligence, 1 (ACM, 1999).
Russell, S. & Norvig, P. Artificial Intelligence: A Modern Approach, 3 edn (Prentice Hall, 2010).
Aramon, M. et al. Physics-inspired optimization for quadratic unconstrained problems using a digital annealer. Front. Phys. 7, 48 (2019).
Article Google Scholar
Bhattacharya, T., Hutchinson, G. H., Pedretti, G. & Strukov, D. Ho-fpia: High-order field-programmable ising arrays with in-memory computing. In 2024 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 252–259 (IEEE, 2024).
Bhattacharya, T. et al. Computing high-degree polynomial gradients in memory. Nat. Commun. 15, 8211 (2024).
Article CAS PubMed PubMed Central ADS Google Scholar
Heittmann, A., Hizzani, M. & Strachan, J. P. Impact of variability compensation on the performance of an rram-based 3-sat solver. In 2025 IEEE International Symposium on Circuits and Systems (ISCAS), 1–5 (IEEE, 2025).
Li, C. et al. Cmos-integrated nanoscale memristive crossbars for cnn and optimization acceleration. In 2020 IEEE International Memory Workshop (IMW), 1–4 (IEEE, 2020).
Sheng, X. et al. Low-conductance and multilevel cmos-integrated nanoscale oxide memristors. Adv. Electron. Mater. 5, 1800876 (2019).
Article Google Scholar
Crawford, J. M., Kearns, M. J. & Schapire, R. E. The minimal disagreement parity problem as a hard satisfiability problem. In Computational Intell (Research Lab and AT&T Bell Labs TR, 1994).
Pedretti, G., Ambrosi, E. & Ielmini, D. Conductance variations and their impact on the precision of in-memory computing with resistive switching memory (rram). In 2021 IEEE International Reliability Physics Symposium (IRPS), 1–8 (IEEE, 2021).
Rao, M. et al. Thousands of conductance levels in memristors integrated on cmos. Nature 615, 823–829 (2023).
Article CAS PubMed ADS Google Scholar
Biere, A. arminbiere/kissat: Release 4.0.0. https://github.com/arminbiere/kissat (2024).
Stern, J. A New Identification Scheme Based on Syndrome Decoding, 13–21 (Springer Berlin Heidelberg, 1994).
Dobrynin, D. et al. Energy landscapes of combinatorial optimization in Ising machines. Phys. Rev. E 110, 045308 (2024).
Article MathSciNet CAS PubMed ADS Google Scholar
Cai, F. et al. Power-efficient combinatorial optimization using intrinsic noise in memristor Hopfield neural networks. Nat. Electron. 3, 409–418 (2020).
Article Google Scholar
Ambrogio, S. et al. An analog-ai chip for energy-efficient speech recognition and transcription. Nature 620, 768–775 (2023).
Soos, M. & Meel, K. S. Gaussian Elimination Meets Maximum Satisfiability. In Proceedings of the 18th International Conference on Principles of Knowledge Representation and Reasoning (IJCAI Organization, 2025).
Zhang, X. et al. Parallel tempering–inspired distributed binary optimization with in-memory computing. Phys. Rev. 23, 034031 (2025).
Soos, M., Devriendt, J., Gocht, S., Shaw, A. & Meel, K. S. Cryptominisat with CCAnr at the sat competition 2020. SAT COMPETITION 2020, 27 (2020).
Google Scholar
Soos, M., Selman, B., Kautz, H., Devriendt, J. & Gocht, S. Cryptominisat with walksat at the sat competition 2020. SAT COMPETITION 2020, 29 (2020).
Lo, M., Chang, M.-C. F. & Cong, J. SAT-Accel: A modern sat solver on a FPGA. In Proceedings of the 2025 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA ’25, 234–246 (Association for Computing Machinery, 2025).
Bernstein, D. J., Lange, T. & Peters, C. Attacking and defending the mceliece cryptosystem. In Post-Quantum Cryptography (eds Buchmann, J. & Ding, J.) 31–46 (Springer Berlin Heidelberg, 2008).
McEliece, R. J. A public-key cryptosystem based on algebraic coding theory. Deep Space Netw. Prog. Rep. 44, 114–116 (1978).
ADS Google Scholar
Niederreiter, H. Knapsack-type cryptosystems and algebraic coding theory. Prob. Contr. Inform. Theory 15, 157–166 (1986).
MathSciNet Google Scholar
National Institute of Standards and Technology. Post-quantum cryptography candidates to be standardized and round 4 of the nist post-quantum cryptography standardization process. https://csrc.nist.gov/news/2022/pqc-candidates-to-be-standardized-and-round-4 (2022).
Patterson, N. The algebraic decoding of goppa codes. IEEE Trans. Inf. Theory 21, 203–207 (1975).
Article MathSciNet ADS Google Scholar
Mandra, S. et al. PySA: fast simulated annealing in native Python. https://github.com/nasa/pysa (2023).
Im, H. et al. Dataset for accelerating hybrid xor-cnf sat problems natively with in-memory computing. Zenodo data repository, https://doi.org/10.5281/zenodo.18235974 (2026).
Chen, J. XORSAT: an efficient algorithm for the dimacs 32-bit parity problem. Preprint at https://arxiv.org/abs/cs/0703006 (2007).
Dimacs instance repository. http://archive.dimacs.rutgers.edu/pub/challenge/sat/benchmarks/cnf/ (2000).
M.P., B. & Babu, K. R. Secure cloud storage using aes encryption. In 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT), 859–864 (IEEE, 2016).
Balint, A. et al. (eds.) Proceedings of SAT Challenge 2012 : Solver and Benchmark Descriptions (University of Helsinki, 2012). https://api.semanticscholar.org/CorpusID:199587264.
Ignatiev, A., Morgado, A. & Marques-Silva, J. PySAT: a Python toolkit for prototyping with SAT oracles. In Theory and Applications of Satisfiability Testing – SAT, 428–437 (SAT, 2018).
Noori, M., Valiante, E., Vaerenbergh, T. V., Mohseni, M. & Rozada, I. Statistical analysis for per-instance evaluation of stochastic optimizers: Avoiding unreliable conclusions. Phys. Rev. Appl. https://link.aps.org/doi/10.1103/2fpj-t663 (2026).
van de Ven, A. et al. Powertop. https://github.com/fenrus75/powertop. Version 2.15 (2022).
Murmann, B. ADC Performance Survey 1997-2024. Available: https://github.com/bmurmann/ADC-survey (2025).
Andrulis, T., Chen, R. Lee, H.-S. Emer, J. S. & Sze, V. Modeling Analog-Digital-Converter Energy and Area for Compute-In-Memory Accelerator Design. arXiv https://arxiv.org/abs/2404.06553 (2024).
Shafiee, A. et al. ISAAC: a convolutional neural network accelerator with in-situ analog arithmetic in crossbars. ACM SIGARCH Comput. Architecture N. 44, 14–26 (2016).
Article Google Scholar

Download references

Acknowledgements

The authors thank our editor, Marko Bucyk, for his careful review and editing of the manuscript, and Dmitri Strukov for discussions on XOR hardware architectures. This material is based upon work supported by the Defense Advanced Research Projects Agency (DARPA) through Air Force Research Laboratory Agreement No. FA8650-23-3-7313. The views, opinions, and/or findings expressed are those of the author(s) and should not be interpreted as representing the official views or policies of the Department of Defense or the U.S. Government.

Author information

These authors contributed equally: Haesol Im, Fabian Böhm.

Authors and Affiliations

1QB Information Technologies (1QBit), Vancouver, BC, Canada
Haesol Im, Noriyuki Kushida, Moslem Noori, Elisabetta Valiante, Xiangyi Zhang, Chan-Woo Yang & Ignacio Rozada
HPE Labs, Hewlett Packard Enterprise, Brussels, Belgium
Fabian Böhm & Thomas Van Vaerenbergh
HPE Labs, Hewlett Packard Enterprise, Milpitas, CA, USA
Giacomo Pedretti, Xia Sheng, Jim Ignowski, Masoud Mohseni & Raymond Beausoleil
University of California, Santa Barbara, CA, USA
Tinish Bhattacharya
Peter Grünberg Institute (PGI-14), Forschungszentrum Jülich GmbH, Jülich, Germany
Arne Heittmann & John Paul Strachan
RWTH Aachen University, Aachen, Germany
John Paul Strachan

Authors

Haesol Im
View author publications
Search author on:PubMed Google Scholar
Fabian Böhm
View author publications
Search author on:PubMed Google Scholar
Giacomo Pedretti
View author publications
Search author on:PubMed Google Scholar
Noriyuki Kushida
View author publications
Search author on:PubMed Google Scholar
Moslem Noori
View author publications
Search author on:PubMed Google Scholar
Elisabetta Valiante
View author publications
Search author on:PubMed Google Scholar
Xiangyi Zhang
View author publications
Search author on:PubMed Google Scholar
Chan-Woo Yang
View author publications
Search author on:PubMed Google Scholar
Tinish Bhattacharya
View author publications
Search author on:PubMed Google Scholar
Xia Sheng
View author publications
Search author on:PubMed Google Scholar
Jim Ignowski
View author publications
Search author on:PubMed Google Scholar
Arne Heittmann
View author publications
Search author on:PubMed Google Scholar
John Paul Strachan
View author publications
Search author on:PubMed Google Scholar
Masoud Mohseni
View author publications
Search author on:PubMed Google Scholar
Raymond Beausoleil
View author publications
Search author on:PubMed Google Scholar
Thomas Van Vaerenbergh
View author publications
Search author on:PubMed Google Scholar
Ignacio Rozada
View author publications
Search author on:PubMed Google Scholar

Contributions

H.I. and F.B. contributed equally to this work and are recognized co-first authors. H.I. and F.B. wrote the manuscript. H.I., N.K., and T.B. performed algorithm designs. M.N. and E.V. analyzed the numeric results. H.I., X.Z., and C.-W.Y. conducted the corresponding numeric benchmarking simulation. A.H. performed circuit and architectural simulations. X.S., J.I., and J.P.S. contributed to the memristor fabrication and experimental system development. G.P. and T.V.V. conceived the idea of asserting XOR clauses with in-memory computing. F.B. derived the hardware architecture, conducted the hardware modeling and energy simulations, and performed the hardware experiments. I.R. conceived the main idea of the XOR–CNF use case. I.R., T.V.V., J.P.S., M.M., and R.B. supervised and led the collaboration effort. All authors analyzed and discussed the results.

Corresponding author

Correspondence to Ignacio Rozada.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Xueqing Li and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Transparent Peer Review file (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Im, H., Böhm, F., Pedretti, G. et al. Accelerating hybrid XOR–CNF Boolean satisfiability problems natively with in-memory computing. Nat Commun 17, 2922 (2026). https://doi.org/10.1038/s41467-026-69465-2

Download citation

Received: 10 April 2025
Accepted: 02 February 2026
Published: 19 February 2026
Version of record: 27 March 2026
DOI: https://doi.org/10.1038/s41467-026-69465-2

Subjects

Abstract

Similar content being viewed by others

Memristor-based hardware accelerators for artificial intelligence

Efficient combinatorial optimization by quantum-inspired parallel annealing in analogue memristor crossbar

Strategies of high-accuracy memristor-based analogue computing in memory for artificial intelligence

Introduction

Results

Mapping and benchmarking advantages of hybrid XOR–CNF SAT problems over CNF

WalkSAT-XNF: an XNF-native SAT heuristic compatible with in-memory computing hardware

An in-memory computing accelerator architecture for WalkSAT-XNF

Experimental demonstration using RRAM crossbar arrays

Simulation-based benchmarking for a 28 nm RRAM architecture

Discussion

Methods

Benchmarking instances

McEliece–Niederreiter cryptosystem

Minimal disagreement parity problem

Advanced encryption standard

XNF problem conversion

Benchmarking of SAT solvers on CPUs

Hardware accelerator energy modeling

Experimental validation of the WalkSAT-XNF heuristic on memristor crossbar arrays

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information (download PDF )

Transparent Peer Review file (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links