Probabilistic greedy algorithm solver using magnetic tunneling junctions for traveling salesman problem

Zhang, Ran; Li, Xiaohan; Wan, Caihua; Hoffmann, Raik; Hindenberg, Meike; Xu, Yingqian; Liu, Shiqiang; Kong, Dehao; Xiong, Shilong; He, Shikun; Vardar, Alptekin; Dai, Qiang; Gong, Junlu; Sun, Yihui; Zheng, Zejie; Kämpfe, Thomas; Yu, Guoqiang; Han, Xiufeng

doi:10.1038/s41467-025-66864-9

Article
Open access
Published: 04 December 2025

Nature Communications volume 17, Article number: 189 (2026)

Abstract

Combinatorial optimization underpins applications in artificial intelligence, logistics, and network design, yet classical techniques such as greedy search and dynamic programming struggle to balance efficiency and solution quality at scale. We present a probabilistic framework that embeds true random number generators based on spin-transfer-torque magnetic tunnel junctions into a greedy solver. Intrinsic stochastic switching enables configurable random number distributions, which we use to inject controlled randomness via a temperature parameter that interpolates between deterministic and stochastic choices, balancing exploration and exploitation. Applied to the traveling salesman problem, the framework yields high-quality tours and outperforms simulated annealing and genetic algorithms in solution quality and convergence speed. In larger instances with up to 70 cities, it maintains its advantage, reaching near-optimal solutions with fewer iterations and reduced computational cost. These results show that hardware true randomness with tunable statistics can improve heuristic search and motivate integrated, energy-efficient probabilistic hardware for scalable optimization.

Introduction

Combinatorial optimization is a cornerstone of modern computational science, playing a pivotal role in domains ranging from artificial intelligence and machine learning^1,2 to logistics^3,4,5 and operations research⁶. The objective is to identify an optimal configuration from a finite but exponentially large set of possibilities, where even modest increases in problem size can render classical methods impractical due to the exponential growth of computational complexity⁷. While deterministic algorithms such as dynamic programming⁸ and branch-and-bound⁹ have proven effective for small-scale problems, they often fail to scale efficiently to larger scenarios or escape local optima when confronted with the complex landscapes of combinatorial spaces¹⁰.

In recent years, there has been a paradigm shift towards incorporating randomness into optimization algorithms^11,12,13, leading to an emerging class of techniques termed stochastic or probabilistic optimization^{14,15,16,17,18,19,20}. Methods such as simulated annealing, genetic algorithms, and Monte Carlo simulations have demonstrated the potential of randomness to diversify search strategies, enabling algorithms to explore solution spaces more comprehensively and escape local minima. However, the efficacy of these methods is highly dependent on the quality and configurability of the random number generators (RNGs) employed^13,21,22. Traditional RNGs, whether pseudo-random or hardware-based, often lack the flexibility required to dynamically adjust their distribution characteristics, limiting their adaptability to different optimization scenarios.

A promising development in this field is the utilization of magnetic tunneling junctions (MTJs) as a source of true random numbers, enabling true random number generators (TRNGs), which exploit inherent physical randomness rather than deterministic algorithmic processes^23,24. MTJs, typically used in non-volatile memory technologies^25,26,27,28, exhibit probabilistic switching behavior that can be finely tuned by external control parameters such as voltage or magnetic field strength. This inherent stochasticity—referred to as hardware randomness due to its direct physical origin—makes MTJ-based TRNGs uniquely suited for probabilistic computing^{29,30,31,32,33}, where the randomness can be directly mapped onto computational processes. The ability to configure the probability distribution of an MTJ-based TRNG—effectively creating probabilistic bits (p-bits), binary units defined by probabilistic rather than deterministic states—enables an alternative approach to algorithm design, where the degree of randomness can be adjusted in real time to influence decision-making processes^34,35.

Prior conceptual and simulation‑level frameworks, such as the SPINBIS spintronics‑based Bayesian inference engine built on MTJ stochastic bit‑stream generators³⁶, and spin‑orbit‑torque‑based Bayesian reasoning hardware³⁷, have demonstrated the feasibility of MTJ‑based probabilistic inference in data‑fusion and inference tasks. Distinctively, our work goes beyond these precedents by experimentally embedding MTJ‑based, probability‑distribution‑configurable TRNGs into a probabilistic greedy algorithm for TSP optimization. This hybrid hardware‑algorithm co‑design is, to our knowledge, a representative fully experimental demonstration of Bayesian‑PDF‑matched probabilistic optimization for combinatorial problems.

In this study, we propose an advanced optimization framework that leverages MTJ-based TRNGs to solve complex combinatorial problems. Specifically, we introduce a probabilistic greedy algorithm for the traveling salesman problem (TSP)^{22,38,39,40,41} – a canonical example in combinatorial optimization – to showcase the potential of this approach. The TSP challenges a solver to find the shortest possible route that visits a given set of cities and returns to the starting point, and it is well known for its non-deterministic polynomial-hardness (NP-Hardness). By incorporating MTJ-based TRNGs into the decision-making process, we can modulate the selection strategy for the next city, transitioning smoothly between deterministic greedy choices and purely random selection. This dynamic adaptability enables the algorithm to effectively balance exploration and exploitation, thereby improving its ability to find high-quality solutions efficiently.

Results and Discussion

Figure 1 presents a detailed characterization of the MTJ-based TRNG employed in this study. The resulting R-H hysteresis loops, shown in Fig. 1b, reveal clear and sharp switching between high and low resistance states, confirming the stability and reproducibility of the MTJ’s magnetic switching behavior. The MTJ has a high tunnel magnetoresistance (TMR) ratio of ~175%, which is essential for ensuring reliable and distinct resistance states. The MTJ’s resistance switching behavior under current pulses is illustrated in Fig. 1c.

**Fig. 1: Characterization of the performance of a TRNG based on an MTJ.**

By applying a series of current pulses, we observed stochastic switching of the free layer’s magnetization, resulting in resistance changes. This stochastic behavior serves as the basis for the MTJ-based TRNG. To further analyze the switching probability, Fig. 1d plots the probability of switching as a function of the applied write voltage. The experimental data (green circles) show a gradual increase in switching probability (P_sw) with increasing voltage (V), which is accurately captured by the fitted sigmoidal curve (black solid line). The fitting parameters b and c indicate the sharpness and offset of the curve.

$${P}_{{{\rm{sw}}}}=\frac{1}{1+{e}^{-b(V+c)}}$$

This relationship P_sw(V) is crucial, as it enables precise control over the probability distribution of generated random numbers. Figure 1e demonstrates the results of continuous resistance measurements at three fixed voltages: 0.275 V, 0.282 V, and 0.288 V, corresponding to P_sw of 25%, 48%, and 81%, respectively. These measurements confirm that the device can achieve consistent and repeatable switching behavior, with well-defined probability at each voltage level. The ability to finely tune the switching probability by adjusting the voltage is a key advantage of MTJ-based TRNGs, allowing for the generation of random numbers with specific statistical properties tailored to various probabilistic algorithms. While achieving fine-grained control over switching probabilities typically requires highly precise voltage tuning, we adopt a hybrid control scheme previously validated in ref. ⁴², where switching probability is regulated via pulse-width modulation rather than analog amplitude adjustment. This digital control method ensures consistent sigmoidal switching behavior across devices, even when target probabilities are closely spaced. Moreover, the use of a self-stabilizing feedback mechanism compensates for device variation and drift, allowing robust and scalable probability distribution generation without relying on high-resolution voltage sources.
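The sigmoidal relation above can be turned into a simple calibration routine. The sketch below is a minimal illustration, not the authors' fitting procedure: it estimates b and c with a straight-line least-squares fit of logit(P_sw) against V, using only the three operating points quoted in the text, and then inverts the curve to find the voltage for a target probability. The function names `fit_sigmoid`, `p_sw`, and `voltage_for` are hypothetical, and a three-point fit gives only a rough estimate of the parameters behind Fig. 1d.

```python
import math

# Operating points quoted in the text: (write voltage in V, switching probability).
POINTS = [(0.275, 0.25), (0.282, 0.48), (0.288, 0.81)]

def fit_sigmoid(points):
    """Fit P = 1 / (1 + exp(-b (V + c))) via a linear fit of logit(P) against V."""
    xs = [v for v, _ in points]
    ys = [math.log(p / (1.0 - p)) for _, p in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx                 # slope of the logit line (sharpness)
    c = mean_y / b - mean_x       # intercept = b * c, hence this offset
    return b, c

def p_sw(v, b, c):
    """Switching probability at write voltage v."""
    return 1.0 / (1.0 + math.exp(-b * (v + c)))

def voltage_for(p, b, c):
    """Invert the sigmoid: write voltage that yields switching probability p."""
    return math.log(p / (1.0 - p)) / b - c

b, c = fit_sigmoid(POINTS)
for v, p in POINTS:
    print(f"V = {v:.3f} V  measured {p:.2f}  fitted {p_sw(v, b, c):.2f}")
```

With these inputs the fitted offset c comes out negative, because the 50% point sits near 0.28 V and the sigmoid is written in terms of V + c.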

Figure 2 showcases the versatility of the MTJ-based TRNG in generating random numbers with configurable probability distributions. The schematic diagram in Fig. 2a illustrates the experimental setup, where multiple MTJs are connected to the NI PXIe system through a probe card and adapter board (Supplement I). This configuration enables the simultaneous measurement of multiple MTJs, allowing for efficient data collection and parallel testing of different devices.

**Fig. 2: Probability-distribution-configurable TRNGs based on multiple MTJs.**

Figure 2b displays the random numbers generated by the MTJ-based TRNGs. The generated values align closely with the expected Gaussian distribution, as evidenced by the smooth bell-shaped curve. This Gaussian-distributed randomness is achieved by carefully adjusting the write voltage of the MTJs, demonstrating the flexibility of the TRNG in producing specific distributions. The transformation from binary Bernoulli TRNGs into a probability-distribution-function-configurable TRNG, modeled as a Bayesian network, is described in detail in Supplement II.
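Supplement II is not reproduced here, so the following is only one plausible reading of how biased Bernoulli bits can be chained into a sampler for an arbitrary discrete PDF. Each bit of the outcome index is drawn with a probability conditioned on the bits already drawn, which is a chain-structured Bayesian network. The names `build_conditionals`, `sample_index`, and `mtj_bit` are hypothetical, and Python's `random` module stands in for the MTJ hardware.

```python
import random

def build_conditionals(pdf):
    """Map each bit prefix to P(next bit = 1 | prefix) for a PDF over up to 2**n outcomes."""
    n_bits = max(1, (len(pdf) - 1).bit_length())
    padded = list(pdf) + [0.0] * ((1 << n_bits) - len(pdf))
    table = {}

    def fill(prefix, lo, hi):
        if hi - lo == 1:
            return
        mid = (lo + hi) // 2
        total = sum(padded[lo:hi])
        # Probability mass in the upper half, given that we are inside [lo, hi).
        table[prefix] = sum(padded[mid:hi]) / total if total > 0 else 0.0
        fill(prefix + (0,), lo, mid)
        fill(prefix + (1,), mid, hi)

    fill((), 0, len(padded))
    return n_bits, table

def sample_index(n_bits, table, bernoulli):
    """Draw one outcome with n_bits sequential biased coin flips (most significant bit first)."""
    prefix = ()
    index = 0
    for _ in range(n_bits):
        bit = 1 if bernoulli(table[prefix]) else 0
        prefix += (bit,)
        index = 2 * index + bit
    return index

rng = random.Random(0)  # software stand-in for the stochastic MTJ (assumption)

def mtj_bit(p):
    """Return True with probability p, as an MTJ biased to P_sw = p would."""
    return rng.random() < p

n_bits, table = build_conditionals([0.1, 0.2, 0.3, 0.4])
print(n_bits, "bits,", len(table), "conditional probabilities")
```

In this construction the bias of each flip depends on the earlier bits, so a hardware version would need the switching probability to be reprogrammed between flips.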

To quantitatively evaluate the accuracy of the generated distributions, Fig. 2c presents the error analysis, where the left axis represents the Kullback-Leibler (KL) divergence and the right axis represents the mean squared error (MSE). The KL divergence measures the difference between the experimentally generated distribution and the theoretical Gaussian distribution, while the MSE quantifies the average deviation of the generated values from the expected mean and variance. Both metrics indicate minimal errors, confirming the high fidelity of the MTJ-based TRNG in replicating desired distributions.
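As a concrete illustration of the two error metrics, the sketch below computes a KL divergence and a mean squared error between an empirical histogram and a target probability mass function. The exact binning and the MSE definition behind Fig. 2c are not given in the text, so both choices here are assumptions: KL is taken as D(empirical || target) in nats, and MSE is taken bin-by-bin between the two PMFs. The function names are hypothetical.

```python
import math
from collections import Counter

def empirical_pmf(samples, n_bins):
    """Relative frequency of each integer outcome 0 .. n_bins - 1."""
    counts = Counter(samples)
    total = len(samples)
    return [counts.get(k, 0) / total for k in range(n_bins)]

def kl_divergence(p, q):
    """D_KL(p || q) in nats; bins with p_k = 0 contribute nothing.

    Requires q_k > 0 wherever p_k > 0.
    """
    return sum(pk * math.log(pk / qk) for pk, qk in zip(p, q) if pk > 0)

def mse(p, q):
    """Mean squared difference between two PMFs defined on the same bins."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) / len(p)

measured = empirical_pmf([0, 0, 1, 3], n_bins=4)
target = [0.25, 0.25, 0.25, 0.25]
print(kl_divergence(measured, target), mse(measured, target))
```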

The neighbor correlation of the generated random numbers is analyzed in Fig. 2d, where the color intensity represents the sample point density. The nearly uniform distribution of points and the presence of concentric circles indicate negligibly weak neighboring correlation, signifying that the generated random numbers are statistically independent. Our STT-MTJs are not low-barrier devices, and each random number is generated by a reset-sampling cycle; therefore, correlation between neighboring random numbers is not an issue here. This property is essential for ensuring that the TRNG can produce high-quality random numbers suitable for applications requiring true randomness, such as probabilistic algorithms and cryptographic operations.
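A numerical counterpart to the scatter plot of neighboring samples is the lag-1 correlation coefficient. The sketch below is a generic check rather than the analysis used for Fig. 2d: it computes the Pearson correlation between each sample and its successor, with software Gaussian samples standing in for measured TRNG output. The name `lag1_correlation` is hypothetical.

```python
import math
import random

def lag1_correlation(xs):
    """Pearson correlation between each sample and the one that follows it."""
    a, b = xs[:-1], xs[1:]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

rng = random.Random(1)  # software stand-in for measured samples (assumption)
independent = [rng.gauss(0.0, 1.0) for _ in range(20000)]
print(f"lag-1 correlation: {lag1_correlation(independent):+.4f}")
```

A value close to zero is consistent with uncorrelated neighbors, whereas a strictly alternating sequence would give a value of -1.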

Figure 2e–g demonstrate the capability of the TRNG to generate random numbers following various probability distributions. Figure 2e presents a uniform distribution, where each value has an equal probability of being sampled. Figure 2f shows an exponential decay distribution, characterized by a high probability for smaller values and a rapidly decreasing probability for larger values. Finally, Fig. 2g illustrates a user-defined arbitrary distribution, highlighting the flexibility of the TRNG in generating custom probability profiles. This configurability is critical for integrating the TRNG into a wide range of applications, from stochastic optimization to artificial intelligence, where diverse probability distributions are needed to guide decision-making processes. The probabilistic nature of the algorithm inherently mitigates the influence of occasional transient faults or soft errors in random number generation. Additionally, the embedded self-calibration and stabilization routines further ensure robustness by identifying and correcting persistent anomalous switching behaviors at the hardware level. It is important to note that the experimentally demonstrated capability of generating random numbers with configurable and dynamically tunable probability distributions (Fig. 2) directly supports the probabilistic selection mechanism required by our greedy algorithm in solving the TSP (Fig. 3). At each iterative step of the algorithm, the MTJ-based TRNG efficiently provides random samples precisely matching the dynamically updated probability distribution defined by Eq. (1). This intrinsic alignment between the MTJ-based TRNG capabilities and the probabilistic selection mechanism greatly enhances both the solution quality and computational efficiency, clearly distinguishing our approach from traditional algorithms that rely on fixed or less-flexible random number generators.

**Fig. 3: Probability of selecting the next city under different temperature (k_BT) conditions.**

Maintaining precise switching probability control in large-scale or on-chip implementations is crucial. In our recent work⁴², we demonstrated a scalable hybrid control approach in which pulse-width modulation (PWM) replaces the need for high-resolution analog voltage tuning. By adjusting the duration of fixed-amplitude pulses using simple digital logic, the system effectively maintains the desired probability distribution across devices, even under thermal drift and process variation. This strategy significantly reduces the requirement for high-resolution DACs or ADCs, making the architecture well-suited for scalable on-chip integration.

Compared to pseudo-random number generators (PRNGs), MTJ-based TRNGs offer significant advantages in probabilistic computing applications, particularly through their ability to directly generate and dynamically tune probability distributions in real time. Unlike deterministic PRNGs, which typically require additional computational overhead for mapping uniform random outputs into desired distributions, MTJ-based TRNGs inherently produce physically-generated randomness with precisely controllable statistics via simple external parameters (e.g., applied voltage or pulse width). This feature significantly reduces complexity and latency, while also offering enhanced parallelism and power efficiency in hardware implementations.

While this work primarily focuses on distribution configurability, we emphasize that the random bitstreams generated by our MTJs have been rigorously validated⁴². In that study, we conducted comprehensive statistical evaluations using the NIST SP800-22 test suite, confirming that the TRNG outputs exhibit high entropy and pass all standard randomness tests without requiring post-processing. The same device architecture and control protocols were employed in this work, ensuring that the TRNGs used for TSP solving maintain equivalent statistical quality. Additional statistical evaluation of the TRNG-generated bitstreams is provided in Supplement III. The current bitstream generation rate in our experimental setup is limited by the speed of the NI PXIe data acquisition system, operating at approximately 500 kHz per MTJ. However, the intrinsic switching times of STT-MTJs allow for much faster operation. With high-speed peripheral circuits and optimized on-chip integration, generation rates approaching the GHz range are feasible, as reported in recent high-speed TRNG demonstrations⁴³. This positions our MTJ-based TRNGs as suitable candidates for future high-throughput probabilistic computing applications.

Figure 3 presents the probabilistic greedy algorithm’s mechanism for selecting the next city in the traveling salesman problem (TSP) under varying temperature conditions. The algorithm utilizes the MTJ-based TRNG to generate random numbers that influence the city selection process, allowing for a probabilistic adjustment of the greedy strategy. The selection probability P_i+1($\overline{N}$) of the next city $\overline{N}$ is a function of the distance d_ij between the current city N and $\overline{N}$ as well as a temperature parameter k_BT as shown in Eq. (1).

$$\begin{aligned}{P}_{i+1}({\overline{N}}_{i})&=(1-{b}_{i})\exp (-{d}_{N{\overline{N}}_{i}}/{k}_{B}T)/Z\\ Z&=\sum\limits_{i=1}^{8}(1-{b}_{i})\exp (-{d}_{N{\overline{N}}_{i}}/{k}_{B}T)\end{aligned}\tag{1}$$

Here b_i indicates the accessibility of the i^th city: b_i = 1 once the i^th city has been visited, and b_i = 0 if it is still to be visited. Thus, the final probability of choosing a specific route is P = ∏P_i+1($\overline{N}$)∝exp[-(∑d_ij)/k_BT] and, straightforwardly, the shortest route S = (∑d_ij)_min has the highest probability of being experimentally sampled. This feature assures the convergence of this probabilistic greedy algorithm. More details can be found in Supplement V.
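Eq. (1) can be written as a short function. The sketch below is a minimal software rendering under stated assumptions: `distances[i]` holds the distance from the current city to city i, `visited[i]` plays the role of b_i, and the function name `next_city_probabilities` is hypothetical. Subtracting the smallest open distance before exponentiating is a numerical-stability choice that cancels in the ratio and does not change the probabilities.

```python
import math

def next_city_probabilities(distances, visited, kBT):
    """Eq. (1): P_i = (1 - b_i) * exp(-d_i / kBT) / Z, with b_i = 1 for visited cities."""
    open_distances = [d for d, b in zip(distances, visited) if not b]
    if not open_distances:
        raise ValueError("all cities have already been visited")
    d_min = min(open_distances)  # shift for numerical stability; cancels in P_i
    weights = [
        0.0 if b else math.exp(-(d - d_min) / kBT)
        for d, b in zip(distances, visited)
    ]
    z = sum(weights)  # normalization constant Z
    return [w / z for w in weights]

# Hypothetical distances from the current city; city 1 has already been visited.
probs = next_city_probabilities([10.0, 20.0, 30.0], [False, True, False], kBT=10.0)
print([round(p, 3) for p in probs])
```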

It is worth noting that the decision of choosing the next city relies on a probabilistic sampling operation according to the series of probabilities P_i+1($\overline{N}$) with $\overline{N}$ being the city indices to be visited. This probability-distribution-function (PDF) defined by P_i+1($\overline{N}$) changes dynamically step by step, which calls for a random number generator that can output random numbers according to the time-variant PDFs. Fortunately, our TRNGs with configurable PDFs match this requirement well.

When k_BT approaches zero, the algorithm operates as a deterministic greedy algorithm, always selecting the closest city to the current one. In this regime, the probability of choosing the closest city is nearly 100%, leading to rapid but potentially suboptimal solutions due to the algorithm’s inability to escape local minima.

Conversely, when k_BT is extremely high, the selection probability for each remaining city becomes nearly uniform, leading to a selection process similar to a random walk. This behavior encourages exploration of the solution space but at the cost of reduced efficiency in converging to high-quality solutions. The optimal performance is observed at intermediate k_BT-values, where the algorithm effectively balances exploration (randomness) and exploitation (favoring shorter distances), allowing it to escape local optima and discover near-optimal solutions with high probability. Practical considerations regarding frequent adjustments of MTJ-based TRNG distributions have shown minimal overhead. This is primarily because reconfigurations involve only modest voltage adjustments via a small set of parameters. Furthermore, implementing parallel and pipeline operations effectively reduces latency, ensuring these hardware-level adjustments do not significantly affect the overall algorithm performance.
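The two limits can be checked numerically. The sketch below uses hypothetical distances and the exponential weighting of Eq. (1) with no visited cities, and reports the Shannon entropy of the selection distribution as a simple measure of how random the choice is. The names `selection_probs` and `entropy_bits` are illustrative only.

```python
import math

def selection_probs(distances, kBT):
    """Boltzmann weights over candidate cities, normalized to sum to one."""
    d_min = min(distances)  # shift for numerical stability; cancels in the ratio
    weights = [math.exp(-(d - d_min) / kBT) for d in distances]
    z = sum(weights)
    return [w / z for w in weights]

def entropy_bits(probs):
    """Shannon entropy in bits; 0 means deterministic, log2(n) means uniform."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

distances = [120.0, 150.0, 300.0, 450.0]  # hypothetical distances, arbitrary units
for kBT in (1.0, 50.0, 1e6):
    p = selection_probs(distances, kBT)
    print(f"kBT = {kBT:>9}: P(nearest) = {p[0]:.3f}, entropy = {entropy_bits(p):.3f} bits")
```

For four candidates the entropy ranges from near 0 bits in the greedy limit to near 2 bits in the random-walk limit, with intermediate k_BT values falling in between.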

It is important to acknowledge existing epsilon-greedy methods commonly used in reinforcement learning and optimization tasks⁴⁴, which similarly balance exploration (randomness) and exploitation (optimal choice). However, our proposed MTJ-based probabilistic greedy algorithm significantly diverges from traditional epsilon-greedy methods in several critical aspects. Unlike epsilon-greedy algorithms, which typically employ a fixed probability (ε) to introduce uniformly random choices, our algorithm continuously and dynamically updates a probability distribution for city selection at every step, based on distances and the adjustable temperature parameter k_BT (Eq. (1)). This dynamic adjustment provides a more nuanced and context-sensitive trade-off between exploration and exploitation. Moreover, the direct hardware-based randomness offered by MTJ-based TRNGs facilitates immediate, real-time, and computationally efficient generation of precisely tuned probability distributions, eliminating the computational overhead associated with transforming uniformly distributed pseudo-random numbers into desired distributions. Consequently, our method achieves superior encoding efficiency, algorithmic flexibility, and scalability, representing a substantial advancement beyond traditional epsilon-greedy approaches.
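The contrast with epsilon-greedy can be made concrete by comparing the two selection distributions for the same candidate set. The sketch below assumes a textbook epsilon-greedy rule (probability 1 - ε for the best option plus a uniform ε share for every option) and the distance-weighted rule of Eq. (1); the distances and function names are hypothetical.

```python
import math

def epsilon_greedy_probs(distances, epsilon):
    """Textbook epsilon-greedy: uniform epsilon share, remainder to the nearest city."""
    n = len(distances)
    best = min(range(n), key=distances.__getitem__)
    probs = [epsilon / n] * n
    probs[best] += 1.0 - epsilon
    return probs

def boltzmann_probs(distances, kBT):
    """Distance-weighted selection in the form of Eq. (1), with no visited cities."""
    d_min = min(distances)
    weights = [math.exp(-(d - d_min) / kBT) for d in distances]
    z = sum(weights)
    return [w / z for w in weights]

distances = [10.0, 20.0, 40.0]  # hypothetical distances to three candidate cities
print("epsilon-greedy:", [round(p, 3) for p in epsilon_greedy_probs(distances, 0.3)])
print("Boltzmann:     ", [round(p, 3) for p in boltzmann_probs(distances, 10.0)])
```

Under epsilon-greedy the two non-nearest cities are equally likely regardless of distance, while the distance-weighted rule still prefers the closer of the two.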

Figure 4 provides experimental results demonstrating the application of the MTJ-based TRNGs in solving the TSP using the probabilistic greedy algorithm. Figure 4a depicts the map of the Burma14 TSP problem (n = 14, where n denotes the problem size), where the solid line indicates the known optimal solution, and the dashed line represents the best solution obtained using a classic greedy algorithm. The probabilistic greedy algorithm, driven by the MTJ-based TRNG, consistently identifies paths that are closer to the optimal solution, as shown by the reduced total distance metrics. Figure 4b illustrates the variation in total distance across a range of k_BT values from 1 to 400. The orange dashed line marks the known optimal solution, while the green, orange, and red solid lines connect the maximum, minimum, and average total distances, respectively, obtained at each k_BT value. The results indicate that the algorithm achieves optimal or near-optimal solutions when k_BT is within the range of 40 to 60, highlighting the significance of selecting an appropriate temperature parameter to balance the probabilistic selection strategy.

**Fig. 4: Hardware test results for solving the Burma14 problem using the Probabilistic Greedy Algorithm.**

Figure 4c further investigates the distribution of solution distances at six selected k_BT values, showing improved performance and reaching the optimal solution when k_BT is between 40 and 60. This analysis underscores the robustness of the probabilistic greedy algorithm in finding high-quality solutions when driven by suitably tuned randomness. Figure 4d examines the relationship between the best path distance and the number of iterations (where one iteration is defined as a single complete solution route, visiting each city exactly once) for four selected k_BT values. When k_BT = 60, the optimal solution is achieved within 1000 iterations, demonstrating the efficiency of the algorithm in converging to high-quality solutions. Figure 4e presents a scatter plot of the best solutions obtained across the k_BT range and density distribution plots of solutions within 0, 50 and 100 kilometers of the known optimal solution, further validating the algorithm’s effectiveness. For visibility, the density for the 0-kilometer case is scaled by a factor of 50. The appearance of a clear peak shape in the distribution plots indicates the existence of an optimal k_BT, highlighting the algorithm’s sensitivity to temperature parameters in achieving high-quality solutions.
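The iteration loop described above, in which each iteration builds one complete route and the best route so far is retained, can be sketched in software as follows. This is an illustrative reconstruction, not the hardware implementation: `random.Random` replaces the MTJ-based TRNG, the four-city coordinates are hypothetical rather than Burma14 data, and all function names are assumptions.

```python
import math
import random

def tour_length(tour, dist):
    """Length of the closed route, including the return to the start city."""
    return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

def sample_tour(dist, kBT, rng, start=0):
    """Build one route by repeatedly sampling the next city with Eq. (1) weights."""
    tour = [start]
    unvisited = [c for c in range(len(dist)) if c != start]
    while unvisited:
        cur = tour[-1]
        d_min = min(dist[cur][c] for c in unvisited)
        weights = [math.exp(-(dist[cur][c] - d_min) / kBT) for c in unvisited]
        nxt = rng.choices(unvisited, weights=weights, k=1)[0]
        tour.append(nxt)
        unvisited.remove(nxt)
    return tour

def solve(dist, kBT, iterations, seed=0):
    """Run the given number of iterations and keep the shortest route found."""
    rng = random.Random(seed)  # software stand-in for the TRNG (assumption)
    best_tour, best_len = None, math.inf
    for _ in range(iterations):
        tour = sample_tour(dist, kBT, rng)
        length = tour_length(tour, dist)
        if length < best_len:
            best_tour, best_len = tour, length
    return best_tour, best_len

# Hypothetical instance: four cities on the corners of a unit square.
cities = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
dist = [[math.dist(a, b) for b in cities] for a in cities]
print(solve(dist, kBT=0.5, iterations=200))
```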

The solver still works well when the number of cities is increased significantly. Figure 5 illustrates the simulated results obtained with the st70 problem, offering a comprehensive comparison of our algorithm against other established methods. In the map shown for the st70 problem, the optimal solution is indicated by a solid line, serving as a benchmark for evaluating the performance of different algorithms. The st70 problem, with its 70 cities, presents a considerable computational challenge, making it an ideal test case for demonstrating the efficacy of both heuristic and exact algorithms. The comparison of time and space complexity among the algorithms (Brute Force, Dynamic Programming, Genetic Algorithm, Simulated Annealing (SA), Greedy Algorithm, and Probabilistic Greedy Algorithm) clearly highlights the benefits of heuristic methods. While exhaustive approaches like Brute Force and Dynamic Programming struggle with scalability as n increases, heuristic algorithms, particularly the Probabilistic Greedy and Genetic algorithms, strike a balance between computational efficiency and solution quality. This contrast is evident from the results where n = 70 is used, showcasing the advantage of these more advanced approaches when tackling larger problems. As the number of iterations grows, the quality of the solutions improves, particularly when varying the thermal fluctuation parameter k_BT. Across different values (k_BT = 0.1, 1.3, 2.0, and 3.0), the results indicate that the algorithm’s performance is highly sensitive to this parameter, with intermediate values (e.g., 1.3) leading to faster convergence. The gradual improvement in path quality with increasing iterations underscores the algorithm’s ability to refine its solution over time.

When comparing different heuristic approaches—Genetic Algorithm, Simulated Annealing (SA), Greedy Algorithm, and Probabilistic Greedy Algorithm—the results reveal that incorporating stochastic elements, as seen in the Probabilistic Greedy Algorithm, significantly enhances performance. It is worth noting that while the SA can only evaluate the situation of exchanging two cities at a sample, the Probabilistic Greedy Algorithm can take all the remaining cities into account for a single sampling owing to the arbitrary PDF configurability of our MTJ-TRNGs. Thus, the latter can deal with a higher entanglement degree, which accounts for its faster convergence speed. By avoiding local optima, the probabilistic variant consistently outperforms the classic Greedy Algorithm, especially in later iterations, demonstrating its potential for yielding superior solutions. Moreover, our MTJ-based probabilistic framework can also benefit advanced metaheuristics such as Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO). By integrating hardware-level randomness, these metaheuristics can systematically enhance their exploration strategies, potentially leading to improved solution quality and faster convergence due to more effective escape from local optima.

**Fig. 5: Simulated results with more cities and comparison with other algorithms.**

We note that the parameter k_BT, which governs the shape of the exponential probability distribution, is currently selected empirically for each problem instance. While effective in practice, developing a more systematic or adaptive strategy for temperature tuning—analogous to annealing schedules or meta-optimization—remains an important avenue for future work, particularly to enhance scalability and generalizability.

Finally, the schematic in Fig. 5e illustrates the core design of a TSP solver based on an MTJ array. This hardware-based solver taps into the inherent randomness of the MTJ array, which can be efficiently reused to generate random numbers of any required length. This modularity and scalability make the MTJ array particularly well-suited for probabilistic algorithms like Simulated Annealing and Probabilistic Greedy Algorithm, enhancing the solver’s adaptability for larger and more complex TSP instances. The ability to expand the random number generation capability of the MTJ array according to an arbitrarily customized PDF without sacrificing performance is a crucial innovation, positioning this design as a versatile and efficient solution for hardware-accelerated optimization tasks.

To provide a transparent system-level comparison, we summarize key performance metrics of our prototype platform versus projected FPGA/ASIC implementations in Table 1. The prototype, implemented with 100 nm STT-MTJs controlled by NI-PXIe instrumentation, operates at ~0.5 MHz and requires ~20 ms per 14-city TSP solution, with energy dominated by instrumentation overhead. In contrast, projections based on published MTJ switching speeds (≤ 10 ns)⁴⁵ and intrinsic switching energies (fJ–pJ per event)³⁵ indicate that a dedicated FPGA or ASIC could achieve sub-millisecond or even sub-0.1 ms solution times with microjoule-level energy consumption. This analysis highlights the large performance headroom available for integrated spintronic probabilistic solvers.
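As a rough consistency check on these projections, one can assume (this is not stated in the text) that the ~20 ms prototype solution time is dominated by sequential sampling cycles at the ~0.5 MHz instrumentation rate. The symbols N_cycles and t_proj are introduced here for illustration only. The implied cycle count and the corresponding time at a 10 ns switching period are then:

$$N_{\text{cycles}}\approx 20\,\text{ms}\times 0.5\,\text{MHz}=10^{4},\qquad t_{\text{proj}}\approx 10^{4}\times 10\,\text{ns}=0.1\,\text{ms}$$

Under that assumption, switching times at or below 10 ns would be in line with the sub-millisecond to sub-0.1 ms range quoted above. Peripheral-circuit latency is ignored in this estimate.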

Table 1 Benchmark comparison between prototype and projected MTJ-based probabilistic TSP solvers

Furthermore, we note that the step-by-step probabilistic decision-making process in our algorithm bears strong resemblance to the autoregressive sampling mechanism employed in large language models (LLMs), where each token is sampled based on a dynamically updated softmax distribution. This conceptual alignment suggests a future direction for integrating MTJ-based probabilistic hardware with AI inference and generation tasks that involve structured randomness.

This paper presents a distinct probabilistic greedy algorithm that utilizes the stochastic properties of MTJ-based TRNGs to solve complex combinatorial optimization problems. By integrating MTJ-based TRNGs with the PDF reconfigurability into the optimization framework, we can dynamically adjust the degree of randomness in the decision-making process, allowing the algorithm to strike an optimal balance between exploration and exploitation. This capability is achieved through the control of a temperature parameter, which modulates the randomness level and enables the algorithm to adapt its strategy based on the problem state.

The effectiveness of the proposed approach is demonstrated through extensive experimentation on the traveling salesman problem. Our results show that the probabilistic greedy algorithm consistently achieves superior performance compared to classical methods such as simulated annealing and genetic algorithms, both in terms of solution quality and convergence speed. When applied to larger problem instances in simulation, the algorithm exhibits excellent scalability and robustness, maintaining a competitive edge even as the number of cities increases to 70. For significantly larger-scale problems involving hundreds or thousands of nodes, practical implementations may require adaptive parameter tuning strategies, parallelization techniques, or decomposition of the problem into manageable subproblems. Nonetheless, there are no fundamental limitations preventing the scalability of our proposed method. The key advantage of this approach lies in its ability to dynamically modulate randomness through the MTJ-based TRNG, which enhances the algorithm’s capacity to escape local optima and discover near-optimal solutions efficiently.

In the current implementation, the temperature-like parameter k_BT, which governs the exploration-exploitation trade-off, is determined empirically for each problem instance. While this heuristic approach is effective in practice, developing a systematic or adaptive tuning strategy remains an important direction for future work.

Additionally, although we demonstrate hardware results on medium-scale TSP instances (e.g., Burma14), the results for larger problems (e.g., st70) are obtained through algorithm-level simulations. These simulations validate the algorithmic scalability of our approach, while the underlying hardware design—requiring only log₂N MTJs for encoding an N-choice distribution—offers intrinsic architectural advantages for future large-scale implementations. It is worth noting that, besides the logarithmic scaling of MTJ count (O(log₂N)), our probabilistic greedy algorithm requires only O(N) auxiliary memory for conditional probability parameters, which remains significantly more favorable than the O(N²) parameter storage required in Boltzmann or Ising machines.
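The scaling claims can be illustrated with a few lines of arithmetic. The sketch below assumes that an N-choice distribution is encoded as a binary index of ceil(log₂N) bits, one MTJ per bit, and that the conditional probabilities form a binary tree over those bits; both are readings of the text rather than details confirmed by it. The N² column is shown only as the reference scale mentioned above for Boltzmann or Ising machines. Function names are hypothetical.

```python
def mtj_count(n_choices):
    """Bits needed to index n_choices outcomes, i.e. ceil(log2(n_choices))."""
    return max(1, (n_choices - 1).bit_length())

def conditional_parameter_count(n_choices):
    """Internal nodes of a binary tree with 2**bits leaves: one probability per node."""
    return (1 << mtj_count(n_choices)) - 1

for n in (14, 70):
    print(
        f"N = {n:>2}: {mtj_count(n)} MTJs, "
        f"{conditional_parameter_count(n)} conditional parameters, "
        f"N^2 = {n * n}"
    )
```

For N = 14 this gives four MTJs, which matches the four devices reported in the Methods for the Burma14 experiment.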

The integration of MTJ-based TRNGs offers a promising direction for developing hardware-accelerated optimization frameworks, with potential applications extending beyond TSP to other NP-hard problems. This framework can be readily generalized to other combinatorial optimization problems, such as graph coloring or scheduling tasks, by redefining the cost function and adjusting the temperature parameter accordingly. A concrete adaptation of the probabilistic greedy framework to the graph coloring problem is presented in Supplement VI. Future work will explore the integration of these TRNGs into parallel and distributed computing architectures, as well as their combination with advanced machine learning models to further expand the capabilities of probabilistic optimization methods. This research establishes a solid foundation for leveraging hardware-level stochasticity in computational algorithms, offering additional possibilities for tackling complex optimization challenges with greater efficiency and effectiveness^4.
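As a rough illustration of what redefining the cost function can look like, the sketch below applies the same temperature-weighted greedy step to graph coloring. It is not the adaptation presented in Supplement VI. The cost (number of already-colored neighbors sharing a color), the Boltzmann weighting, and all function names are assumptions made for this example.

```python
import math
import random


def probabilistic_greedy_coloring(adj, n_colors, kBT, rng=random):
    """Color vertices one by one with a temperature-weighted greedy rule.

    ASSUMPTION: the cost of a color is the number of already-colored
    neighbors that share it; this is illustrative and not taken from the
    paper's supplement.
    """
    coloring = {}
    for v in sorted(adj):
        cost = [sum(1 for u in adj[v] if coloring.get(u) == c)
                for c in range(n_colors)]
        if kBT <= 0:
            coloring[v] = cost.index(min(cost))  # deterministic greedy
            continue
        c_min = min(cost)  # shift for numerical stability
        weights = [math.exp(-(x - c_min) / kBT) for x in cost]
        coloring[v] = rng.choices(range(n_colors), weights=weights, k=1)[0]
    return coloring


def count_conflicts(adj, coloring):
    """Number of edges whose two endpoints share a color."""
    return sum(1 for v in adj for u in adj[v]
               if u > v and coloring[u] == coloring[v])
```

Only the cost list changes relative to a TSP step: distances to candidate cities are replaced by conflict counts for candidate colors.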

While the current study emphasizes the feasibility and statistical behavior of MTJ-assisted probabilistic solvers, we acknowledge that absolute runtime, energy, and area efficiency have not been characterized in this work. Our experimental platform involves instrument-level control and is not representative of a fully integrated solution. Future efforts will focus on ASIC- or FPGA-based implementations to enable rigorous evaluation of system-level performance metrics, leveraging the intrinsic speed and low-power characteristics of spintronic devices. Beyond traditional combinatorial problems, our framework also holds potential for accelerating probabilistic AI models, such as autoregressive generators, by serving as a hardware-compatible platform for structured random sampling.

Methods

The stack structure of the employed STT-MTJ devices^46,47,48, as depicted in Fig. 1a, is, from top to bottom, capping/CoFeB/Mo/CoFeB/MgO/CoFeB/Mo/[Co/Pt]_n-based synthetic antiferromagnetic structure/Seed/SiO₂. The multilayer films were deposited by magnetron sputtering on a thermally oxidized silicon substrate under a vacuum of 10⁻⁶ Pa. Following deposition, the films were annealed at high temperature in an external magnetic field perpendicular to the film plane. The devices were then patterned into cylindrical STT-MTJs using standard lithography and etching processes. Magneto-transport measurements of the fabricated devices were conducted using an Hprobe H3DM tester. The samples were subsequently connected to a Keysight B1500A semiconductor analyzer and an NI PXIe system through a probe card and adapter board, with experimental control and data acquisition handled through a Python-based interface. This setup enabled precise electrical measurements and switching probability characterization of the STT-MTJs, providing a reliable platform for evaluating their performance as TRNGs.

To ensure stable operation and mitigate the impact of device-to-device variations and long-term drift, we previously developed self-stabilizing techniques and pulse-width modulation strategies for MTJ-based TRNGs. These methods allow each MTJ to autonomously correct its switching probabilities, ensuring consistent random number generation across large-scale device arrays without frequent manual calibration^42,49. Specifically, for the TSP solver implementation reported herein (e.g., the Burma14 problem), four MTJs were used to generate the required configurable random distributions. All MTJ devices used exhibited consistent sigmoid-shaped switching probability curves with stable and reproducible behavior, as characterized and verified prior to algorithmic integration.
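A sigmoid-shaped switching curve can be inverted to find the drive amplitude that yields a desired switching probability. The sketch below assumes a logistic form with two hypothetical fit parameters, `v50` (amplitude at 50% switching) and `width` (slope scale). The text only states that the curves are sigmoid-shaped, so both the functional form and any numerical values used with it are illustrative.

```python
import math


def switching_probability(v, v50, width):
    """Logistic model of switching probability versus pulse amplitude.

    ASSUMPTION: logistic form with hypothetical fit parameters.
    """
    return 1.0 / (1.0 + math.exp(-(v - v50) / width))


def amplitude_for_probability(p, v50, width):
    """Invert the logistic model: amplitude giving switching probability p."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return v50 + width * math.log(p / (1.0 - p))
```

Under this model, a target probability of 0.5 maps back to `v50`, and probabilities closer to 0 or 1 require amplitudes further from it, which is why such targets are sensitive to drift in the fitted parameters.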

Data availability

All data needed to evaluate the conclusions in the paper are present in the paper and available at https://doi.org/10.6084/m9.figshare.28071089.

Code availability

The code used in this work is available at: https://doi.org/10.5281/zenodo.17503789

References

Ghahramani, Z. Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Kacem, I., Kellerer, H. & Mahjoub, A. R. Preface: New trends on combinatorial optimization for network and logistical applications. Ann. Oper. Res. 298, 1–5 (2021).
Sbihi, A. & Eglese, R. W. Combinatorial optimization and Green Logistics. Ann. Oper. Res. 175, 159–175 (2010).
Yanling, W., Deli, Y. & Guoqing, Y. Logistics supply chain management based on multi-constrained combinatorial optimization and extended simulated annealing. In 2010 International Conference on Logistics Systems and Intelligent Management (ICLSIM) 188–192 (IEEE, Harbin, China, 2010). https://doi.org/10.1109/ICLSIM.2010.5461441.
Abaku, E. A., Edunjobi, T. E. & Odimarha, A. C. Theoretical approaches to AI in supply chain optimization: Pathways to efficiency and resilience. Int. J. Sci. Technol. Res. Arch. 6, 092–107 (2024).
Oliveto, P. S., He, J. & Yao, X. Time complexity of evolutionary algorithms for combinatorial optimization: A decade of results. Int. J. Autom. Comput. 4, 281–293 (2007).
Bellman, R. Dynamic Programming. Science 153, 34–37 (1966).
Lawler, E. L. & Wood, D. E. Branch-and-Bound Methods: A Survey. Oper. Res. 14, 699–719 (1966).
Benson, S. J., McInnes, L. C. & Moré, J. J. A case study in the performance and scalability of optimization algorithms. ACM Trans. Math. Softw. 27, 361–376 (2001).
Scardapane, S. & Wang, D. Randomness in neural networks: an overview. WIREs Data Min. Knowl. Discov. 7, e1200 (2017).
Rahnamayan, S., Tizhoosh, H. R. & Salama, M. M. A. Opposition versus randomness in soft computing techniques. Appl. Soft Comput. 8, 906–918 (2007).
Singh, N. S. et al. CMOS plus stochastic nanomagnets enabling heterogeneous computers for probabilistic inference and learning. Nat. Commun. 15, 2685 (2024).
Borders, W. A. et al. Integer factorization using stochastic magnetic tunnel junctions. Nature 573, 390–393 (2019).
Camsari, K. Y. et al. From Charge to Spin and Spin to Charge: Stochastic Magnets for Probabilistic Switching. Proc. IEEE 108, 1322–1337 (2020).
Gibeault, S. et al. Programmable electrical coupling between stochastic magnetic tunnel junctions. Phys. Rev. Appl. 21, 034064 (2024).
Elyasi, M., Kanai, S., Ohno, H., Fukami, S. & Bauer, G. E. W. Effect of nonlinear magnon interactions on stochastic magnetization switching. Phys. Rev. B 110, 094433 (2024).
Wang, Y. et al. Superior probabilistic computing using operationally stable probabilistic-bit constructed by manganite nanowire. Natl. Sci. Rev. nwae338 https://doi.org/10.1093/nsr/nwae338 (2024).
Bao, Y., Yang, S., Yao, Z. & Yang, H. Computing with magnetic tunnel junction based sigmoidal activation functions. Appl. Phys. Lett. 124, 242403 (2024).
Luo, Y. et al. Magnetic field-free stochastic computing based on the voltage-controlled magnetic tunnel junction. Appl. Phys. Lett. 124, 212403 (2024).
Chlumecký, M., Buchtele, J. & Richta, K. Application of random number generators in genetic algorithms to improve rainfall-runoff modelling. J. Hydrol. 553, 350–355 (2017).
Geng, X., Chen, Z., Yang, W., Shi, D. & Zhao, K. Solving the traveling salesman problem based on an adaptive simulated annealing algorithm with greedy search. Appl. Soft Comput. 11, 3680–3689 (2011).
Li, X. H. et al. True random number generator based on spin–orbit torque magnetic tunnel junctions. Appl. Phys. Lett. 123, 142403 (2023).
Li, X. H. et al. Stochastic p-Bits Based on Spin-Orbit Torque Magnetic Tunnel Junctions. Preprint at https://doi.org/10.48550/arXiv.2306.02780 (2023).
Zhao, M. K. et al. Type-Y magnetic tunnel junctions with CoFeB doped tungsten as spin current source. Appl. Phys. Lett. 120, 182405 (2022).
He, B. et al. All-Electrical 9-Bit Skyrmion-Based Racetrack Memory Designed with Laser Irradiation. Nano Lett. 23, 9482–9490 (2023).
Jung, S. et al. A crossbar array of magnetoresistive memory devices for in-memory computing. Nature 601, 211–216 (2022).
Hong, J. et al. Demonstration of spin transfer torque (STT) magnetic recording. Appl. Phys. Lett. 114, 243101 (2019).
Fukushima, A. et al. Spin dice: A scalable truly random number generator based on spintronics. Appl. Phys. Express 7, 083001 (2014).
Choi, W.H. et al. A Magnetic Tunnel Junction based True Random Number Generator with conditional perturb and real-time output probability tracking. In 2014 IEEE International Electron Devices Meeting 12.5.1-12.5.4 (IEEE, San Francisco, CA, USA, 2014). https://doi.org/10.1109/IEDM.2014.7047039.
Vodenicarevic, D. et al. Low-Energy Truly Random Number Generation with Superparamagnetic Tunnel Junctions for Unconventional Computing. Phys. Rev. Appl. 8, 054045 (2017).
Chen, H. et al. Binary and Ternary True Random Number Generators Based on Spin Orbit Torque. In 2018 IEEE International Electron Devices Meeting (IEDM) 36.5.1-36.5.4 (IEEE, San Francisco, CA, 2018). https://doi.org/10.1109/IEDM.2018.8614638.
Song, M., Duan, W., Zhang, S., Chen, Z. & You, L. Power and area efficient stochastic artificial neural networks using spin–orbit torque-based true random number generator. Appl. Phys. Lett. 118, 052401 (2021).
Li, X. et al. Restricted Boltzmann Machines Implemented by Spin–Orbit Torque Magnetic Tunnel Junctions. Nano Lett. 24, 5420–5428 (2024).
Zhang, R. et al. Probability-Distribution-Configurable True Random Number Generators Based on Spin-Orbit Torque Magnetic Tunnel Junctions. Adv. Sci. 11, 2402182 (2024).
Jia, X. et al. SPINBIS: Spintronics-Based Bayesian Inference System With Stochastic Computing. IEEE Trans. Comput. -Aided Des. Integr. Circuits Syst. 39, 789–802 (2019).
Shim, Y., Chen, S., Sengupta, A. & Roy, K. Stochastic Spin-Orbit Torque Devices as Elements for Bayesian Inference. Sci. Rep. 7, 14101 (2017).
Tao, Q. & Han, J. Solving traveling salesman problems via a parallel fully connected ising machine. In Proc. 59th ACM/IEEE Design Automation Conference 1123–1128 (ACM, San Francisco California, 2022). https://doi.org/10.1145/3489517.3530595.
Zhang, T., Tao, Q., Liu, B. & Han, J. Ising Machines Using Parallel Spin Updating Algorithms for Solving Traveling Salesman Problems. In Design and Applications of Emerging Computer Systems (eds Liu, W., Han, J. & Lombardi, F.) 687–707 (Springer Nature Switzerland, Cham, 2023). https://doi.org/10.1007/978-3-031-42478-6_26.
Zhang, T. & Han, J. Efficient Traveling Salesman Problem Solvers using the Ising Model with Simulated Bifurcation. In 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE) 548–551 (IEEE, Antwerp, Belgium, 2022). https://doi.org/10.23919/DATE54114.2022.9774576.
Si, J. et al. Energy-efficient superparamagnetic Ising machine and its application to traveling salesman problems. Nat. Commun. 15, 3457 (2024).
Zhang, R. et al. Drift-resilient magnetic-tunnel-junction random-number generator via hybrid control strategies. Phys. Rev. Appl. 23, 054073 (2025).
Valli, A. S. E., Tsao, M., Smith, J. D., Misra, S. & Kent, A. D. High-speed tunable generation of random number distributions using actuated perpendicular magnetic tunnel junctions. Appl. Phys. Lett. 126, 212403 (2025).
Wang, J., Xiao, C., Wang, S. & Ruan, Y. Reinforcement learning for the traveling salesman problem: Performance comparison of three algorithms. J. Eng. 2023, e12303 (2023).
Hayakawa, K. et al. Nanosecond Random Telegraph Noise in In-Plane Magnetic Tunnel Junctions. Phys. Rev. Lett. 126, 117202 (2021).
Myers, E. B., Ralph, D. C., Katine, J. A., Louie, R. N. & Buhrman, R. A. Current-Induced Switching of Domains in Magnetic Multilayer Devices. Science 285, 867–870 (1999).
Fuchs, G. D. et al. Spin-transfer effects in nanoscale magnetic tunnel junctions. Appl. Phys. Lett. 85, 1205–1207 (2004).
Ralph, D. C. & Stiles, M. D. Spin transfer torques. J. Magn. Magn. Mater. 320, 1190–1216 (2007).
Xu, Y. Q. et al. Self-stabilized true random number generator based on spin–orbit torque magnetic tunnel junctions without calibration. Appl. Phys. Lett. 125, 132403 (2024).

Acknowledgements

This work was supported by the National Key Research and Development Program of China (MOST) (Grant No. 2022YFA1402800), the National Natural Science Foundation of China (NSFC) (Grant Nos. 12134017, 51831012, 51620105004, and 12374131), the Strategic Priority Research Program (B) of Chinese Academy of Sciences (CAS) (Grant No. XDB33000000) and the CAS President’s International Fellowship Initiative (PIFI) (Grant No. 2025PG0006), awarded to X.H.; C.W. appreciates financial support from the Youth Innovation Promotion Association, CAS (Grant No. 2020008).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing, China
Ran Zhang, Xiaohan Li, Caihua Wan, Yingqian Xu, Shiqiang Liu, Dehao Kong, Shilong Xiong, Guoqiang Yu & Xiufeng Han
Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, China
Caihua Wan, Guoqiang Yu & Xiufeng Han
Songshan Lake Materials Laboratory, Dongguan, Guangdong, China
Caihua Wan, Guoqiang Yu & Xiufeng Han
Fraunhofer IPMS, Center Nanoelectronic Technologies, Dresden, Germany
Raik Hoffmann, Meike Hindenberg, Alptekin Vardar & Thomas Kämpfe
Zhejiang Hikstor Technology Co. Ltd, Hangzhou, China
Shikun He, Qiang Dai, Junlu Gong, Yihui Sun & Zejie Zheng
TU Braunschweig, Institute for CMOS Design, Braunschweig, Germany
Thomas Kämpfe

Contributions

C.W., T.K. and X.H. conceived the research direction, coordinated the collaboration, and supervised all stages of the project. R.Z. carried out the majority of the experiments, including device measurements, data acquisition, and quantitative analysis. R.Z., X.L., C.W., Y.X., S.L., D.K., S.X., G.Y. and A.V. jointly contributed to the development of the experimental concepts and methodology, continuously refined the ideas through discussions, and participated in interpreting the results. R.H. and M.H. supported the characterization workflow and provided technical assistance for the measurement infrastructure. S.H., Q.D., J.G., Y.S. and Z.Z. were responsible for device and sample preparation, including thin-film stack growth, microfabrication, and device processing. All authors contributed to manuscript preparation, provided critical feedback at different stages of writing, and approved the final version of the manuscript.

Corresponding authors

Correspondence to Caihua Wan, Thomas Kämpfe or Xiufeng Han.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zhang, R., Li, X., Wan, C. et al. Probabilistic greedy algorithm solver using magnetic tunneling junctions for traveling salesman problem. Nat Commun 17, 189 (2026). https://doi.org/10.1038/s41467-025-66864-9

Received: 23 December 2024
Accepted: 17 November 2025
Published: 04 December 2025
Version of record: 07 January 2026
DOI: https://doi.org/10.1038/s41467-025-66864-9