Main

Learning is a fundamental process driving adaptability and survival across biological scales. At the organismal level, neural rewiring and synaptic plasticity enable the brain to learn and form memories, essential for behavioural adaptation and decision-making8. At the cellular level and in a simpler form, the immune system learns from pathogen encounters, enhancing future responses9. At the molecular level and in the simplest form, biochemical circuits in bacteria learn about their environment, forming short-term memories to optimize survival10.

Inspired by these natural processes, researchers have explored the deep connections between biological learning algorithms and engineered cellular and molecular networks in numerous theoretical and experimental studies11,12,13. These studies span a diverse range of systems, including abstract chemical reaction networks14,15,16, genetic regulatory networks17,18,19, protein circuits7,20 and metabolic circuits21. Notably, DNA-based systems such as cell-free transcriptional and translational circuits22,23, polymerase-exonuclease-nickase DNA circuits6,24, DNA strand-displacement circuits3,4,5,24,25 and DNA tile self-assembly26 have been developed to perform molecular pattern recognition and classification.

Despite three decades of engineering research, experimental demonstrations have been limited to simple adaptive behaviours in systems with no more than a dozen signals27,28,29. The goal of this work is to implement a molecular system that can autonomously learn to perform complex information-processing tasks. To distinguish learning from simple adaptation, consider a future artificial cell that learns from a more advanced entity, such as a biological cell. It observes the teacher’s behaviour, remembers how the teacher responds to a stimulus, learns from multiple examples over time, and generalizes this knowledge to respond independently to similar stimuli. Developing such a system requires several features. First, the artificial cell must interpret molecular inputs on its own, without relying on an instructor to translate between formats—so training and test data must share the same molecular ‘language’ (independence). Second, it must remember new examples without overwriting past memories, integrating all training data presented sequentially (integration). Third, it must generalize from examples, processing test data that differ from training data using a general-purpose architecture for complex and noisy information (generality). Fourth, it must retain long-term memories, making appropriate decisions even days or months after training data are removed (stability). Fifth, its success should be judged by its ability to correctly classify test data given sufficient training examples (accuracy). Sixth, its performance should be enhanced by its ability to process sequential test data and update its decisions over time (reusability). Finally, its power depends on the type of information that it can store and process (flexibility).

We demonstrate a DNA-based molecular system with learning capabilities that satisfy four key features—independence, integration, generality and stability. Stability arises naturally from the DNA implementation, and generality builds on previous work4, whereas independence and integration require substantial engineering, as we will discuss in this paper. The remaining features—accuracy, reusability and flexibility—are not fully met and represent goals for future work (Extended Data Fig. 1).

Concept and design

In silico learning produces a DNA neural network with fixed memories—acting like a hardwired processor. In contrast, in vitro learning enables the liquid system itself to form memories in response to example molecular inputs during a training phase—acting like an adaptive memory device. When composed together with a processor, this integrated system provides learned input–output functions applied to a subsequent testing phase (Extended Data Fig. 2). In previous work, we introduced a simple in silico learning algorithm using averaged training patterns as weights for winner-take-all computation4. Here we implement a variant for in vitro learning by molecules within a DNA neural network (Fig. 1a). Initially, all weights are zero, representing a blank memory. During training, input patterns and class labels collectively activate specific weights, adding the input to the corresponding memory. After all examples are presented, the resulting weights—encoded in the concentrations of activator molecules—passively store aggregated training data. Learning transitions to testing by connecting the memory device to the processor, transferring information from activators to weight molecules. These now-active weights interact with test inputs, compute weighted sums for each output and trigger a winner-take-all competition. The output with the largest sum is amplified, producing binary signals that classify the input based on its similarity to the learned memories (Fig. 1b).
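
As a conceptual illustration only, this training and testing procedure can be written as a few lines of ordinary code. The sketch below is an idealized restatement of Fig. 1a,b; the function and variable names are ours and are not part of the molecular implementation.

```python
import numpy as np

def train(patterns, labels, n_bits, n_classes):
    """Accumulate each training pattern into the memory selected by its label.

    patterns: binary vectors of length n_bits; labels: class index per pattern.
    Returns weights w[j] equal to the average of all training patterns in
    class j, mirroring a_j in Fig. 1b.
    """
    w = np.zeros((n_classes, n_bits))
    counts = np.zeros(n_classes)
    for x, j in zip(patterns, labels):        # training patterns presented sequentially
        w[j] += np.asarray(x, dtype=float)    # label j activates the weights of memory j
        counts[j] += 1
    return w / counts[:, None]                # the 1/q_j scaling

def classify(x, w):
    """Winner-take-all readout: the memory with the largest weighted sum wins."""
    s = w @ np.asarray(x, dtype=float)        # weighted sums s_j
    y = np.zeros(len(w))
    y[np.argmax(s)] = 1.0                     # the winner is amplified to a binary ON output
    return y

# Toy example in the spirit of Fig. 1b: learn 9-bit 'L' and 'T', then classify a corrupted 'L'.
L = [1,0,0, 1,0,0, 1,1,1]
T = [1,1,1, 0,1,0, 0,1,0]
w = train([L, T], labels=[0, 1], n_bits=9, n_classes=2)
print(classify([1,0,0, 1,0,0, 1,1,0], w))     # -> [1. 0.], classified as 'L'
```

In the DNA implementation, w and s correspond to concentrations of weight and summation species, and the argmax is realized chemically by the winner-take-all competition rather than by an explicit comparison.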

Fig. 1: Concept and design of supervised learning in DNA neural networks.

a, Abstract training and testing process. b, Example of learning two 9-bit patterns ‘L’ and ‘T’ followed by classification of two corrupted tests. x1 to xn and y1 and y2 are binary inputs and outputs, respectively, represented by coloured and greyscale nodes. Black and white nodes represent outputs that are computed as ON and OFF, respectively. x = {x1, x2, …, xn} can be either a training or test pattern, shown as a mixture of molecules in a droplet (actual form in an experiment) or arranged in a \(\sqrt{n}\)-by-\(\sqrt{n}\) array for visual clarity. A total of q and p patterns are used for training and testing, respectively. qj is the total number of training patterns in class j. l1 and l2 are binary class labels represented by polygon shapes. aj = {a1,j, a2,j, …, an,j} is a learned memory associated with output yj, which equals the average of all training patterns in class j. After training, the value of aj can then be transferred to wj for classification of test patterns. Light grey and black wires indicate inhibited and activated weights with zero and learned values, respectively. sj is the weighted sum of inputs for comparing a test pattern with memory j, represented by a polygon-shaped node matching the class label. c, Chemical reaction network implementation. d, Seesaw DNA circuit implementation. Black species indicate molecules whose concentrations correspond to variable values in the abstract mathematical function. Grey species indicate molecules designed to facilitate the desired reactions; their concentrations are typically in excess. Threshold thi is used to clean up noise in xi so that the input is only considered ON if xi > thi. For implementation reasons, the computation of weighted sum is split into weight multiplication pi,j = wi,jxi and summation \({s}_{j}={\sum }_{i=1}^{n}{p}_{i,j}\). The ON value of the output is set by the restoration gate concentration gj. kf and ks are reaction rate constants that must satisfy kf ≫ ks in a pair of reactions with shared reactants; separate reactions labelled with kf or ks do not need to have identical rates. Label inhibitor Inhj is not initially present during training but added after each training event to clean up leftover label. Notations for the seesaw circuit diagram are explained in the Extended Data Fig. 2 caption.

The training process resembles ‘learning by memorizing’, conceptually akin to Hebbian learning30 and k-means clustering31 (‘Learning algorithm’ in Methods). Unlike modern machine learning, which relies on loss functions and optimization, our approach is simpler yet effective for molecular systems. Although it does not extend naturally to deeper neural networks, it provides an initial step for exploring more advanced architectures.

To implement the learning and testing algorithm with molecules, we first translate the abstract mathematical function into a set of chemical reactions, where each variable is represented by the concentration of a chemical species (Fig. 1c). These reactions are then realized using DNA strand displacement32. Although arbitrary chemical reaction networks can be implemented in principle33, experimental demonstrations have been limited to small systems with a few reactions34,35. Simpler schemes such as the seesaw motif36 have enabled larger systems with hundreds of reactions3,4,37,38. Here, inspired by activatable species used in adaptive DNA circuits25,27,39, we extend the seesaw architecture with two new activatable gate motifs (Extended Data Fig. 3) to implement the learning chemical neural network (Fig. 1d and ‘Implementation of learning and testing’ in Methods).

Motif characterization

The first motif is an activatable amplification gate (Extended Data Fig. 3). Using this motif, we developed a weight gate that catalytically produces a weighted input signal Pi,j, consisting of a toehold ‘T’ flanked by two long domains Xi and Pj (Fig. 2a). To activate a specific weight Wi,j, the activator Acti,j must carry both input bit (i) and memory class (j) information. Although this could be encoded using two consecutive toeholds via the allosteric toehold mechanism40, that design has drawbacks (‘Alternative activatable weight design’ in Methods). Instead, we encode class information in toehold Tj and bit information in branch migration domain Ai (Fig. 2a). An activator with matching Tj and Ai binds to the inhibited weight \({W}_{i,j}^{* }\), exposing a universal toehold U* for input binding. To embed a hidden thermodynamic drive41, we introduce a bulge loop B between U and Xi in the weight gate’s top strand, with the activator carrying B*. Bulge elimination in branch migration lowers the reverse rate and prolongs U* exposure, enhancing input binding.

Fig. 2: Characterization of key motifs.

a, DNA strand-displacement implementation of weight activation followed by weight multiplication. Arrows with black-filled and white-filled arrowheads indicate the forwards and backwards directions of a reversible reaction step, respectively. White-filled arrowheads are omitted if a reaction is considered to be approximately irreversible. b,c, Fluorescence kinetics experiments of the activatable weight gate with varying activator (b) and input (c) concentrations. A reporter molecule shown in Extended Data Fig. 4a reacts with the output of this reaction, resulting in an increased fluorescence signal, which is then normalized to concentration based on control experiments. Experimental data (dotted trajectories) are overlaid with mass-action simulations of chemical kinetics obtained by solving ordinary differential equations (solid trajectories). Relative concentrations of the inhibited weight, fuel and reporter are 1.5×, 3× and 2×, respectively. Standard concentration (1×) is 50 nM here and in all other figures. Endpoint measurements of output concentrations are plotted against activator or input concentrations to reveal their relationship. The solid line indicates a linear fit to the experimental data. The dotted line indicates equal input–output concentration, used as a comparison for the experimental data, highlighting the effect of signal amplification. d, Crosstalk evaluation with 18 weight gates and activators. e,f, DNA strand-displacement implementation of supervised learning (e) and label inhibition (f). g,h, Fluorescence kinetics experiments of the learning gate with varying label (g) and input (h) concentrations. The bottom strand in the inhibited activator and the inputs are modified with a fluorophore and a quencher, respectively (Extended Data Fig. 5a). i, Crosstalk evaluation with eight learning gates and input–label pairs. Excess label at 5× was used. The coloured trajectories in d and i indicate output concentrations over 2 h for distinct activators, or activator concentrations over 8 h for distinct input–label pairs, with each colour representing a specific activator or input–label pair.


We characterized the weight gate using fluorescence kinetics experiments (Supplementary Note 1). With fixed input and varying activator concentrations, the output reached the activator level within 2 h, demonstrating accurate 1-bit information transfer from passive to active memory (Fig. 2b and Extended Data Fig. 4d). With fixed activator and varying input concentrations, the output exceeded the input by over 4-fold in 20 h, saturating at the activator level—showing robust weight multiplication and signal amplification (Fig. 2c and Extended Data Fig. 4d).

We assessed weight-activation specificity using crosstalk experiments involving 18 inhibited weights and activators (Fig. 2d and Extended Data Fig. 4f). All matching pairs (diagonal) yielded ≥88% of the target signal, whereas mismatched pairs (306 off-diagonal cases) produced ≤20%, with 287 cases below 10%—indicating excellent specificity. These experiments were performed without competition—each tube contained one weight and one activator. In the actual memory transfer, all weights and activators are present together, and mismatches must compete with matches. We expect crosstalk in the real scenario to be at least an order of magnitude lower.
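
For readers reproducing this type of analysis, the sketch below shows one way to summarize such a crosstalk matrix; the endpoint matrix here is simulated placeholder data standing in for the measurements in Fig. 2d, and the variable names are ours.

```python
import numpy as np

def summarize_crosstalk(endpoint, target):
    """Summarize a crosstalk matrix of endpoint signals.

    endpoint[i, j] is the output produced by pairing activator i with weight
    gate j, and target is the ideal matched-pair signal. Values are reported
    as fractions of the target signal.
    """
    frac = np.asarray(endpoint, dtype=float) / target
    n = frac.shape[0]
    matched = np.diag(frac)                        # diagonal: matching pairs
    mismatched = frac[~np.eye(n, dtype=bool)]      # off-diagonal: n*(n-1) mismatched cases
    return {
        "matched_min": matched.min(),
        "mismatched_max": mismatched.max(),
        "mismatched_below_10pct": int((mismatched < 0.10).sum()),
        "mismatched_total": mismatched.size,
    }

# Simulated placeholder data standing in for the 18x18 matrix of Fig. 2d.
rng = np.random.default_rng(0)
demo = 0.05 * rng.random((18, 18)) + 0.9 * np.eye(18)
print(summarize_crosstalk(demo, target=1.0))
```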

The second motif is an activatable transformation gate (Extended Data Fig. 3). Using this motif, we developed a learning gate that stoichiometrically produces an activator signal (Fig. 2e). A key requirement is irreversible consumption of input and label strands to prevent errors in future learning events. A simple allosteric toehold design is reversible, and although a drain molecule could enforce irreversibility, it introduces complications (‘Alternative learning gate design’ in Methods). Instead, we embed irreversibility into the learning gate itself (Fig. 2e). A Tj bulge loop in the Xi domain slows reverse branch migration, and upon top-strand release, the Tj domain forms a hairpin with Tj*, further inhibiting the reverse reaction. We experimentally confirmed irreversibility (Supplementary Fig. 20 and Supplementary Note 5.2) and optimized Xi length to balance strand quality and intermediate-state stability. This minimizes spurious activation during interaction with weight gates (Supplementary Fig. 21 and Supplementary Note 5.3).

Although learning is irreversible when both label and input are present, the label alone must interact reversibly with the learning gate. Otherwise, all gates in a memory would be locked on by one training pattern and respond incorrectly to patterns in the other class. This reversibility is built into a transient intermediate activator (Fig. 2e), which is less stable than the activated weight (Fig. 2a). Consequently, the produced activator followed the expected linear dependence on label and input concentrations, although at levels slightly below the ideal values (Fig. 2g,h). To improve kinetics and reaction completion (Extended Data Fig. 5d), we used excess label and added an inhibitor to remove leftover label between training events (Fig. 2f).

We evaluated the learning motif’s specificity using crosstalk experiments with eight learning gates and input–label pairs (Fig. 2i and Extended Data Fig. 5e). Unlike the weight motif, which uses a standard fluorescence reporter, these experiments required distinct fluorophore–quencher strands for each reaction (Extended Data Fig. 5a), limiting the experiment scale owing to cost. All matching pairs produced ≥94% of the target signal, whereas all 56 mismatched cases yielded ≤10%, demonstrating high specificity. As with weight crosstalk tests, these were done without competition; actual learning scenarios are expected to show even lower crosstalk.

Activatable memories

The activatable weight motif enables a DNA neural network to receive memories encoded in activators for performing different tasks. In this section, the activators represent weights from in silico training, allowing us to separately evaluate the function of the programmable processor before integrating it with the memory device. We constructed a network with two 100-bit activatable memories and provided 3 distinct sets of activator strands, each encoding 2 classes of handwritten digits from the Modified National Institute of Standards and Technology (MNIST) database42 (Fig. 3). In previous work, we averaged 100 training patterns per class and selected the top-20 bits to form the weight matrix4. Here, because we lack a molecular mechanism for bit selection, classification accuracy declines (Supplementary Fig. 31a). To compensate, we developed a ‘good teacher’ strategy: rank and filter above-average training patterns, then randomly select ten per class (Supplementary Note 5.12)—used here for setting activator concentrations and for training in the next section.
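
The sketch below gives one plausible reading of this ‘good teacher’ selection. The ranking score used (similarity of each candidate to the class mean) is our assumption; the exact criterion is specified in Supplementary Note 5.12.

```python
import numpy as np

def good_teacher_select(patterns, n_select=10, rng=None):
    """Select 'good teacher' training patterns for one class.

    Rank candidates, keep the above-average ones, then draw n_select at random.
    The ranking score (similarity to the class mean) is an assumption; the
    exact criterion is specified in Supplementary Note 5.12.
    """
    rng = rng or np.random.default_rng()
    X = np.asarray(patterns, dtype=float)          # shape: (candidates, bits)
    score = X @ X.mean(axis=0)                     # similarity of each candidate to the class mean
    above_avg = X[score > score.mean()]            # keep above-average candidates
    idx = rng.choice(len(above_avg), size=n_select, replace=False)   # assumes enough candidates remain
    return above_avg[idx]                          # their average sets the activator concentrations

# Example with 100 random 100-bit candidates standing in for one MNIST class.
rng = np.random.default_rng(1)
candidates = (rng.random((100, 100)) < 0.2).astype(int)
teachers = good_teacher_select(candidates, rng=rng)
```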

Fig. 3: Pattern classification with activatable memories.

a–c, Test patterns selected based on their positions in the weighted sum space for three distinct pairs of MNIST handwritten digits: 0 and 1 (a), 3 and 4 (b), and 6 and 7 (c). Weights are determined as the average of ten training patterns per class. The entire test dataset is shown in light grey. Test patterns within a 20% margin to the diagonal line (s1 = s2) are excluded for experimental feasibility. d,e, Fluorescence kinetics experiments with 12 test patterns per class on activated memories of 0 and 1 in memory 1 and 2 (d), respectively, or swapped in memory 2 and 1 (e). The difference between two output concentrations (Y1 − Y2) at the end of the experiments is shown for all test patterns sorted according to their distance to the diagonal line in the weighted sum space. For the left set of patterns, the distance decreases from left to right; for the right set of patterns, the distance increases. f–i, Endpoint data for testing activated memories of 3 and 4 in memory 1 and 2 (f), respectively, or swapped in memory 2 and 1 (g) or 6 and 7 in memory 1 and 2 (h), respectively, or swapped in memory 2 and 1 (i).


Test patterns were selected based on their positions in the weighted sum space, reflecting classification difficulty. For each class, 12 representative tests were chosen via k-means clustering (k = 12), selecting 1 example per cluster (Fig. 3a–c). To ensure experimental feasibility, we excluded examples within 20% of the diagonal, retaining 53% of zeros and 83% of ones from the dataset. Outside this margin, the network is expected to classify all test patterns correctly with clear binary outputs. Within the margin, classifications are still possible but less binary. Similarly, the expected lower bounds of classification accuracy—determined by the fraction of test patterns outside the 20% margin—for the full MNIST dataset were 56% for threes, 46% for fours, 56% for sixes and 71% for sevens (Supplementary Fig. 31d). Fluorescence kinetics experiments confirmed correct outputs for all test patterns, showing clear on–off separation (Fig. 3d–i). Separation quality correlated with distance from the diagonal: greater distance yielded larger separations. To assess DNA sequence dependence, we encoded digit pairs with swapped memory molecules. Experimental results showed minor differences, consistent with simulations using reaction rates from the motif characterization (Extended Data Fig. 6).
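
The following sketch illustrates this selection procedure under stated assumptions: the margin criterion is taken as the relative difference of the two weighted sums, and the representative of each k-means cluster is taken as the member closest to the cluster centre. Both choices are ours and may differ in detail from the actual procedure.

```python
import numpy as np
from sklearn.cluster import KMeans

def select_test_patterns(tests, w, margin=0.2, k=12, seed=0):
    """Pick k representative test patterns for one class in the weighted sum space."""
    X = np.asarray(tests, dtype=float)
    S = X @ w.T                                              # weighted sums (s1, s2) per test pattern
    keep = np.abs(S[:, 0] - S[:, 1]) / (S.max(axis=1) + 1e-9) >= margin   # exclude the 20% margin
    S_kept, X_kept = S[keep], X[keep]
    km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(S_kept)
    chosen = []
    for c in range(k):                                       # one example per cluster:
        members = np.where(km.labels_ == c)[0]               # take the member closest to the centre
        d = np.linalg.norm(S_kept[members] - km.cluster_centers_[c], axis=1)
        chosen.append(X_kept[members[np.argmin(d)]])
    return np.array(chosen)

# Example with synthetic 100-bit test patterns and a 2 x 100 in silico weight matrix.
rng = np.random.default_rng(2)
w = (rng.random((2, 100)) < 0.2).astype(float)
tests = (rng.random((500, 100)) < 0.2).astype(int)
representatives = select_test_patterns(tests, w)
```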

The ability to receive memories enables the DNA neural network to function as a field-programmable device, performing different classification tasks based on the activators provided. Although it is still following instructions rather than learning, this demonstration is powerful—it can execute any combination of commands from a library of hundreds to classify complex and noisy inputs.

Learned weights

The learning motif enables a DNA neural network to develop memories encoded in activators for later computation. In this section, the activators represent weights learned from in vitro training, allowing us to separately evaluate the function of the adaptive memory device before integrating it with the processor. We constructed a network of learning gates that received 100-bit training data from 2 classes of handwritten digits and monitored the produced activators. A robust learning system should handle arbitrary training patterns in any order. To test this, we used a ‘batch training’ procedure, simultaneously presenting all patterns from the same class—analogous to batch training in machine learning. In molecular terms, mixing pre-prepared patterns yields a training mixture where each input strand’s quantity reflects the combined signal from all patterns. Although this method lacks intra-class order variation, it still tests robustness by presenting the two classes in different orders (Fig. 4a). Training with individual patterns would slow learning owing to lower input concentrations and require label inhibition between patterns of different classes, complicating the experiments.
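
As a bookkeeping illustration of batch training in molecular terms, the sketch below computes per-strand concentrations in a training mixture, assuming that each of the q patterns of a class contributes its input strands at 1/q of the standard concentration (see ‘Implementation of learning and testing’ in Methods); the exact per-pattern amounts in the experiments may differ.

```python
import numpy as np

def batch_training_mixture(patterns, standard_nM=50.0):
    """Concentration of each input strand X_i in a batch-training mixture.

    Each of the q patterns of a class contributes its input strands at 1/q of
    the standard concentration, so the mixed concentrations are proportional
    to the class average (the target weights). The 50 nM standard follows the
    figure captions; the exact per-pattern amount is an assumption.
    """
    X = np.asarray(patterns, dtype=float)          # shape: (q, n_bits), binary
    return standard_nM * X.mean(axis=0)            # nM of strand X_i in the mixture

# Ten hypothetical 100-bit patterns of one class.
rng = np.random.default_rng(3)
demo = (rng.random((10, 100)) < 0.2).astype(int)
print(batch_training_mixture(demo).max())          # at most 50 nM per strand
```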

Fig. 4: Learned weights.

a, Abstract training process of learning two classes of 100-bit handwritten digits in two distinct orders. Grey and black wires indicate inhibited and learned weights, respectively. b, Fluorescence kinetics experiments that read out the learned weights. Learning was performed as follows: present all 10 training patterns from one class together with the class label, wait for 24 h, add the label inhibitor, wait for 2 h, and then repeat with the second class. After learning was completed, 100 aliquots of the learned memories were each mixed with a unique pair of activatable weight molecules (\({W}_{i,1}^{* }\) and \({W}_{i,2}^{* }\)), a fuel strand (XFi), all 100 input strands (X1 to X100), and a pair of standard reporters that each converts one of the two possible output signals to fluorescence. The two reporters were modified with fluorophores ATTO590 and ATTO488, respectively, allowing for simultaneous readout in two fluorescence channels. Eight hours of kinetics data are shown in two 10-by-10 arrays. Each position in both arrays corresponds to the same sample. Each array corresponds to one of the two fluorescence channels. c, Measured weight concentrations at 4 h and error statistics for learning handwritten digits 0 and 1 in two distinct orders. d,e, Overlaid training patterns (10 per class), representing target weights, and learned weights (measured weight concentrations at 4 h) for learning handwritten digits 3 and 4 (d) or 6 and 7 (e). f, Distribution of errors from experiments shown in c–e. Ø indicates a blank memory. w1 and w2 indicate the weight matrix for memory 1 and 2, respectively.


We performed fluorescence kinetics experiments to read out learned weights after training with digits 0 and 1 (Fig. 4b). Each plot in the data array shows signal increase for one activated weight in the two 100-bit memories. Visually, the digit shapes emerged, confirming successful learning. Quantitatively, pixels with 0 values in all 10 training patterns showed low signals—0.5 ± 0.3% of the total signal for 1 and 1.1 ± 0.4% for 0—whereas other pixels showed up to 13-fold signal increases over background (see endpoint values in Extended Data Fig. 7).

To investigate background noise, we captured snapshots of the training process at the start, midpoint, and end (Fig. 4c). Before training, background signals in memories 1 and 2 were 0.4 ± 0.4% and 0.7 ± 0.3% of the total signal, respectively (Fig. 4c, left column). Memory 2’s higher background persisted post-training (Fig. 4c, middle and right columns), probably owing to sequence variations and synthesis errors causing spurious weight activation. Still, this noise was minimal compared with earlier versions of the learning and activatable weight motifs (Supplementary Note 4.7).

Introducing the first batch of training patterns with their label led to correct storage in the target memory, whereas the other remained largely unaffected (Fig. 4c), demonstrating strong label specificity. The second batch was similarly stored without disrupting the earlier memory. Learned weights were consistent across two training orders, confirming robust learning regardless of which class was presented first or which memory stored each class.

Further robustness was demonstrated using two additional pairs of distinct training patterns (Fig. 4d,e and Extended Data Fig. 8), showing the system’s ability to learn arbitrary information from two classes of 100-bit patterns. Error analysis revealed that memory 2 consistently had more noise than memory 1, regardless of training patterns (Fig. 4f). In addition, a few bits showed lower-than-expected values across training patterns and in both memories, probably owing to secondary structures in the learning intermediates (‘Errors in learned weights’ in Methods).

The ability to develop memories enables the DNA neural network to learn from examples. As shown, it can store arbitrary patterns within a given complexity, integrating them into two memories in any order. Learning is powered solely by the training patterns and labels. After training, the system reaches thermodynamic equilibrium, stably storing learned weights until testing begins.

Testing after training

The true effectiveness of the DNA neural network’s learning ability lies in its classification performance after connecting the memory device to the processor, where learned information is transferred from activators to weights for downstream computation. This step is the most challenging in building a functional learning system. In earlier designs, we achieved 4-bit learning and 4-bit activatable memories separately but not integrated (Supplementary Note 4). With a revised design, we successfully demonstrated integrated 9-bit learning and testing (Supplementary Fig. 21), but failed to scale to 100-bit (Supplementary Fig. 22 and Supplementary Note 5.4). Investigations revealed that unused molecules—more prevalent in larger systems—were the primary issue. For instance, unused learning gates can cause label occlusion, weakening the production of activator signals. They may also leak with weight gates or test inputs, creating spurious memories. We addressed these challenges through design revisions and fabrication techniques (Supplementary Figs. 23–28 and Supplementary Notes 5.5–5.10). Solutions included adding clamps to suppress toeless strand displacement37,43, adjusting annealing ratios, and using clean-up strands to promote competition between full-length and truncated strands, improving gate purity.

To test the scalability of our final design (Fig. 2), we created training and test patterns of increasing complexity (Fig. 5a). We predicted that the performance depends on the ratio of unused-to-used memory bits—that is, inhibited-to-activated weights post-training. Experiments with increasing memory size and varying fractions of activated bits confirmed this (Fig. 5b–d and Extended Data Fig. 9). Overall, performance declined with increasing complexity. However, as long as the total-to-activated bit ratio remained constant, increasing the number of activated bits had minimal impact (Fig. 5e). Performance worsened as this ratio increased (Fig. 5f). These results validated our hypothesis and highlighted a key trade-off: to learn complex patterns, training inputs must not activate too many bits—for example, there is only 1 binary pattern that can be learned if it has 100 ones. Paradoxically, unused bits become the dominant factor degrading testing performance. Scaling beyond two memories introduces further challenges, such as a quadratic increase in annihilator species and biased winner-take-all competition from imperfect reaction rates4.

Fig. 5: Scalability of pattern classification using learned weights.

a, Learned weights (top) and test patterns (bottom) with increasing complexity from 4 bits to 100 bits. In learned weights, grey pixels indicate unused memory bits with no associated molecules. White pixels indicate activatable but not activated memory bits with inhibited activator and inhibited weights that are present throughout training and testing. Coloured pixels indicate memory bits activated by training patterns and their labels. In test patterns, each white and black pixel indicates the absence and presence of an input strand, respectively. n is the total number of activatable bits in a memory, b is the number of activated bits in a memory and number of ones in an input, and p is an input’s position (s1, s2) in the weighted sum space. b–d, Fluorescence kinetics experiments for testing after learning with three representative cases: b = 4 and n = 4 (b), b = 12 and n = 36 (c), and b = 20 and n = 100 (d). Fluorescence kinetics data for eight test patterns of two classes are shown in two separate plots. The difference between the 2 outputs at 8 h is shown in the bar chart, next to the learned weights. e,f, Performance analysis based on the complexity of activated bits (e) and total bits (f). Error bars represent standard deviation of eight test patterns for each combination of b and n.


The DNA neural network with learning capabilities is far more complex than the previous one using weights trained in silico4. Our 100-bit, 2-memory network involved over 700 distinct species in a single test tube and more than 1,200 unique strands across learning and testing (Fig. 6b). Depending on the number of training patterns, up to 80% of activators and weights must remain inhibited post-training (Fig. 6c). Despite these challenges, we demonstrated successful classification in 72 representative tests after 3 distinct training processes (Fig. 6d and Extended Data Fig. 10), proving that engineered molecular systems can learn complex information-processing tasks.

Fig. 6: A 100-bit pattern classification using learned weights.

a, Abstract testing process. b, Comparing the number of initial species in a DNA neural network without4 and with learning capability. c, Comparing the number of inhibited and activated species before and after training. The total numbers are calculated based on a single training pattern where b = 20. d, Simulations (solid trajectories) and fluorescence kinetics experiments (dotted trajectories) of pattern classification for 18 representative cases after 3 distinct training processes. Weight matrices before and after training were taken from the experiments shown in Fig. 4.


Discussion and conclusion

The DNA neural network developed here autonomously performs pattern classification after learning, processing test patterns composed of the same types of molecule as in training, but in new combinations. This demonstrates independence beyond field-programmable devices that rely on instructors to translate environmental signals. It also shows generality, enabling classification beyond simple lookup of previous examples. The system received training patterns over time and stored them into two distinct memories based on label information, demonstrating integration beyond simple memorization. After training, the memories remained passive and isolated from computation until testing began, providing stability beyond short-term memories. Together, these features bring us closer to realizing the future artificial cell envisioned at the beginning of this paper (Extended Data Fig. 1).

The DNA neural network with learning capabilities is remarkably robust given its complexity. Aside from fluorescence reporters, the entire system was constructed using unpurified strands, yet performance remained uncompromised—indicating strong tolerance to impurity-induced molecular noise. Double-stranded complexes were purified using a one-pot procedure to correct stoichiometry errors (for example, all 100 learning gates per memory in one mixture). The robustness, combined with low cost and simple fabrication, makes the system readily accessible for future applications.

Learning has been proposed to accelerate evolution through the Baldwin effect, which reshapes the fitness landscape44,45. Our work explores how chemical systems can learn from an unknown environment, forming memories from past inputs to process future signals. However, supervised learning requires labelled examples—a ‘teacher’—which is incompatible with early life. The next challenge is enabling unsupervised learning, where systems enhance their capabilities through unguided exposure to a molecular environment. In such a regime, the system constantly learns and refines its classification decisions. Achieving this requires overcoming a major limitation: the use-once nature of current DNA neural networks. Computation consumes stored energy, driving the system towards equilibrium. Once used, outputs cannot be reversed without added energy. Although various approaches to reusable enzyme-free DNA circuits have been explored46,47,48,49,50, none are yet scalable. Advancing sustainable computation will be essential to realizing unsupervised learning in molecular systems.

Truly impressive learning behaviours depend on the complexity of neural networks. Although scaling DNA neural networks to the level of the human brain or advanced artificial intelligence models remains infeasible, there is substantial room for increasing their complexity. A current limitation is the absence of spatial organization, which is essential for efficient information encoding in both biological neural networks and electronic computers51. Phase-separated DNA condensates offer a promising solution52. These micrometre-sized droplets, containing billions of branched DNA monomers, could provide the spatial organization needed to scale up learning DNA neural networks by at least an order of magnitude. Alternatively, reaction–diffusion DNA systems53 offer another spatial paradigm for sophisticated pattern formation and classification behaviours54,55.

The potential of learning molecular systems extends far beyond current demonstrations. DNA-based classifiers have been used in disease diagnostics56,57; with learning, they could enable therapeutics that remember previous encounters with disease biomarkers, improving future responses. DNA circuits can also control soft materials, allowing them to expand or contract in response to stimuli58,59. With learning, these materials could adapt based on past experiences. Our work addresses the long-standing challenge of molecular learning, opening paths for intelligent molecular systems. These advances could endow non-living physical systems with adaptive decision-making abilities, transforming fields from molecular therapeutics to programmable active materials60.

Methods

Learning algorithm

The learning algorithm adds each binary training pattern to memory j if binary label lj = 1 (Fig. 1a,b), resulting in weights that are averaged training patterns for each class. Conceptually, this approach is similar to Hebbian learning30, often summarized by the phrase ‘cells that fire together wire together’. Although classical Hebbian learning is an unsupervised learning rule in recurrent neural networks, it can be generalized to feedforward winner-take-all neural networks61. In our case, weight wi,j representing the wire between input node xi and weighted sum node sj is turned on when training data containing input xi and label lj are simultaneously present (Fig. 1a). The training process is also conceptually similar to the widely used clustering algorithm k-means31, but used in a supervised setting62. In our case, k = 2 clusters correspond to the two memories and the means correspond to the averaged training patterns.
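
Written as code, the rule is a label-gated Hebbian update followed by per-class normalization. The sketch below is purely conceptual and uses illustrative names.

```python
import numpy as np

def hebbian_supervised_update(w, x, l):
    """One training event written as a label-gated Hebbian update.

    w[i, j] is incremented exactly when input x_i and label l_j are both 1,
    that is, w += outer(x, l): an input-to-memory wire is strengthened when
    the input 'fires together' with the label. Names are illustrative.
    """
    return w + np.outer(x, l)

# Two 9-bit training events, one per class; dividing each column of w by the
# number of patterns in that class afterwards gives the averaged weights
# (the class means of the k-means analogy).
w = np.zeros((9, 2))
w = hebbian_supervised_update(w, x=[1,0,0, 1,0,0, 1,1,1], l=[1, 0])   # class 1
w = hebbian_supervised_update(w, x=[1,1,1, 0,1,0, 0,1,0], l=[0, 1])   # class 2
```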

Implementation of learning and testing

As the testing phase directly builds on our previous work on DNA-based winner-take-all neural networks4, we describe this phase first, before explaining the learning phase. The weighted sum function is divided into weight multiplication and summation (Fig. 1c,d), owing to an architectural constraint that catalytic reactions in the seesaw motif support fan-out but not fan-in37. Weight multiplication and summation are then translated into a catalytic and stoichiometric reaction, respectively. The fan-out of each input multiplied by weights in two or more memories is implemented using an amplification gate motif, whereas the fan-in of weighted sums from all inputs is implemented using an integration gate motif (Extended Data Fig. 3). The catalytic property of the weight multiplication reaction allows for arbitrary analogue weights, including values greater than one. The thresholding reaction (with a threshold species Thi representing thi = 0.5) is not explicitly translated from the mathematical function, but enables the clean-up of noisy input signals. The combination of thresholding and catalysis implements a signal restoration function that converts mildly corrupted inputs (high background or signal decay) to ideal binary signals. The winner-take-all function is implemented using pairwise annihilation (facilitated by annihilator Anhi,j) that enables a competition between any two signals, allowing them to turn each other off, and signal restoration (facilitated by restoration gate RGj and fuel YFj) that amplifies the winner species to an ideal on signal. A reporting reaction (facilitated by reporter Repj) is used to stoichiometrically convert an output signal to fluorescence for readout in experiments.
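
A minimal mass-action sketch of these testing-phase reactions for a toy two-input, two-memory network is shown below. The reaction set follows the description above in idealized form; the rate constants, concentrations and the first-order simplification of the summation step are our assumptions rather than fitted or measured values.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Idealized mass-action model of the testing-phase reactions described above
# (thresholding, catalytic weight multiplication, summation, pairwise annihilation
# and catalytic restoration; reporting is omitted), for two inputs and two memories.
n, m = 2, 2
kf, ks = 1e-3, 1e-5        # /nM/s; the fast rate must greatly exceed the slow rate
k1 = 1e-3                  # /s; summation simplified to a first-order step

def pack(x, th, w, p, s, rg, y):
    return np.concatenate([x, th, np.ravel(w), np.ravel(p), s, rg, y])

def unpack(z):
    x, th, w, p, s, rg, y = np.split(z, np.cumsum([n, n, n * m, n * m, m, m]))
    return x, th, w.reshape(n, m), p.reshape(n, m), s, rg, y

def testing_odes(t, z):
    x, th, w, p, s, rg, y = unpack(z)
    dx  = -kf * x * th                             # thresholding: X_i + Th_i -> waste
    dth = -kf * x * th
    dw  = -ks * x[:, None] * w                     # weight multiplication: X_i catalyses W_ij -> P_ij
    dp  =  ks * x[:, None] * w - k1 * p            # P_ij is summed into S_j
    ds  =  k1 * p.sum(axis=0) - kf * s * s[::-1]   # summation; annihilation S_1 + S_2 -> waste
    drg = -ks * s * rg                             # restoration: S_j catalyses RG_j -> Y_j
    dy  =  ks * s * rg                             # ON level set by the restoration gate amount
    return pack(dx, dth, dw, dp, ds, drg, dy)

# Test input x = (1, 0) against memory 1 = (1, 0) and memory 2 = (0, 1); 1x = 50 nM, th_i = 0.5x.
z0 = pack(x=[50, 0], th=[25, 25], w=np.array([[50.0, 0.0], [0.0, 50.0]]),
          p=np.zeros((n, m)), s=[0, 0], rg=[50, 50], y=[0, 0])
sol = solve_ivp(testing_odes, (0, 48 * 3600), z0, method="LSODA")
print(unpack(sol.y[:, -1])[-1])                    # outputs: Y_1 high, Y_2 near zero
```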

The training phase cannot be implemented with the previously developed reaction mechanisms in the seesaw motif. The key function here is the summed multiplication of binary input and label signals (Fig. 1b), which could be implemented with Xi + Lj → Wi,j, where the total concentration of Wi,j is accumulated over all training patterns that have the same label. However, a problem is that, unlike the product species for all reactions in the testing phase, Wi,j is not a signal species but a double-stranded gate species. To translate a signal species into a gate species, we developed an activatable amplification gate motif where the presence and absence of an activator signal determines the on and off states of the gate, respectively (Extended Data Fig. 3). Using this motif, we can then separate the training process into two reactions: supervised learning Xi + Lj → Acti,j, where an input and label strand collectively produce an activator, and weight activation \({{\rm{Act}}}_{i,j}+{W}_{i,j}^{* }\to {W}_{i,j}\), where the activator turns on a specific weight gate.

The supervised learning reaction can be implemented with an activatable transformation gate motif (Extended Data Fig. 3), using the label as an activator for a learning gate that is initially off (\({{\rm{Act}}}_{i,j}^{* }\)) but can be turned on to react with an input in a training pattern and produce the weight activator (Acti,j). To focus on the essential concept of learning while simplifying other aspects of the algorithm, the scaling of 1/qj (Fig. 1b) is implicitly achieved by using a lower amount of the input strand per training pattern when more patterns are used. Lastly, similar to the thresholding reaction, although not explicitly needed, a label inhibition reaction (facilitated by inhibitor Inhj) is introduced to clean up excess label between training events, facilitating accurate learning.
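
The corresponding training-phase reactions can be sketched in the same idealized way for a single bit and class. The 5× excess label and the 24 h and 2 h incubation times follow the experimental procedure described in the Fig. 4 caption; the rate constant and the inhibitor amount are illustrative assumptions.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Idealized mass-action sketch of the training-phase reactions for a single bit i
# and class j: supervised learning X_i + L_j -> Act_ij, then label inhibition
# L_j + Inh_j -> waste after the training event. Weight activation
# (Act_ij + W*_ij -> W_ij) only occurs later, when the processor is added for testing.
k = 1e-4                                          # /nM/s, illustrative

def training_odes(t, z):
    x, l, act, inh = z
    learn   = k * x * l                           # X_i + L_j -> Act_ij (irreversible by design)
    inhibit = k * l * inh                         # L_j + Inh_j -> waste
    return [-learn, -learn - inhibit, learn, -inhibit]

z0 = [50.0, 250.0, 0.0, 0.0]                      # 1x input, 5x excess label, no inhibitor yet
phase1 = solve_ivp(training_odes, (0, 24 * 3600), z0, method="LSODA")
x, l, act, _ = phase1.y[:, -1]
phase2 = solve_ivp(training_odes, (0, 2 * 3600), [x, l, act, 300.0], method="LSODA")
print(phase2.y[:, -1])                            # Act_ij ~ 50 nM stored; leftover label cleaned up
```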

Combining the training phase and testing phase (Extended Data Fig. 2c), the resulting DNA neural network has five layers (Fig. 1d). The first two layers each utilize one of the two types of activatable gate motif discussed above: the output of one is an activator signal (Acti,j), whereas the output of the other is a regular signal (product Pi,j of weight multiplication). This difference in output format may seem insignificant, but along with other desired properties of the two motifs, it led to many important details within the molecular design that will be discussed in the section on motif characterization.

Compared with the DNA neural network without learning capabilities4, this neural network may seem only mildly more complicated with one additional layer, but the network topology has advanced in a non-trivial fashion. The same set of input signals can participate in both the learning and testing phase, reacting with the activatable gates in both of the first two layers. When all gates are present, the only information that determines which gate the input actually reacts with is whether the gate is on or off. When the label is present, it turns on the learning gate; when the activator produced from learning is present, it turns on the weight gate. Undesired spurious activation within these two types of gate will alter the interpretation of the input, for example, confusing a test pattern as a training pattern. Moreover, the DNA neural network without learning capabilities4 used sparse weight matrices to efficiently perform pattern classification tasks, where zero weights corresponded to eliminated weight and fuel species. Here, to learn to perform arbitrary pattern classification tasks within a given pattern complexity and class number, the neural network must have the ability to turn on every element in the weight matrices, requiring the presence of all species in the first two layers. These conditions put more stringent requirements on the molecular design, which cannot be addressed at the individual molecule level, but all interlinked issues such as leak, occlusion, crosstalk and reaction reversibility must be considered as a whole at the system level. We discuss all of these issues in an earlier design (Supplementary Note 4) and summarize a set of system-level design criteria that is applied to the final design (Supplementary Note 3).

During the training phase, all species within the first layer of the DNA neural network will be present, creating passive memories represented by the activator species. These species store the full learned information in a one-to-one fashion (Acti,j encodes Wi,j) but they do not react with any input signals. Transition from training to testing is enabled by combining all species in the remaining four layers with the resulting learning mixture into a single test tube. The added species represent a processor with blank memories. Once the activators and weight gates are mixed together, information transfers from the passive memories to the activated memories, allowing the processor to perform pattern classification tasks using the learned memories (Extended Data Fig. 2b).

Implementation of arbitrary chemical reaction networks

Although not fully verified by theory, we speculate that the extended seesaw motifs now lead to a general-purpose implementation of arbitrary chemical reaction networks (Supplementary Note 4.3). Like the original seesaw motif36, every species in the additional motifs remains one- or two-stranded.

Alternative activatable weight design

To turn on a weight Wi,j, the activator Acti,j could be implemented with two consecutive toeholds using an allosteric toehold mechanism40 that allows for the control of toehold availability via a short regulator strand (Supplementary Fig. 10a). Toehold Xit* encodes the input bit information i and toehold Tj* encodes the memory class information j. However, there are several problems with this design. First, encoding the bit information in a short toehold lacks sufficient specificity for 100-bit patterns, giving rise to crosstalk in weight activation (Supplementary Fig. 17 and Supplementary Note 4.8). Second, all input strands have distinct toehold sequences and react with the weight gates at different rates, leading to the lack of synchronization in the production of weighted sum signals. This asynchrony creates the possibility that a fraction of one weighted sum signal might arrive before the other and become amplified without being annihilated, resulting in biased classification decisions. Third, the activator strand violates the system-level three-letter code (Supplementary Fig. 9 and Supplementary Note 3.3), allowing it to occlude and be occluded by other signal strands. Lastly, like the activator, the label must have a complementary toehold to the input (Supplementary Fig. 11a). Because all input strands have distinct toeholds, 100 label strands per class would be necessary to activate the learning gates for 100-bit training patterns. This is both wasteful, given that the label only needs to encode the class information but not the bit information, and problematic, allowing for increased spurious interactions that reduce the performance of learning and testing.

Design considerations for the weight gate

We investigated the impact of the bulge size and the possibility of replacing the bulge with a nick (Supplementary Fig. 19 and Supplementary Note 5.1). As expected, experiments suggested that a larger bulge drove weight activation more effectively but also increased undesired leak between the input or fuel and the weight gate. Replacing the bulge with a nick results in shorter strands but more strands per complex, reducing synthesis errors at the cost of increased stoichiometry errors (Supplementary Note 3.1). Experiments suggested that the nick provided a higher reaction completion but also more leak. A key difference in the leak mechanism comparing the bulge and the nick design is that although the forward reaction is always bimolecular, the reverse reaction is either unimolecular or bimolecular depending on the design. At a relatively low concentration (for example, 50 nM in our experiments), unimolecular reactions are faster than bimolecular reactions, and thus leak in the bulge design is both kinetically slow and thermodynamically unfavoured. For similar reasons, the nick design not only showed worse leak but also more crosstalk in weight activation, as well as worse leak between the learning gate and the weight gate. On the basis of these observations, we chose the weight gate design with a 2-nt bulge. In addition, a clamp cj is necessary to mitigate leak between the learning gate and the weight gate, preventing spontaneous activation without training. In this design, the system-level three-letter code is satisfied by using non-star domains (As, Cs and Ts) of Tj and Ai on the activator and a two-letter code (As and Ts only) for the universal toehold U and bulge B.

Alternative learning gate design

The desired irreversibility of learning could be provided by a double-stranded drain molecule that converts an intermediate waste to inert wastes with no open toeholds (Supplementary Fig. 11a). This design may seem straightforward, but we discovered several problems upon experimental investigation. First, all drains must be available for learning but most of them will not be consumed depending on the training patterns. Unreacted drains will severely occlude the weight gates in testing, preventing them from being effectively activated (Supplementary Fig. 14 and Supplementary Note 4.5). Second, shortening the toehold on the drain improves the occlusion but sacrifices the robustness of irreversibility, especially when fluorophore and quencher modifications are used to monitor the learning process (Supplementary Fig. 15 and Supplementary Note 4.6). Lastly, mismatches can be introduced to improve irreversibility, but they lead to substantial leak between the learning gates and weight gates, causing spurious memories without training (Supplementary Fig. 16 and Supplementary Note 4.7).

Errors in learned weights

Interestingly, error analysis identified a few bits (27, 85 and 86) that consistently had lower values than expected across all training patterns and both memories (Fig. 4c and Extended Data Fig. 8). Simulations suggested some signal loss during training (Supplementary Fig. 5) but did not explain why certain bits performed worse. We hypothesized that the issue lay in the DNA sequences of the learning gates associated with these bits—our sequence design criteria did not account for intermediate structures in the learning reaction, such as the intermediate activator in which the label is bound to the gate before the input has reacted (Fig. 2e). To investigate further, we used NUPACK63,64 to analyse the secondary structures of all 100 intermediate activators in each memory (Supplementary Fig. 6c). A well-formed structure should have the toehold U* available for binding to the input strand. However, in problematic structures, the toehold partially binds to the Xib* domain owing to the destabilizing effect of the Tj bulge loop. NUPACK analysis predicted that the intermediate activators for bits 27, 85 and 86 were among the worst malformed structures, supporting our hypothesis. Simulations with adjusted reaction rates based on toehold availability showed better agreement with experimental observations (Supplementary Fig. 6d–f). In future work, more stringent sequence design criteria could be applied to avoid malformation in all intermediate structures.
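
One simple way to quantify this effect from predicted structures is to measure how much of the U* toehold remains unpaired in each intermediate activator. The sketch below operates on dot-bracket strings such as those produced by NUPACK; the example structures and the toehold positions are hypothetical.

```python
def toehold_availability(dot_bracket, toehold_span):
    """Fraction of unpaired bases in a toehold, given a dot-bracket structure.

    A post-processing sketch for secondary-structure output such as NUPACK MFE
    structures: '.' marks an unpaired base, '(' and ')' mark paired bases.
    toehold_span is the (start, end) index range of the U* domain within the
    strand; the positions and structures below are hypothetical examples.
    """
    start, end = toehold_span
    region = dot_bracket[start:end]
    return region.count('.') / len(region)

# A well-formed intermediate keeps U* open; a malformed one partially pairs it.
well_formed = ".......((((((((((....))))))))))........."
malformed   = "..(((..((((((((((....)))))))))).)))....."
for structure in (well_formed, malformed):
    print(toehold_availability(structure, toehold_span=(0, 7)))   # 1.0 versus ~0.57
```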

Lessons for engineering complex molecular systems

Two important lessons that we learned for engineering complex molecular systems are as follows. First, a debugging strategy that focuses on individual challenges in isolation can fail: a solution for one problem may give rise to another problem somewhere else in the system and, with further cascading, may in the worst case form a deadlocked cycle. After understanding this failure mode, we arrived at an alternative strategy where all challenges are considered as a whole and solutions are devised to address the entire body of challenges simultaneously (Supplementary Note 4.9). Second, substantial effort may be wasted if there is no approach to differentiate fabrication problems from design problems. For example, we discovered severe and uneven sample evaporation in source plates for a liquid handler, resulting in wildly inaccurate concentrations that directly affect the computation of the molecular system. Instead of just relying on a better sample storage method, we established a systematic approach to regularly evaluate the sample quality and reorder new strands whenever needed (Supplementary Note 5.13).