Introduction

Higher-order networks1,2,3,4,5 are key to capturing many-body interactions present in complex systems. Inferring higher-order interactions6,7,8,9,10,11 from real, pairwise network datasets is recognized as one of the central challenges in the study of higher-order networks2,12, with wide applicability across different scientific domains, from biology and brain research13,14,15 to finance16,17. Mining higher-order interactions from the exclusive knowledge of the pairwise networks typically involves generative models and Bayesian approaches based on network structural properties6,7,9,11,18. Note, however, that when the inference is performed on the basis of the knowledge of the nodes’ dynamical states8,10, inferring higher-order interactions also requires dynamical considerations.

Triadic interactions19 are a fundamental type of signed higher-order interaction that are gaining increasing attention from the statistical mechanics community19,20,21,22,23,24, since they are not reducible to hyperedges or simplices. A triadic interaction occurs when one or more nodes regulate the interaction between two other nodes. The regulator nodes may either enhance or inhibit the interaction between the other two nodes. Triadic interactions are known to be important in various systems, including: ecosystems25,26,27 where one species can regulate the interaction between two other species; neuronal networks28, where glial cells regulate synaptic transmission between neurons, thereby controlling brain information processing; and gene regulatory networks29,30, where a modulator can promote or inhibit the interaction between a transcription factor and its target gene. There is mounting evidence that triadic interactions can induce collective phenomena and/or modulate dynamical states that reveal important aspects of complex system behavior19,20,21,22,23,24,31,32. An important advance in this line of research is triadic percolation19,20,21, a theoretical framework that captures the non-trivial dynamics of the giant component. Moreover, recent results demonstrate that triadic interactions can have significant effects on stochastic dynamics24 and learning22,23. However, despite the increasing attention that higher-order interactions are receiving, the detection of triadic interactions from network data and node time series, is an important scientific challenge that has not been thoroughly explored29,33,34.

In this article, we formulate the triadic perceptron model (TPM) in which continuous node variables are affected by triadic interactions. Based on the insights gained by investigating this model, we propose an information theoretic approach, leading to the triadic interaction mining (TRIM) algorithm, for mining triadic interactions. The TPM provides evidence of the mechanisms by which a triadic interaction can induce a significant variability of the mutual information between two nodes at the end-points of an edge. The TRIM algorithm leverages this finding and mines triadic interactions using knowledge of the network structure and the dynamical variables associated with the nodes. The significance of each putative triadic interaction is then validated by comparison with two distinct null models.

In this way, the TRIM algorithm can go beyond monotonicity assumptions regarding the functional form of the regulation of the two linked nodes by the third node (which is at the foundation of previously proposed methods29) allowing for broader applications. Significant node triples are also associated with an normalized entropic score function S [0, 1] that quantifies the spread of the conditional joint distribution functions of the variables at the ends of the regulated edge. We test the TRIM algorithm on the benchmark TPM, demonstrating its efficiency in detecting true triadic interactions. We also use the TRIM model to mine triadic interactions from gene-expression to identify “trigenic” processes35. We demonstrate that the TRIM algorithm is able to detect known interactions as well as propose a set of new candidate interactions that can then be validated experimentally.

Results

Triadic interactions

A triadic interaction occurs when one or more nodes modulate (or regulate) the interaction between two other nodes, either positively or negatively. A triadic interaction network is a heterogeneous network composed of a structural network and a regulatory network encoding triadic interactions (Fig. 1). The structural network GS = (VES) is formed by a set V of N nodes and a set ES of L edges. The regulatory network GR = (VESER) is a signed bipartite network with one set of nodes given by V (the nodes of the structural network), and another set of nodes given by ES (the edges of the structural network) connected by the regulatory interactions ER of cardinality \(| {E}_R|=\hat{L}\). The signed regulatory network can be encoded as an L × N matrix K where Kℓi = 1 if node i activates the structural edge , Kℓi = −1 if node i inhibits the structural edge and Kℓi = 0 otherwise. If Kℓi = 1, then the node i is called a positive regulator of the edge , and if Kℓi = −1, then the node i is called a negative regulator of the edge . It is worth noting that node iV cannot serve as both a positive and negative regulator for the same edge at the same time. However, node i can act as a positive regulator for edge while simultaneously functioning as a negative regulator for a different edge \({\ell }^{{\prime} } \,\ne \; \ell\).

Fig. 1: Schematic representation of triadic interactions.
Fig. 1: Schematic representation of triadic interactions.
Full size image

a A triadic interaction occurs when a node Z, called a regulator node, regulates (either positively or negatively) the interaction between two other nodes X and Y. The regulated edge can be conceptualized as a factor node (shown here as a cyan diamond). b A network with triadic interactions can be seen as a network of networks formed by a simple structural network and by a bipartite regulatory network between regulator nodes and regulated edges (factor nodes).

The triadic perceptron model (TPM)

Here, we formulate a model for node dynamics in a network with triadic interactions that we call TPM. The TPM acts as a benchmark to validate the TRIM algorithm proposed here. We assume that each node i of the network is associated with a dynamical variable \({X}_{i}\in {\mathbb{R}}\), and that the dynamical state of the entire network is encoded in the state vector \(X={({X}_{1},{X}_{2}\ldots,{X}_{N})}^{T}\). The topology of the structural network is encoded in the graph Laplacian matrix L with elements

$$\begin{array}{rc}{L}_{ij}&=\left\{\begin{array}{ll}-{a}_{ij}J\quad \hfill&\,{{\rm{if}}}\,\,\,i \; \ne \; j,\hfill\\ {\sum}_{k}{a}_{ik}J\quad &\,{{\rm{if}}}\,\,\,i=j,\end{array}\right.\end{array}$$
(1)

where a is the adjacency matrix of the network of elements aij, and J > 0 is a coupling constant. In the absence of triadic interactions, we assume that the dynamics of the network is associated with a Gaussian process implemented as the Langevin equation

$$\frac{dX}{dt}=-\frac{\delta {{{\mathcal{H}}}}}{\delta X}+\Gamma {{{\boldsymbol{\eta }}}}(t),$$
(2)

with the Hamiltonian

$$\begin{array}{r}{{{\mathcal{H}}}}=\frac{1}{2}{X}^{\top }({{{\bf{L}}}}+\alpha {{{\bf{I}}}})X,\end{array}$$
(3)

where Γ > 0, α > 0, and where η(t) indicates uncorrelated Gaussian noise with

$$\left\langle {\eta }_{i}(t)\right\rangle=0,\quad \left\langle {\eta }_{i}(t){\eta }_{j}({t}^{{\prime} })\right\rangle={\delta }_{ij}\delta (t-{t}^{{\prime} }),$$
(4)

for all t and \({t}^{{\prime} }\). The resulting Langevin dynamics are given by

$$\frac{dX}{dt}=-({{{\bf{L}}}}+\alpha {{{\bf{I}}}})X+\Gamma {{{\boldsymbol{\eta }}}}(t).$$
(5)

We remark that the Hamiltonian \({{{\mathcal{H}}}}\) has a minimum for X = 0, and its depth increases as the value of α increases. In a deterministic version of the model (Γ = 0), the effect of the structural interactions will not be revealed at stationarity. However, the Langevin dynamics with Γ > 0 encode the topology on the network. Indeed, at equilibrium, the correlation matrix \({C}_{ij}={\mathbb{E}}(({X}_{i}-{\mathbb{E}}({X}_{i}))({X}_{j}-{\mathbb{E}}({X}_{j})))\) is given by

$${C}_{ij}=\frac{{\Gamma }^{2}}{2}{[{{{\bf{L}}}}+\alpha {{{\bf{I}}}}]}_{ij}^{-1},$$
(6)

see Supplementary Information (SI) for details. In other words, from the correlation matrix, it is possible to infer the Laplacian, and hence the connectivity of the network.

We now introduce triadic interactions in the TPM. As explained earlier, a triadic interaction occurs when one or more nodes modulate the interaction between another two nodes. To incorporate triadic interactions into the network dynamics, we modify the definition of the Laplacian operator present in the Langevin equation. Namely, we consider the Langevin dynamics

$$\frac{dX}{dt}=-({{{{\bf{L}}}}}^{({{\rm{T}}})}+\alpha {{{\bf{I}}}})X+\Gamma {{{\boldsymbol{\eta }}}}(t),$$
(7)

obtained from Eq. (2) by substituting the graph Laplacian L with the triadic Laplacian L(T) whose elements are given by

$$\begin{array}{rc}{L}_{ij}^{(\,{{\rm{T}}})}&=\left\{\begin{array}{ll}-{a}_{ij}{J}_{ij}(X)\quad &\,{{\rm{if}}}\,\,\,i \; \ne \; j,\\ {\sum }_{k=1}^{N}{a}_{ik}{J}_{ik}(X)\quad &\,{{\rm{if}}}\,\,\,i=j.\end{array}\right.\end{array}$$
(8)

Moreover, we assume that the coupling constants Jij(X) are determined by a perceptron-like model that considers all the regulatory nodes of the link  = [ij] and the sign of the regulatory interactions. Specifically, if \({\sum }_{k=1}^{N}{K}_{\ell k}{X}_{k}\ge \hat{T}\) then we set Jij = w+; if instead \({\sum }_{k=1}^{N}{K}_{\ell k}{X}_{k} < \hat{T}\) then we set Jij = w_, with \({w}_{+},{w}_{-}\in {{\mathbb{R}}}_{+}\) and w+ > w. Thus,

$${J}_{ij}(X)={w}_{-}+({w}_{+}-{w}_{-})\theta \left(\mathop{\sum }_{k=1}^{N}{K}_{\ell k}{X}_{k}-\hat{T}\right),$$
(9)

where θ( ) is the Heaviside function (θ(x) = 1 if x ≥ 0 and θ(x) = 0 if x < 0). Note that, in the presence of triadic interactions, the stochastic differential equation (7) is not associated with any Hamiltonian, and a stationary state of the dynamics is not guaranteed, making this dynamical process significantly more complex than the original Langevin dynamics given in Eq. (2). The TPM is related to a recently proposed model that captures information propagation in multilayer networks24, but the TPM does not make use of a multilayer representation of the data. Moreover, the TPM is significantly different from models of higher-order interactions previously proposed in the context of consensus dynamics36,37 or contagion dynamics38,39. Indeed, in our framework, triadic interactions between continuous variables are not reducible to standard higher-order interactions because they involve the modulation of the interaction between a pair of nodes. Moreover, this modulation of the interaction is not dependent on the properties of the interacting nodes and their immediate neighbors, as is the case in36,37 or in the machine learning attention mechanism40. On the contrary, the modulation of the interaction is determined by a third regulatory node (or a larger set of regulatory nodes) encoded in the regulatory network.

The TPM for continuous node dynamics in the presence of triadic interactions is very general and comprehensively expresses the modulation of structural interactions by other nodes in the network. Therefore, the dynamics of TPM cannot be reduced to dynamics exclusively determined by pairwise interactions. An important problem that then arises is whether such interactions can be mined from observational data. To address this issue, we will develop a new algorithm—that we call the TRIM algorithm—to identify triadic interactions from data, and we will test its performance on the data generated from the TPM model described above.

Mining triadic interactions with the TRIM algorithm

We propose the TRIM algorithm (see Fig. 2) to mine triadic interactions among triples of nodes. To simplify the notation, we will use the letters XYZ to indicate both nodes as well as their corresponding dynamical variables.

Fig. 2: Illustration of the TRIM algorithm.
Fig. 2: Illustration of the TRIM algorithm.
Full size image

The TRIM algorithm identifies triples of nodes X, Y, and Z involved in a putative triadic interaction, starting from the knowledge of the structural network and the dynamical variables associated with its nodes. For each putative triple of nodes involved in a triadic interaction (a), which belongs to a network whose structure and dynamics are known (b), we study the functional behavior of the conditional mutual information MIz (c), and assess the significance of the observed modulations of MIz with respect to a null model (d). Given a predetermined confidence level, we can use these statistics to identify significant triadic interactions (e). This procedure can be extended to different triples of the network, thereby identifying the triadic interactions present in it (f).

Given a structural edge between nodes X and Y, our goal is to determine a confidence level for the existence of a triadic interaction involving an edge between node X and node Y with respect to a potential regulator node Z. Specifically, we aim to determine whether the node Z regulates the edge between node X and node Y, given the dynamical variables X, Y and Z associated with these nodes. To do so, given a time series associated with node Z, we first sort the Z-values, and define P bins in terms of the quantiles of z, chosen in such a way that each bin mz comprises the same number of data points (ranging in our analyses from 30 to 100). We indicate with zm the quantile of Z corresponding to the percentile m/P. Therefore, each bin mz indicates data in which Z ranges in the interval [zmzm+1). We indicate with μ(xzm), μ(yzm) and μ(xyzm) the probability density of the variables X, Y and the joint probability density of the variables X and Y in each mz bin.

A triadic interaction is taken to occur when the node Z affects the strength of the interaction between the other two nodes X and Y. Consequently, our starting point is to consider the mutual information between the dynamical variables X and Y conditional to the specific value of the dynamical variable Z. We thus consider the quantity MIz(m) = MI(XYZ = zm) defined as

$$MIz(m)=\int\,dx\int\,dy\mu (x,y| {z}_{m})\log \left(\frac{\mu (x,y| {z}_{m})}{\mu (x| {z}_{m})\mu (y| {z}_{m})}\right).$$

In order to estimate this quantity, we rely on non-parametric methods based on entropy estimation from k-nearest neighbors41,42,43 (see SI for details). For each triple of nodes, we visualize the mutual information MIz computed as a function of the m/P-th quantiles zm and fit this function with a decision tree comprising r splits.

In the absence of triadic interactions, we expect MIz to be approximately constant as a function of the m/P-th quantiles zm, while, in the presence of triadic interactions, we expect this quantity to vary significantly as a function of zm. The discretised conditional mutual information CMI between X and Y conditioned on Z can be written as

$$\begin{array}{r}CM{I}_{X,Y;Z}={\sum }_{m=0}^{P-1}p({z}_{m})MIz(m)=\left\langle MIz\right\rangle,\end{array}$$
(10)

where p(zm) = 1/P indicates the probability that the Z value falls in the mz bin. This quantity indicates important information about the interaction between the nodes X and Y when combined with the information coming from the mutual information MI, given by

$$M{I}_{X,Y}=\int\,dx\int\,dy\mu (x,y)\log \left(\frac{\mu (x,y)}{\mu (x)\mu (y)}\right),$$

where μ(x), μ(y) are the probability density functions for X and Y, and μ(xy) is the joint probability density function of the variables X and Y. The conditional mutual information, however, is not sensitive to variations in MIz and does not therefore provide the information needed to detect triadic interactions. In order to overcome this limitation, we define the following two quantities that measure how much the mutual information between X and Y conditioned on Z [zmzm+1) changes as zm varies. Specifically, we consider:

  1. (1)

    the standard deviation Σ of MIz, defined as

    $$\begin{array}{r}\Sigma=\sqrt{{\sum }_{m=0}^{P-1}p({z}_{m}){\left[MIz(m)-\left\langle MIz\right\rangle \right]}^{2}};\end{array}$$
    (11)
  2. (2)

    the difference T between the maximum and average value of MIz, given by

    $$\begin{array}{r}T={\max }_{m=0,\ldots,P-1}\left\vert MIz(m)-\left\langle MIz\right\rangle \right\vert .\end{array}$$
    (12)

The quantities Σ and T collectively measure the strength of the triadic interaction under question and can thus be used to mine triadic interactions in synthetic as well as in real data. In order to assess the significance of the putative triadic interaction, we compare the observed values of these variables to the results obtained with given null models. In order to determine if the observed values are significant with respect to a given null model, we compute the scores ΘΣ, ΘT, given by

$${\Theta }_{\Sigma }=\frac{\Sigma -{\mathbb{E}}({\Sigma }^{{{{\rm{ran}}}}})}{\sqrt{{\mathbb{E}}({({\Sigma }^{{{{\rm{ran}}}}})}^{2})-{({\mathbb{E}}({\Sigma }^{{{{\rm{ran}}}}}))}^{2}}},\\ \quad {\Theta }_{T}=\frac{T-{\mathbb{E}}({T}^{{{{\rm{ran}}}}})}{\sqrt{{\mathbb{E}}({({T}^{{{{\rm{ran}}}}})}^{2})-{({\mathbb{E}}({T}^{{{{\rm{ran}}}}}))}^{2}}},$$
(13)

and the p-values

$$\begin{array}{r}{p}_{\Sigma }={\mathbb{P}}({\Sigma }^{{{{\rm{ran}}}}} > \Sigma ),\quad {p}_{T}={\mathbb{P}}({T}^{{{{\rm{ran}}}}} > T),\end{array}$$
(14)

Note that if we consider \({{{\mathcal{N}}}}\) realizations of the null model, we cannot estimate probabilities smaller than \(1/{{{\mathcal{N}}}}\). Therefore, if in our null model we observe no value of Σran larger than the true data Σ, we set the conservative estimate \({p}_{\Sigma }=1/{{{\mathcal{N}}}}\). A similar procedure is applied also to pT.

To assess this significance, we consider two types of null models. The first is the randomization null model obtained by shuffling the Z values, to give \({{{\mathcal{N}}}}\) randomization of the data, i.e., we use surrogate data for testing44,45. The second is the maximum likelihood Gaussian null model between the three nodes involved in the triple X,Y,Z. Specifically, the Gaussian null model uses the mean and covariance of the timeseries of X,  Y and Z to define a multivariate normal distribution from which samples are randomly drawn, thereby providing surrogate timeseries values for the considered triple. We note that the use of these two null models also allows us to identify non-monotonic relationships between MIz and z, thereby going beyond underlying monotonic assumptions made elsewhere46. The first null model disrupts the temporal correlations between the timeseries of the node Z and the timeseries the two nodes X and Y at the endpoints of the considered edges. Therefore, this null model is robust with respect to the presence of possible outliers in the dataset. However, this first null model may overlook confounding network effects that affect correlations between the dynamical variables. The second null model more efficiently captures correlations between the dynamical state of the three considered nodes due to network effects, but is more sensitive to the presence of outliers in the data. To increase confidence, we therefore combined the insights coming from both these null models (see SI for details).

For each triple, the function MIz(m) is fitted with a decision tree with two splits. In this way, three different intervals of values of Z are identified, each corresponding to a distinct functional behavior of the correlation functions between the variables X and Y. While our method in principle allows for more than two splits of the decision tree, for illustrative purposes we have chosen two splits since this is the minimum number of splits needed to capture non-trivial functional behavior in MIz, such as non-monotonicity. In practice, this choice of two splits will also be the best choice when data is limited, such as the gene expression data we will analyze in the following section.

We also further characterize significant triples by examining their normalized entropic score function S [0, 1], which is used to characterize their corresponding functional behavior. Specifically, the entropic score S classifies the diversity of each of the joint distribution functions of X and Y conditioned on Z for each interval obtained through the decision tree (see Methods for details).

As we will discuss below, the algorithm performs well on data obtained from the TPM. In this case, we also observe that true triadic triples are characterized by a high entropic score S. On real data, the results obtained with the TRIM algorithm using randomized surrogate data might neglect potentially important network effects, this shortcoming is mitigated by performing an additional validation using the Gaussian null model and the entropic score S (see SI for the full pipeline of TRIM).

Validation of the TRIM algorithm on the triadic perceptron model

In order to discuss the phenomenology of the TPM, we first considered a representative network (see Fig. 3) of N = 10 nodes, L = 12 edges and \(\hat{L}=5\) triadic interactions (each formed by a single node regulating a single edge) on top of which we consider the TPM.

Fig. 3: Illustration of the role of triadic interactions in modulating the mutual information between linked nodes.
Fig. 3: Illustration of the role of triadic interactions in modulating the mutual information between linked nodes.
Full size image

We consider a network with N = 10 nodes, L = 12 edges, and \(\hat{L}=5\) triadic interactions (a). b, c Display the effect of triadic interactions on the Mutual Information profile MIz. b shows MIz for the triple [4, 9, 5] involved in a positive triadic interaction. c Shows MIz for the triple [1, 2, 6] that is not involved in a triadic interaction. In all panels, simulations were run to \({t}_{\max }=4,000\) with a timestep of dt = 10−2. For the analysis we consider 40, 000 time steps. The parameters of the model are: \(\alpha=0.05,\hat{T}=1{0}^{-3},\Gamma=1{0}^{-2}\), w+ = 8, w = 0.5, number of bins P = 400.

We found that data obtained from the TPM on this network shows a strong dependence of MIz(mz) on mz for the triples of nodes involved in triadic interactions, with greater significance for smaller values of α. Figure 3 shows the difference between the MIz(m) profile of a triple that is involved in a triadic interaction compared to a triple that is not, demonstrating how triadic interactions modulate the MIz(mz) profile.

Moreover, Fig. 4 shows, for a given triadic interaction involving nodes X, Y and Z, the joint distributions μδ(XY) of X and Y for each interval δ of values Z determined by the decision tree. The results provide evidence of this interesting dynamical behavior of the triadic model in the case of a positive regulatory interaction. Note that the analysis of the form of the function MIz(mz) also allows us to distinguish between positive and negative regulatory interactions, which are associated with an increase or a decrease in MIz for larger values of mz, respectively.

Fig. 4: Representative results for triples of nodes involved in triadic interactions in the continuous model with triadic interactions.
Fig. 4: Representative results for triples of nodes involved in triadic interactions in the continuous model with triadic interactions.
Full size image

Results for the triple [4, 9, 5] of the network in Fig. 3 of the main text, which is triadic, are shown. The joint distributions of variables X and Y conditional on the values of Z are shown in a. b Shows the behavior of MIz as a function of the values of zm, which clearly departs from the constant behavior expected in absence of triadic interactions. c Presents the decision tree for fitting the MIz functional behavior and determining the range of values of Z for which the most significant differences among the joint distributions of the variables X and Y conditioned on Z are observed. The parameters used are the same as in Fig. 3.

In the Supplementary Figs. S1 and S2, we display further examples of the function MIz for triadic triples. We observe the increased variability of the MIz functional behavior as the parameter Γ is raised, i.e., the noise increases.

These results confirm the main general principle on which the TRIM algorithm is based, i.e., that the conditional mutual information MIz is modulated by triadic interactions. To make this observation precise, we examined the performance of the TRIM algorithm in mining triadic interactions from synthetic data. We first considered the network shown in Fig. 3, and using the score ΘΣ, we evaluated the receiver operating characteristic (ROC) curve and precision recall (PR) curve for different values of the dynamical parameters (see Fig. 5). Both the ROC curve and the PR curve (which addresses the limitations of the ROC curve for imbalanced datasets) indicate that the TRIM algorithm performs well on data produced by the TPM, with a better performance for higher values of α.

Fig. 5: The performance of the TRIM algorithm on the test bench network of 10 nodes.
Fig. 5: The performance of the TRIM algorithm on the test bench network of 10 nodes.
Full size image

We consider the network in Fig. 3a. The time series obtained by integrating the stochastic dynamics of the proposed dynamical model for triadic interactions (Eq. (7)) are analyzed with the TRIM algorithm. a Displays the receiver operating characteristic curve (ROC curve) obtained by running TRIM with P = 400 bins and \({{{\mathcal{N}}}}=1{0}^{3}\) realizations of the null model on these synthetic time series, using ΘΣ to score for different parameters values indicated in the legend. b Displays the corresponding precision-recall curve (PR curve) obtained by running TRIM with the same parameters. The timeseries are simulated up to a maximum time tmax = 4000 with a dt = 10−2. For the analysis, we consider 40, 000 time steps (see the SI for details). The parameter of the model are: \(\hat{T}=1{0}^{-3}\), w+ = 8, w = 0.5, and α and Γ as indicated in the figure legend.

For all parameter values, we noticed that false positives are more likely to involve short-range triples, i.e., triples in which the regulator node Z is close (in the structural network) to the end-points X and Y of the target edge.

These results indicate that the TRIM algorithm is effective in identifying triadic interactions in a small network generated using the TPM. To examine the scalability of this methodology we also tested the TRIM algorithm on a much larger model network. To this end, we consider a random Erdös-Renyi network of 100 nodes, and average degree c = 4, to which we added 25 random triadic interactions (i.e., between randomly chosen nodes to randomly chosen edges), imposing the condition that each edge is at most regulated by a single node for simplicity. The results of this analysis are shown in Fig. 6a, in which we provide statistics for all possible node triples in the network (the majority of which are not triadic interactions). For each edge, we retained only the 5 most significant triples according to ΘΣ. By conditioning on the value of the third node, for each of these connected nodes we also record the conditional mutual information CMI. In Fig. 6, each considered triple corresponds to a point, color coded according to the value of S. Stars indicate triples that are involved in a triadic interactions (see SI for details). True triadic interactions are found for triples with high ΘΣ, while CMI span between high and intermediate values. This result confirms the very good performance of the TRIM algorithm on the data coming form the TPM.

Fig. 6: Performance of the TRIM algorithm on a random network with triadic interactions with N = 100 nodes.
Fig. 6: Performance of the TRIM algorithm on a random network with triadic interactions with N = 100 nodes.
Full size image

a Each data point represents a given triple of nodes X, Y and Z. The y-axis shows ΘΣ, while the x-axis shows the CMI between X and Y. The color of each point corresponds to the value of S, which characterizes the entropic score of the triple. The synthetic data comes from a structural random Erdös-Renyi network with 100 nodes, and average degree c = 4, to which 25 triadic interactions between random edges and random nodes have been added. For the modeling of the network we used α = 0.06, Γ = 1.4 × 10−2tmax = 1500, w+ = 18, w = 0.2 and for the analysis with TRIM we looked at 3000 data points and P = 100 bins. We display top 5 triples for each edge according to ΘΣ that are below our p threshold for the randomization null model. The triples below are represented in the scatter plot, and they all display an entropic score S > 0.5. Stars are the true triadic triples, which are characterized by high ΘΣ. Crosses are the triples that can be excluded by performing TRIM with the Gaussian Null model. b Histogram of the ΘΣ-values for all the triples of the network (in light blue), and for the triples corresponding to the 25 true triadic interactions only (in dark blue). c Histogram of the ΘΣ-values observed in a network of the same topology and with the same dynamical parameters for which all the triadic interactions have been removed (orange).

To test the statistical robustness of the TRIM algorithm, we also conducted the same analysis (i.e., on the same structural network with the same dynamical parameters) in which all triadic interactions were removed. The results of this analysis are shown in Fig. 6c. In this case, and as expected, the TRIM algorithm did not identify any statistically significant triadic interactions. This analysis indicates that the TRIM algorithm is able to identify true triadic interactions with a low false positive discovery rate (compare Fig. 6b, c).

Detecting triadic interactions in gene-expression data

Searching for triadic interactions in gene-expression is a problem of major interest in biology. For instance, understanding the extent to which a modulator promotes or inhibits the interplay between a transcription factor and its target gene is crucial for deciphering gene regulation mechanisms29. In order to address this question with our method, we considered a gene-expression dataset associated with acute myeloid leukemia (AML), extracted from the grand gene regulatory network database47,48.

Exhaustive mining of all potential triadic interactions from every putative triple of nodes in the AML dataset is computationally very demanding (it would require testing of  >260M triples) and likely, due to the sheer number of triples being tested, to result in false positives and/or interactions of less biological importance. Moreover, such a brute-force approach would not account for other sources of important biological information, such as putative interactions derived from other experimental sources. To account for such information, we therefore focused our analysis on edges between nodes associated with known biophysical interactions, as identified in the human protein–protein interaction network (PPI)47. Because proteins function through physical and functional interactions, PPI networks provide a biologically meaningful framework for interpreting gene expression changes in terms of coordinated molecular mechanisms49,50. To do this, we considered the connected subgraph of the human PPI network that contains all the genes/proteins included in the AML gene expression data and their associated edges. This network, which contains 622 nodes and 42, 511 edges, formed the structural network for our analysis51. To start, we focused on triples involving genes known to be associated with AML, in which the end-points X and Y of the target are directly connected in the PPI network (see SI for details).

We then selected additional triples according to their positions in the PPI network’s maximum spanning tree (MST), which only includes 621 edges (see Fig. 7). In order to focus on triples for which network effects are likely to be less pronounced, for each edge in the MST, connecting gene/protein X with gene/protein Y we considered all genes Z within a distance of 4 from both the X and Y as candidate regulatory nodes, i.e., the third node in the triple (see SI for details). For each considered triple of genes, we assessed its significance using ΘΣ as the significance score, with P = 5 bins, using \({{{\mathcal{N}}}}=5\times 1{0}^{3}\) realizations of the randomization null model (very similar results were obtained using ΘT as the significance score, see Supplementary Figs. S3-S4 for a comparison).

Fig. 7: Maximal spanning tree of the relevant genes from the gene expression data.
Fig. 7: Maximal spanning tree of the relevant genes from the gene expression data.
Full size image

Edges and their edge weights were obtained from the protein–protein interaction network. Red-colored nodes are nodes with biological significance, that is that play a critical role in AML54,55,56. Node size is proportional to the node degree.

Figure 8 shows the results of the TRIM algorithm for those triples with pΣ < 0.001. Note that for each selected edge, only the top 5 triples ranked according to the ΘΣ score are depicted. Squares indicate triples chosen from biologically relevant genes for AML. The triples deemed insignificant according to the Gaussian null model, are not shown here. The interested reader can see their visualization in Fig. S6 of the SI. Figure 8 provide examples of conditional distributions of two illustrative triples that rank high in TRIM. Both triples show evidence of a triadic interaction: the triple in panel b is a member of the MST, while panel c shows an example of a triple chosen from known biologically relevant genes for AML. Further example triples are shown in the SI. Interestingly, among the significant triples, we also detected triples in which the modulation of the mutual information is non-monotonic (see SI for details).

Fig. 8: Application of the TRIM algorithm to gene expression data.
Fig. 8: Application of the TRIM algorithm to gene expression data.
Full size image

a Shows the results of TRIM for the significant triples in the AML dataset. The scatter plot shows ΘΣ (y-axis) versus the CMI (x-axis). The colour of each point corresponds to the value of its entropic score S. Here we display only those triples with p-value 0.001 or less in the randomization null model and that have not been excluded by the Gaussian null model (for details about these triples see SI). Circles are triples whose links all appear in the minimum spanning tree, and squares indicate triples involving genes with biological relevance. b, c Display the conditional distributions for two example triples: both are identified by the TRIM algorithm with high significance, suggesting a meaningful biological association. Panel b shows the triple X = GATA1, Y =  KLF1, Z = ETV1. According to the randomized surrogate null model, this triadic interaction has pΣ-value 0.001, ΘΣ = 4.75, Σ = 0.44, S = 0.64; panel c shows the results for the triple X = HOXB3, Y = MEIS1, Z = GLIS3 involving two biologically relevant genes. According to the randomized surrogate null model, it has ΘΣ = 3.98, pΣ = 0.001, Σ = 0.38, S = 0.60.

Many of the genes involved in the 50 highest ranking triples have already been linked to AML in the literature (see SI for Table S3 for a list of highly significant triples and Table S4 with links to the literature associating the involved genes with AML). In total, 84% of the top 50 Triples include at least one gene that has a known association with AML.

Discussion

This work provides a comprehensive information theory-based framework to model and mine triadic interactions directly from dynamic observations. The TPM we propose demonstrates that the presence of a triadic interaction leads to systematic variations in the mutual information between the two end nodes of the edge involved (X and Y). Via this model, we have shown that to detect triadic interactions, it is necessary to go beyond standard pairwise measures, such as the mutual information. Importantly, standard higher-order statistical measures, such as the conditional mutual information, which accounts for the average effect of the third regulatory node Z on the mutual information between the target nodes X and Y, are also insufficient to identify triadic interactions. Our proposed approach, implemented in the TRIM algorithm, mines triadic interactions by identifying statistically significant variations in the mutual information between the two linked nodes conditioned on the third regulator node.

To demonstrate the efficacy of this algorithm, we have tested and validated it on a new dynamical model (that we denote the TPM) and shown how it can identify triadic interactions in randomly generated triadic interaction networks. We also used it to mine putative triadic interactions from gene expression data and connect the putative interactions with meaningful biology.

From the network theory point of view, this work opens new perspectives in the active field of modeling and inference of higher-order interactions and can be extended in many different directions, for instance by exploring the effect of triadic interactions on the dynamical state of nodes associated with discrete variables or including time delays in the regulation. From the biological point of view, our results may inspire further information-theoretic approaches to genetic regulatory network inference. Investigating the extent to which triadic interactions are tissue-specific, and if certain regulatory patterns are conserved across different tissues could yield valuable insights. Our proposed approach could also be used to mine triadic interactions in other domains, such as finance or climate, where triadic interactions also have a significant role.

Methods

Entropic score S for significant triples

In order to identify and classify the significant triples [X,  Y,  Z] involving node X and Y whose interaction is modulated by node Z, we introduce an entropic score function S which characterizes how diverse the conditional joint distributions μδ(X,  Y) of X and Y conditioned on Z in each of the obtained intervals δ {1, 2, 3} are. Dividing the plane X,  Y in P2 squares (ij) (by binning X and Y in P bins each) with \({n}_{ij}^{(\delta )}\) data points, we can calculate the participation ratio \({Y}_{2}^{(\delta )}\)52,53

$$\begin{array}{r}{Y}_{2}^{(\delta )}={\sum }_{i=1}^{P}{\sum }_{j=1}^{P}{\left(\frac{{n}_{ij}^{(\delta )}}{{{{{\mathcal{N}}}}}^{(\delta )}}\right)}^{2},\end{array}$$
(15)

where \({{{{\mathcal{N}}}}}^{(\delta )}={\sum }_{i=1}^{P}{\sum }_{j=1}^{P}{n}_{ij}^{(\delta )}\). The inverse of the partition function is known to measure the effective number of square bins in which the distribution is localized. We can then introduce the normalized entropic score S as

$$\begin{array}{r}S=-\frac{1}{3\ln {P}^{2}}\mathop{\sum }_{\delta=1}^{3}\ln {Y}_{2}^{(\delta )}.\end{array}$$
(16)

The entropy S is low if all the conditional distributions μδ(X,  Y) are very localized while it acquires large values if all the conditional distributions are delocalized. We adopt a threshold S = 0.5 in order to retain triples with S > 0.5, indicating that in average the conditional distributions associated to these triples have more than \(\sqrt{{P}^{2}}\) significantly populated bins.