Introduction

Understanding the patterns of large-scale social networks has been a long-standing challenge. It happened only recently that a clear sign of strong structural balance1 has been observed consistently in multiple large-scale social networks1,2. At least without considering directionality, the theory of strong balance dictates that both the friend of a friend, and the enemy of an enemy is expected to be a friend more often than expected by chance. On the other hand, the enemy of a friend or the friend of an enemy is often an enemy. We represent relationships with a positive sentiment (friends, trust) as positive links, and negatives (foes, distrust) with a negative link. In this representation, an undirected triad is balanced if it contains an even number of negative links (zero or two). While many triads appear to be unbalanced in real networks, the more useful formulation of the problem considers as balanced (unbalanced) triangles the ones that happen more (less) often than expected “by chance” in a randomized network ensemble given by an appropriate null model.

Clearly, null models play a crucial role in network analyses by providing a baseline for comparing observed network structures against random expectations. The primary use of null models in this context is to differentiate between statistically significant patterns and those that could arise by chance given certain node-level constraints. The most basic null model is a random graph with the same number of nodes and links as the observed network, for example given by the Erdős-Rényi model3. For signed networks, this basic model can be extended by randomly assigning positive or negative signs to links with a probability that maintains the overall sign ratio of the empirical network4. While such simple null models preserve global network properties such as density and sign ratio, they fail to capture more detailed structural features. Consequently, when comparing such simplistic null models to empirical social networks, the interpretation of the results often becomes problematic or misleading. Specifically, the observed differences between the null model and the real network conflate multiple factors, blurring the true patterns of social networks1.

The recent confirmation of balance in the undirected case required methodological advances in capturing key constraints in the null models. Both preserving the network topology and the individual sign preferences has been identified as key ingredients for such null models1,2. By preserving the network topology, we separate factors that dictate which links are formed from those that determine the sign of each link, the latter being the subject of questions related to social balance. The network topology is resulting from manifold physical and social constraints on forming connections. For instance, geographic proximity often limits potential interactions5,6,7, while organizational hierarchies can restrict communication pathways8,9. Even in online social networks, where physical distance is less of a barrier, constraints persist in the form of language barriers, cultural differences, and algorithmically curated content exposure10,11. These digital constraints create virtual “neighborhoods” that shape connection patterns in ways analogous to physical proximity in offline networks. If these topological constraints are relaxed in a null model, the resulting randomized networks may exhibit unrealistic connection patterns, leading to an overestimation of the significance of the observed network features.

Additionally, signed node degree in social networks represents the capacity or willingness of individuals to maintain positive or negative connections, which may be subject to cognitive and time limitations12,13. A null model without constraining the signed degree of individual nodes ignores the reality that every node is not equally friendly or antagonistic1. Yet, even in these state-of-the-art studies1,2 a key aspect of the interactions has been ignored, namely that interactions can often happen in one direction only. From the perspective of a node, connections can be viewed as incoming and outgoing connections. Outgoing connections represent sociability while incoming connections may indicate the popularity of an entity. The degrees associated with these two types of connections—in-degree and out-degree—are not necessarily correlated.

As a comprehensive overview, we consider all large-scale signed directed social networks that we could obtain from various fields: (a) Bitcoin-Alpha: a who-trusts-whom network of people who trade using Bitcoin on a platform called Bitcoin Alpha14,15; (b) Bitcoin-OTC: a who-trusts-whom network of people who trade using Bitcoin on a platform called Bitcoin OTC14,15; (c) Slashdot: a friend-or-foes network between users of a technology-related news website called Slashdot16; (d) Epinions: a who-trusts-whom online social network of a general consumer review site called Epinions17; (e) Pardus: an accumulated network of relationships among players of a Massive Multiplayer Online Game (MMOG) called Pardus4. Details of these datasets can be found in Supplementary Fig. 1, Supplementary Tables 15, and Table 1.

Table 1 Overview of the empirical social networks

As for degree correlations, the Bitcoin-Alpha network shows a strong correlation between in- and out-degrees, the Slashdot shows a much weaker correlation, highlighting the importance of directionality (Fig. 1a). Moreover, our friends may not always think about us as their friends, and similarly for our foes. While the reported fraction of reciprocated interactions depends on the dataset18, a large fraction of relationships can be non-reciprocated (Table 1 and Fig. 1b), also depending on whether they were positive (trust/friends) or negative (distrust/enemies). For instance, the Bitcoin-Alpha and Bitcoin-OTC datasets exhibit approximately 80% reciprocated links, whereas in the other three datasets examined, over half of the links are unidirectional. Although it occurs relatively rarely (less than 2% of total links), our friend might also consider us as a foe, leading to a “conflicting” relationship. Such observations indicate that directionality could play a substantial role in shaping higher-order network patterns.

Fig. 1: Overview of distinct link types and degree correlations in empirical networks.
figure 1

a The correlation between in- and out-degree (kinkout) of Bitcoin-Alpha and Slashdot networks. b The correlation between signed total node degree (kall±) and signed reciprocated degree (kr±) of Bitcoin-Alpha and Slashdot networks. ce The Bitcoin-Alpha network decomposed into unidirectional, reciprocated and conflicting links. The percentage of each link type relative to total links and their positive/negative composition are shown below each network layout. Negative links (red) are overlaid on positive links (blue), with node sizes proportional to the total degree. Node positions are computed using a spring-embedded layout algorithm37 considering all links regardless of their signs. f Correlation plots of the primary node degrees associated with each link type in panels (ce). g Correlation plots between degrees correspond to different link types. All correlation plots use logarithmic scales to accommodate the broad range of degree values, though correlation coefficients (rr+r) are computed using linear-scale degree values. Throughout all panels, blue and red consistently represent positive and negative links/degrees, respectively. *Slashdot limits the total number of connections to 200 (or 400 for subscribers)38.

While there is now a well-established notion of balance in undirected networks—aligned with everyday intuition—it is less clear what the correct directed generalization would look like. In this paper, we present a comprehensive roadmap towards understanding balance in signed directed networks, outlining the key steps to be taken and the current challenges. We present an efficient maximum entropy framework and—as a particularly attractive solution—propose a maximally constrained null model. Additionally, we explore possible definitions of strong balance by developing a maximally constrained null model. We find consistency with two of the proposed notions by comparing the relative frequency of different triad types in the observed data versus the maximally constrained null model.

Methods

As a starting point, signed directed networks pose manifold challenges compared to signed undirected networks or unsigned directed networks, contributing to a huge combinatorial space to be explores, characterized by: i) More link types: unidirectional A → B (+ or  −); reciprocated A ↔ B, meaning that the connection exists in both directions with the same sign; and conflicting, meaning that the sign of the two directions differs. ii) More primary node features: signed unidirectional degree (\({k}^{i{n}_{u}\pm },{k}^{ou{t}_{u}\pm }\)); signed reciprocated degree (kr±); signed conflicting degree (kc±, where the indicated sign corresponds to the outgoing arrow). iii) A combinatorial increase in secondary node features, derived from primary features, like the total positive (negative) degree, the total outgoing degree, or the total outgoing positive degree, etc. iv) A drastic increase in the potential network patterns (96 configurations) at the level of fully connected three-node triads, compared to the undirected signed case (4 configurations) and the unsigned directed case (7 configurations), see Fig. 2. As the presence of conflicting links might already violate classical balance theory assumptions19, we consider triads without (Fig. 2c) and with (Fig. 2d) conflicting links separately. v) A proliferation of potential null models depending on which network features are preserved. vi) A broad range of (largely unexplored) potential extensions and alternatives of balance theory for directed networks. vii) A need to decide if counting “motifs” or “graphlets”. In the case of graphlets (also known as induced subgraphs), we look for exact matches of all specified and non-existing links in the pattern. Motifs count every instance in which the specified interactions occur, irrespective of the existence of potential other interactions in the patterns.

Fig. 2: Triad configurations.
figure 2

a Signed triads without directionality. b Directed triads without signs. The alphabet is used to distinguish between different topologies as defined in ref. 39. c Triads without conflicting links: All links within these triads are either consistently positive (blue), negative (red), or unidirectional. d Triads with conflicting links: These triads contain at least one pair of mutual links with conflicting signs (one positive, one negative). Each triad is labeled with an alphanumeric code denoting distinct topological categories in (b).

Most importantly, the choice of null models plays a crucial role in assessing the significance of the observed balance in signed directed networks. Just like in the undirected case, each choice of the preserved network features can lead to a different interpretation of the network structure. Two fundamental properties to consider are the network topology and some variation of (signed and/or directed) node degrees. The decision to preserve network topology depends on the underlying assumptions about the system being studied. Allowing disruption of the network topology in the null model assumes that all connections can be potentially established, which is often unrealistic in many real-world systems. For example, in a product-competition network20, customer choices are restricted to products within their consideration set, forming an underlying network structure of constraints. If we had the constraint data on which connections are allowed to form, we could randomize the topology of the network within the allowed subspace21. For instance, in brain networks, only neurons in physical contact can form synapses to communicate with each other22, imposing inherent contact constraints on possible connections. However, we often do not have access to such detailed information for social networks, leaving two options: either allowing all potential connections or freezing the topology completely.

In the undirected case, the preservation or randomization of the topology and the signed node degrees yields 2 × 2 = 4 potential null models1,2. A natural starting point for the directed case—and our main focus—is the maximally constrained null model that preserves the topology and all primary (and therefore all secondary) node degrees. Note, however, that the landscape of potential null models for directed networks is considerably more complex than in the undirected case. As we mentioned, preserving any consistent subset of the primary and secondary node degrees is a potential mathematical option. While in the undirected case it was possible to follow a step-by-step elimination process until the maximally constrained null model was the only reliable option left1, carrying out a similar process appears to be impractical in the directed case. That means that there could be multiple viable alternative null models to explore. Yet, the most constrained version serves as a good starting point. If we still see a difference compared to this null model, then the datasets clearly have some sign patterns that remained unexplained by the local node-level features or the topology itself—a genuine sign of higher-order network patterns. A less restrictive null model, the “signed directed” null model, only preserves the topology and the signed in- and out- degrees, corresponding to the assumption that all nodes are equally likely to form mutual links. We present the results for both null models in this study to demonstrate the importance of choosing suitable null models to unravel the true patterns of signed directed networks23,24.

Results

The spectrum of directed null models

Network structure is shaped by node-level preferences as well as pairwise or higher-order wiring mechanisms. A key step towards unveiling the wiring mechanisms is to compare with null models that match the key features of individual nodes. Formally, a null model is an ensemble of random graphs that is constrained by some selected features of the original network. In directed networks, it is natural to consider the signed in- and out-degrees separately since they might originate from different mechanisms. For instance, the positive in-degree can indicate popularity or prestige, while the positive out-degree can result from sociability or influence-seeking behavior25,26. Thus, the in- and out-degrees are not necessarily well correlated in empirical networks as shown in Fig. 1a.

Reciprocity is another key aspect of social systems. As shown in Fig. 1b, for negative links, both the Bitcoin-Alpha and Slashdot networks exhibit moderate correlations between reciprocated degrees and total degrees, indicating heterogeneous reciprocity patterns across nodes. This heterogeneity suggests that nodes with similar total negative degrees may display varying tendencies to engage in mutual negative relationships. For positive links, reciprocity can be either highly homogeneous (Bitcoin-Alpha) or not (Slashdot), depending on the dataset. Although conflicting links typically account for less than 2% of total links, we observed that the tendency to form conflicting links can be highly heterogeneous among nodes (Supplementary Fig. 1). We further check the correlations between all six independent primary node degrees and find that Pearson correlation coefficients are mostly below 0.50, with a few exceptions that can go up to 0.77 (Fig. 1f, g and Supplementary Tables 15). These weak to moderate correlations likely suggest that each link type is influenced differently by social processes. For example, reciprocated positive links may represent mutual friendship, while conflicting links could indicate complex status relationships17. Critically, these correlations demonstrate that constraining one type of degree does not automatically constrain others. For example, maintaining the positive unidirectional in-degree does not guarantee the preservation of the positive reciprocated degree. Hence, providing motivation for the maximally constrained null model that preserves each primary degree separately. However, note that constraining the network topology makes some primary degrees depend on other degrees. For example, fixing the topology preserves both in- and out-degrees for unidirectional links. Consequently, if the positive unidirectional in-degree is preserved, the corresponding negative unidirectional out-degree is also automatically preserved. A similar relationship exists for signed reciprocated and conflicting degrees, which reduces the independent degrees considered in the null model, as described in detail in Supplementary Note 2.1.

On the way to the maximally constrained null model, as we incorporate more constraints into the null model, the fraction of the network that is being randomized decreases (Fig. 3). While the maximally constrained null model matches all node-level features, in principle, we could include additional constraints based on higher-order network properties. Such a non-local null model would eventually capture all characteristics of the empirical social networks. At this point, the non-local null model could serve as a generative model, capable of producing synthetic networks statistically indistinguishable from the observed network across multiple measures. In this sense, the progression from null models to generative models represents a continuum of increasing structural fidelity, reflecting our evolving understanding of the fundamental organizing principles in social networks.

Fig. 3: Randomizing signed directed networks.
figure 3

The diagram illustrates a progression from topology-disrupted models (left) to generative models (right), with intermediate models. The top network represents the original structure, with positive links in blue and negative links in red. Two example null models are shown: (1) the signed directed model that preserves directed topology and signed in- and out-degrees (39% randomized), and (2) the maximally constrained model that preserves directed topology with signed in-, out-, reciprocated-, and conflicting-degrees (18% randomized). Darker colors highlight randomized links.

However, as we mentioned above, for the purpose of detecting the patterns emerging from wiring mechanisms, we only want to include local constraints at the level of individual nodes. As a strong argument for the maximally constrained null model, the best null model is expected to be the one that is closest to the data without being constrained on higher-order features. Another strong argument for the maximally constrained null model emerges from considering incomplete, “V-shaped” patterns. In a V-shaped pattern, two of the three considered nodes have no relations to each other. Intuitively, such patterns are not expected to provide insights about genuine three-node interaction tendencies. In line with this expectation, the maximally constrained null model matches all V-shaped motifs. This follows from the fact that counts of such motifs are just the number of ways we can pair different types of links for each node, each of which link type matched by the null model for each node. Note, however, that V-shaped graphlets can still deviate from the model, echoing the presence of genuine three-node patterns, where all pairs of the three nodes are interacting. Of course, considering such V-shaped patterns does not provide new information in this case in addition to what we learned from the connected triads shown in Fig. 2. We also note that the maximally constrained null model is not the only one that fully captures incomplete motifs. The version that similarly constrains all primary degrees without fixing the topology also has this desirable property.

In this study, we restrict ourselves to present only two null models to illustrate how incorporating appropriate constraints reveals the hidden structure of empirical social networks. Based on a previous undirected study1, we consider both network topology and signed degrees as fundamental constraints. We extend this approach to signed directed networks by preserving directed topology and signed in- and out-degrees (signed directed null model). This null model is efficiently generated by applying maximum-entropy randomization to directed networks, maintaining average in- and out-degrees across the ensemble (see Supplementary Note 2.3 for details). Alternatively, the maximally constrained null model considers all primary node degrees from three distinct types of directed links: unidirectional, reciprocated, and conflicting. In addition to directed topology, the maximally constrained null model preserves the signed in-, out-, reciprocated-, and conflicting-degree of each node, when averaged over the ensemble of the null model. We implement this model by decomposing the directed network into three independent subgraphs: (1) unidirectional positive and negative links, (2) reciprocated positive and negative links, and (3) conflicting links. Maximum-entropy randomization is applied to each subgraph separately. The union of the links in the resulting randomized subgraphs provides the complete null model, as formulated in Supplementary Note 2.2.

Directed notions of balance

Extending the definition of balance to directed network necessitates understanding the role played by reciprocated and conflicting links. For example, negative reciprocated links can either be considered as equivalent to a negative link in the undirected case, or be interpreted as a sign of balance, due to sign concordance18. However, as in undirected networks, a definition of balance should not be based on isolated links as such an approach ignores the complex interdependencies of social systems. Instead, a comprehensive definition of balance should consider the dynamics among a group of entities, resulting in patterns beyond those given by individual node-level features. Here, we discuss several definitions of strong balance based on (fully connected) triads in directed networks, extending balance theory from the undirected case to the directed case, as illustrated in Fig. 4.

Fig. 4: Definitions of balance for signed directed triads.
figure 4

An example triad is considered under different definitions of balance. Positive links are shown in blue and negative links are shown in red.

As direct extensions of undirected balance theory, we first propose two definitions that transform directed links into undirected links. The Undirected definition simplifies the network by converting reciprocated  + / + to positive and  − / − to negative links, while treating conflicting  + / − as negative. In contrast, the Consistency definition focuses on sign agreement, treating both  + / + and  − / − as positive, with only  + / − considered as a negative. This distinction depends on the interpretation of reciprocated negative links (− / −): the Undirected approach views them as elevated discord, while the Consistency approach sees them as a form of consistency in negative sentiment. Both definitions consider unidirectional links as undirected and preserve their signs.

As an alternative, we build upon the observation that the pattern A → B → C ← A, known as a transitive cycle27, frequently appears in most considered datasets (> 30% of total triads in Epinions, Slashdot, Pardus). Thus, we consider a definition of balance based on transitive cycles (Cycle). This definition considers the consistency of sentiment between the direct interaction from A to C and the indirect interaction from A to C through B. We consider a cycle as balanced if it has an even number of negative links, and unbalanced otherwise. A triad is considered balanced only if all transitive cycles are balanced.

Another definition of directed balance considers the closed walks involving all three triad nodes (Walk). Closed walks represent paths where information or influence can flow back to its origin. This circular flow within a triad can reinforce or counteract itself, depending on the signs of the links. We consider closed walks that encompass all nodes without repeating nodes and consider them as balanced if a given walk contains an even number of negative links, and unbalanced otherwise. A triad is considered balanced only if all closed walks are balanced.

The final definition we consider is grounded in status theory17,28, which offers a distinct perspective on balance in signed directed networks, particularly relevant in hierarchical social structures. According to status theory, the sign of a link between two nodes is determined by the perceived difference in their social status (Status). Specifically, a positive link from node A to node B indicates that A perceives B as having a higher status, while a negative link suggests that A views B as having a lower status. In this definition, balance is achieved when all three nodes of a triad can be placed in a consistent status order. While previous studies of status theory have focused less on reciprocated links, real social systems often contain reciprocated links, potentially corresponding to equal status. Thus, here we introduce an extended notion of status theory by considering reciprocated positive or negative links between two nodes as indicators of having equal status. For example, if A positively links to B and B also positively links to A, such a triad is considered as balanced as long as both A and B have higher, lower, or equal status relative to C.

Note that these prospective definitions of balance already differ even at the level of fully reciprocated configuration with consistent signs (A1, A11, A3, A16 in Fig. 5). In these cases, according to the Undirected definition, directionality plays no role, as mutual links with identical signs can be considered as undirected links without loss of information. On the contrary, both the Consistency and Status definitions suggest that triads A3 and A16 should be balanced, while the Undirected definition indicates they should be unbalanced (Fig. 5). This discrepancy indicates that the Consistency and Status definitions should not be considered as extensions of the undirected notion of balance. Instead, these definitions may offer complementary insights into signed directed networks if they demonstrate consistency with empirical data.

Fig. 5: Comparison of the observed triad statistics to null models and balance theories for triads without conflicting links.
figure 5

Triads are ordered based on their expected balance according to the Undirected definition, with balanced triads presented first, followed by a black line and then unbalanced triads. The “Maximally constrained” and “Signed directed” rows show z-scores quantifying the deviation of triad frequencies in empirical social networks from corresponding null model expectations. Orange (gray) dots indicate significant overrepresentation (underrepresentation) with z > 2 (z < − 2), while lighter colors indicate results with (z < = 2). The “Theory” rows indicate whether each triad configuration is classified as balanced (orange) or unbalanced (gray) according to a certain definition of balance. The spot is left blank if the theory is inconclusive about the balance of that triad. Triads that contain only reciprocal links are shaded in red. Inconclusive triads without significant results are shaded in gray.

Balance is observed in directed social networks

To quantitatively understand the balance in signed directed networks, we consider all signed and directed triadic graphlets presented in the empirical social systems, as shown in Fig. 2. As a standard measure, we use the z-score to quantify how the observed triad frequencies deviate from the null models. The z-score is calculated as

$$z=\frac{{f}_{obs}-\langle \,{f}_{null}\rangle }{\sqrt{{\sigma }_{obs}^{2}+{\sigma }_{null}^{2}}},$$
(1)

where fobs is the observed frequency of a given triad and 〈 fnull〉 is the mean frequency of the same triad type averaged over 1000 independently generated null model networks. The denominator represents the total uncertainty, combining two sources of uncertainty: σnull, the standard deviation of triad frequencies across all null model samples, and σobs, the estimated shot noise \({\sigma }_{obs}\approx \sqrt{{f}_{obs}}\), assuming a Poisson distribution for the occurrence of each triad type. Note that for frequent motifs, with a large count, the shot noise has a negligible downward impact on the z-score, however, it saves us from misinterpreting small counts. This approach allows us to account for both the variability in the null model and the inherent statistical fluctuations in the observed network, providing a robust measure of the significance of triad frequency deviations. A triad is considered as significantly overrepresented when z > 2 and significantly underrepresented when z < − 2. Any z < 2 score means that the triad does not deviate substantially from the null model.

First, we consider the maximally constrained null model, which preserves topology and all primary node degrees. For each triad, we note consistency with a given definition of balance if there are no statistically significant contradicting conclusions regarding over- or under-representation across all empirical datasets. In Fig. 5, we ordered all triads without conflicting links based on whether they are balanced or not under the Undirected definition. We observe consistent alignment within all datasets examined. Moreover, the results are also consistent with the Cycle definition of balance. The exceptions are triads G1 − G4, where the results are not significant in most datasets, indicating that these specific configurations occur at frequencies similar to what would be expected by chance, given the preserved network constraints. Regarding triads with conflicting links, at least partially due to their small numbers, the majority lacks sufficient statistical significance to determine over- or underrepresentation compared to the null model, leading to no definite conclusions of balance (Supplementary Fig. 2). Another possible interpretation is that the maximally constrained null model already captures the statistics of triads with conflicting links. This means that once we account for the signed degrees of unidirectional, reciprocated and conflicting links of each node, the frequencies of triads containing conflicting links are found to be fully explained by these lower-order network properties.

On the contrary, when comparing observed frequencies to the signed directed null model, which preserves directed topology and signed in- and out-degrees, we do not observe a clear pattern that aligns with any proposed balance definition (Fig. 5). The most notable trend is the underrepresentation of all triads with conflicting links across datasets, with the sole exception of triad A8 in the Bitcoin-Alpha network (Supplementary Fig. 2). However, this observation does not reveal direct insights about balance in these networks. Instead, it is a consequence of the overrepresentation of reciprocated links and underrepresentation of conflicting links at the link level, as suggested by other studies18. This link-level pattern propagates to the triadic level, resulting in the observed underrepresentation of triads containing conflicting links and the overrepresentation of triads without conflicting links in most cases. We expect similar effects to take place for other null models that are leaving some constraints out of the maximally constrained null model.

Conclusions

This paper outlines a roadmap for addressing the challenges in understanding signed directed networks and establishes foundational steps toward uncovering their inherent patterns. By implementing a maximally constrained null model, we identified robust structural patterns that consistently emerge in empirical social networks. Comparing empirical results with theoretical definitions helped us narrow down the studied extensions of balance theory to Undirected and Cycle definitions.

It is worth noting that the Status definition, while potentially applicable in certain social systems as indicated by previous studies17,18, shows contradictory conclusions when compared to empirical results for most triads. Such contradiction suggests that status dynamics may not be as universally applicable in explaining signed directed network structures as previously thought.

Note that while this study provides a maximally constrained null model as a starting point for understanding structural balance in signed directed networks, alternative null models may be considered. Specifically, a better null model would (1) yield statistics that more closely match those of the empirical dataset, resulting in smaller absolute z-scores, and (2) effectively remove structural balance when applied to a reference network that initially exhibits such balance, similar to the approach demonstrated in ref. 1. However, unlike for undirected networks, constructing a suitable reference network for the directed case presents significant challenges, as discussed in the Introduction.

One limitation of the current study is that it focuses on triad patterns. However, to fully understand the organizing principles of social systems, or more broadly signed directed systems, it may be necessary to incorporate higher-order patterns29, such as four-node patterns, as well.

Although our previous study1 found that square patterns also show balance in the undirected case, extending such analysis to directed networks may require a fundamentally different framework. The primary obstacle lies in the combinatorial explosion of possible configurations as the number of nodes increases. It is worth noting that analysis based on comparing patterns to null models inherently involves multiple hypothesis testing issues. In our analysis, we simultaneously test 96 variables that are not independent of each other. This approach can increase the risk of Type I errors (false positives) if not properly addressed. For example, when testing multiple hypotheses at a certain significance level, the probability of observing at least one false positive result increases with the number of tests performed. Although in this study we examine our main results against several thresholds of z-scores (e.g., z = 2, 3, 5, and 10, see Supplementary Fig. 3) and demonstrate robustness, future studies may benefit from more rigorous statistical approaches. For example, in addition to considering z-scores, a combination of fold change and empirical p-values may also be considered.

Moving forward along the roadmap, future studies could incorporate additional primary node features related to the node attributes, which is also known as node color30. For instance, categorical node attributes (e.g., gender) could generate attribute-specific network properties, such as gender-specific degree, which quantifies the number of connections a node has to nodes of a particular gender category31. Similar to how distinct link types drastically increase the number of possible network patterns, the introduction of node colors multiplies the complexity of structural patterns that need to be properly considered. Eventually, advancing towards generative models would require considering more node features and associated degrees, pairwise connections and even higher-order properties.

This work provides a roadmap towards understanding the formation of alliances and conflicts in various social contexts4,32. Such a roadmap can be applied to a wide range of signed directed networks. For instance, applying our methods to C. elegans neuronal networks33,34,35, which contain approximately 300 neurons with signed, directed synapses, could reveal whether principles of social balance have analogs in the neuronal organization. Incorporating higher-order or non-local constraints may eventually lead to an effective generative model that captures the organizational principles of neural networks with low-order properties36.