Abstract
River networks are important landscape features that have been extensively studied over many years. While seminal works have focused on characterizing the topological properties of river networks, the quantification of their spectral properties has received limited attention. In this study, through a graph-theoretic formulation of river network topology, we investigate the eigenvalue spectra of its connectivity matrix (i.e., adjacency matrix). First, we explain the observed range of zero eigenvalues on the spectra using the notion of multiplicity (i.e., algebraic and geometric multiplicity) for both undirected and directed river networks. Next, we investigate the physical meaning of the multiplicity of zero eigenvalues on the dynamics of the river network. We show that multiplicity of zero eigenvalues is sufficient to determine the minimum set of driver nodes on the river network. The ratio of the number of driver nodes vs total number of nodes is a measurement of controllability of the river network, which is essential for a comprehensive understanding of the system’s dynamics under external forcing. Using both synthetic and natural river networks, we show that with increasing heterogeneity, quantified via Tokunaga c-value, the number of zero eigenvalues increases indicating that basins in humid climate require more number of driver nodes to control their network dynamics. Finally, we show that driver nodes tend to avoid critical nodes identified via pairwise connectivity. Our results indicate that the multiplicity of zero eigenvalues in the eigenvalue spectrum can serve as a valuable tool for understanding and quantifying the physical and dynamical properties of river networks, such as controllability and heterogeneity. Furthermore, our findings establish a clear connection between controllability metrics and the vulnerability of river networks.
Similar content being viewed by others
Introduction
Branching patterns of river networks exhibit complex topology and depend on climatic, geologic, biologic and ecologic conditions1,2,3,4. They have been studied extensively over the past several decades. These dendritic networks serve as essential pathways for transport of water, sediment, nutrients and interact with surrounding ecologic and biotic activities3,5,6,7,8,9,10. Quantifying their structure and dynamics is crucial for understanding their evolution under changing external forcings and for predicting the environmental fluxes that operate within them1,3,5,6,7,8,11,12,13,14,15,16,17.
Disparate literature exists on quantifying the connection between river network (RN) topology and its hydrologic response. For example, Rodriguez-Iturbe et al.,3 explored geomorphic and topologic properties of RNs under different climate and their interaction with biodiversity, vegetation, human populations, and water-borne diseases. More recently, Abed-Elmdoust et al.,11 showed that precipitation pattern has significant effects on the structure and evolution of a RN. Their study adopted a graph-theoretical approach to explore topologic properties of synthetic RNs for a range of precipitation patterns.
In a graph-theoretical approach, a network is represented as a set of nodes and links, which can be expressed using a network connectivity matrix (e.g., adjacency or Laplacian). Several complex network metrics have been extensively used to characterize networks across diverse fields, including social networks, transportation, communication, bioinformatics, molecular chemistry, mathematics, and geoscience18,19,20,21,22,23. Quantification of connectivity, vulnerability, influence of nodes and complexity of the networks, among others, offer deeper insight into network structure and dynamics and has been a focus of research in past few decades24. For example, the topologic and dynamic complexity of delta channel networks has been investigated through a graph-theoretic approach25,26. Sarker et al.27 developed a framework to understand the vulnerability and resilience of RNs under external disruptions using a graph-theoretic approach.
In a recent study, Abed-Elmdoust et al.28 adopted spectral graph theory framework to investigate the eigenvalue spectrum of the graph adjacency matrix. They found that the eigenvalue spectrum exhibits a forbidden range of zero eigenvalues (spectral gap) which is dictated by the branching pattern of the RN. They also suggested spectral gap and nullity (number of zero eigenvalues) are independent of size and shape of the basin and the number of outlets on the landscape. However, studies that specifically relate physical and spectral properties of a RN are limited. In addition, to the best of our knowledge, the multiplicity of eigenvalue on the eigenvalue spectrum of the matrix associated with RN and their relationship to climate and geomorphology has never been explored.
In this study, we characterize physical connections and arrangements of channels and junctions in a RN using a tree network topology, where nodes represent stream junctions and links represent stream segments. The RNs were obtained through numerical simulations using the optimal channel network (OCN) approach, which ignores processes such as ground infiltration and evaporation, as well as from natural basins across the United States. The simulated and natural RN topologies were analyzed to understand: (1) the spectral properties of the connectivity (adjacency) matrix and their response to changing climatic and physical processes, and (2) the range of zero eigenvalues in the eigenvalue spectrum, using the concept of eigenvalue multiplicity and its implications in the context of RNs. Additionally, we investigate how the multiplicity of zero eigenvalues (nullity) is related to network controllability and heterogeneity in both synthetic and natural RNs under varying physical processes and climate conditions (i.e., humid and dry environments). Furthemore, we propose a systematic method for identifying number of critical nodes27 whose removal could significantly impact overall network controllability, offering a valuable tool for water resource management and conservation planning. While existing methods form the foundation of our analysis, the key advancement lies in their novel application and interpretation within the context of RN topology, leading to insights with direct implications for environmental sustainability and hydrological resilience.
Results and discussion
Eigenvalue and eigenvalue spectrum of synthetic RNs
For a given network (here RN), based on the linear transformation, eigenvalues and eigenvectors can be obtained as:
where A is a square matrix, x are vectors on the vector space (i.e., eigenvectors) and \(\lambda\) are the corresponding scale factors (i.e., eigenvalues). In general, eigenvectors are the directions in which the matrix acts solely as a scaling transformation, and eigenvalues are the corresponding scale factors29,30,31.
Synthetic RNs obtained from OCN model for different energy exponent \(\gamma\) (a-e) and their corresponding eigenvalue spectrum (f). Here only five networks are shown (i.e. \(\gamma\) = 0.1, 0.3, 0.5, 0.7 and 0.9) for brevity. The bottom right subplots in panels (a-e) show the total energy expenditure as a function of number of iteration and subplot in panel (f) show the number of zero eigenvalue \(\lambda _0\) as a function of \(\gamma\).
Based on spectral graph theory, properties of the networks can be analyzed using the eigenvalue spectrum of the network connectivity matrix to determine the network’s complex and heterogeneous behavior32,33. Previous studies found that the eigenvalue spectrum of adjacency matrix exhibits striking features such as spectral gap and nullity which are closely related to the branching patterns of the network28. To investigate the range and multiplicity of zero eigenvalues in their eigenvalue spectrum, in this study we generate synthetic RNs using the optimal channel network (OCN) approach. These networks were simulated on an arbitrary basin shape with an initial grid size of \(50\times 50\) nodes, with varying energy decay exponent \(\gamma\) ranging from 0.1 to 0.9 (see Fig. 1, and also Methods for details). Figure 1 illustrates different branching patterns of RNs that emerge with varying \(\gamma\). This variation in branching patterns affects the eigenvalue spectrum. For example, as depicted in Fig. 1 (bottom right panel), the number of zero eigenvalues (\(\lambda _0\)) decreases with increasing \(\gamma\) and exhibits a non-linear behavior. Note that the eigenvalue spectrum were computed for networks that achieved steady state with minimum energy (optimized), shown as subplot in each panel Fig. 1(a-e), as a function of \(\gamma\) values. While previous research28 has introduced an empirical relationship for the \(\lambda _0\) in terms of the \(\gamma\) of the RNs, encompassing their probability density function with specific skewness (indicating varying branching topology), the physical significance of \(\lambda _0\) and their spatial distribution (location) within the RN has not been thoroughly examined. In the following section, we explore the notion of multiplicity in the eigenvalue spectrum for a small hypothetical stream network.
Algebraic and geometric multiplicity of eigenvalues on the eigenvalue spectrum
Figure 2 shows a small undirected (top left panel) and directed (bottom left panel) exemplary stream networks with 4 nodes and 3 links. The corresponding real eigenvalues for both undirected and directed networks are computed using Eq. (1) and presented in Fig. 2. Based on spectral graph theory, the number of times an eigenvalue appears in an eigenvalue spectrum is referred to as algebraic multiplicity of that eigenvalue (denoted by\(\ M_A\left( \lambda \right)\))29,30,34. Figure 2 depicts that the algebraic multiplicity of zero eigenvalues is \(M_A\left( \lambda _0\right) =2\) for the undirected network, whereas \(M_A\left( \lambda _0\right) =4\) for the directed network. For undirected binary tree networks, a general expression for \(M_A\left( \lambda _0\right)\) can be derived (see details in Appendix 1). Besides algebraic multiplicity, the geometric multiplicity of an eigenvalue (\(M_G\left( \lambda \right)\)), i.e., the number of linearly independent eigenvectors associated with it, can also be characterized and may provide useful information regarding branching patterns. Mathematically, geometric multiplicity of an eigenvalue \(M_G\left( \lambda \right)\) is the dimension of the null space of that eigenvalue.
Computation of algebraic and geometric multiplicity of a hypothetical network using the adjacency matrix shown in Fig. 2(a).
Figure 2 depicts that the geometric multiplicity of zero eigenvalues \(M_G\left( \lambda _0\right)\) is 2 for both undirected and directed network since there are two independent rows that exist on the column canonical form of zero eigenvalues of the adjacency matrix (i.e., \({\lambda =\lambda }_0\)). Detail calculation steps for \(M_A\left( \lambda \right)\) and \(M_G\left( \lambda \right)\) are presented in Fig. 3. Based on Figs. 2 and 3, one can observe that the algebraic multiplicity \(M_A\left( \lambda \right)\) and geometric multiplicity \(M_G\left( \lambda \right)\) of an eigenvalue can differ. However, the geometric multiplicity can never exceed the algebraic multiplicity (i.e., \(M_A\left( \lambda \right) \ge M_G\left( \lambda \right)\))29,35. In addition, geometric multiplicity can be also obtained from the rank of the adjacency matrix, which is related to the arrangement of nodes in the network topology. In the following sub-section we explore the physical implications of the multiplicity of zero eigenvalues and how the arrangement of nodes is related to the \(M_G\left( \lambda _0\right)\).
Physical meaning of multiplicity of zero eigenvalues
To investigate the physical meaning of the multiplicity of zero eigenvalues and their dependence on branching pattern, we use a relatively larger synthetic network (synthetic RN) (compared to 4 nodes shown in Fig. 2) with 16 nodes but different link/channel arrangements (Fig. 4) and employ the concept of matching36. For an undirected graph, matching is defined as an independent set of links that do not share common nodes (see Fig. 4) and a maximum matching is a matching that contains the largest possible number of links36. Whereas for a directed network, matching is defined as a set of links that do not share common in-nodes or out-nodes, where an in-node for one link can be an out-node for another. Therefore, for a directed network, a node is considered matched when it is an endpoint of one of the links in the matching, while the rest of the nodes are called unmatched nodes (see Fig. 4). For both undirected and directed cases, the minimum number of unmatched nodes can be achieved by a maximum matching.
Schematic of three undirected hypothetical networks with same number of nodes but different channel arrangement: pure branching (a), higher number of branching nodes than side-branching nodes (b), and higher number of side-branching nodes than branching nodes (c). Corresponding number of zero eigenvalue and unmatched nodes are also shown. Violet solid squares depict the unmatched nodes (also termed as driver nodes for an undirected network) and the red links are the set of links representing maximum matching.
It is important to note that in the context of RNs, both directed and undirected network models may provide valuable insights. Directed network models are especially crucial for characterizing processes like streamflow37, which predominantly occur in one direction (e.g., upstream to downstream). This makes such models indispensable for hydrological applications, including flood forecasting, watershed management, and modeling pollutant transport11. On the other hand, undirected networks focus on the connectivity between nodes without considering flow direction. These models are particularly useful for studying the role of connectivity (e.g., topological properties) in ecological9,10,15 and geomorphological28 contexts, where factors like habitat connectivity and species migration-such as fish movement-are key considerations.
Figure 4 shows that algebraic multiplicity of zero eigenvalues \(M_A\left( \lambda _0\right)\) is equal to the minimum number of unmatched nodes determined using maximum matching. In general, the fact that the number of unmatched nodes is equal to the \(M_A\left( \lambda _0\right)\) is always true for any undirected tree (connected graph with N nodes and \(N-1\) links) has been shown in prior research38,39. According to Liu et al.40, unmatched nodes are sufficient to understand the controllability of a dynamic network system. Controllability is a metric commonly used in network control theory which measures the ability to control the dynamics of a network under external influence (see Methods, for details). Among all the nodes from a network topology, for controlling a network, one needs to control a set of minimum nodes to guide the dynamics of the network. This set of minimum nodes is called the driver nodes (\(N_D\)) which are the unmatched nodes identified based on maximum matching. In that sense, for an undirected network, algebraic multiplicity of zero eigenvalue can be used as a physical measure of network controllability.
Figure 4a-c show schematics of three undirected hypothetical networks each with 8 source nodes and 16 total nodes. Network 1 (Fig. 4a) is pure branching with 7 branching nodes; Network 2 (Fig. 4b) has 4 branching nodes and 3 side-branching nodes; whereas, Network 3 (Fig. 4c) has 3 branching nodes and 4 side-branching nodes. However, they all have different number of driver nodes i.e., 6, 4 and 2, respectively, suggesting that algebraic multiplicity of zero eigenvalues can vary within a fixed network size but different channel arrangement. More specifically, it depends on the branching pattern of the RN. Note that branching nodes occur when two channels of the same Horton-Strahler order meet, forming a higher-order channel, while side branching nodes form when channels of different Horton-Strahler orders merge41,42,43 (see Methods, for details). It can also be seen that, for undirected RNs with side-branching, driver nodes avoid junction nodes, however, for pure branching (e.g., Fig. 4a), driver nodes position can be at junction nodes. In addition, in the case of pure branching, the number of driver nodes can be determined using a general solution applicable to undirected binary tree networks (see details in Appendix 1). Hence, one can conclude that the branching pattern of a RN can influence the location of driver nodes, and thus the controllability of a network. Although, as discussed above, \(M_A\left( \lambda _0\right)\) provides a quantitative intuition to detect driver nodes on a network topology and explain the controllability of undirected network, however, \(M_A\left( \lambda _0\right)\) has limitations to characterize the controllability for a directed network.
Maximum matching concept shown for a hypothetical directed network for the case of Fig. 4c. Violet squares nodes depict unmatched nodes (driver nodes) to achieve full control of network. Note that for a directed network, the number of driver nodes is equal to the number of source nodes but not all source nodes are driver nodes (see Fig. 6). Red links show the set of links exhibiting maximum matching for the directed network.
Exact location of driver nodes for a directed synthetic RNs (shown in Fig. 1(c)) generated by OCN. Driver nodes (star) superimposed on synthetic RN (a), and zoomed-in image showing location of the driver nodes (b).
For directed RNs, the number of unmatched nodes is equal to the number of source nodes40. Figure 5a shows the hypothetical network presented in Fig. 4c as a directed network along with the locations of the driver nodes based on the maximum matching (Figs. 5b-c). As can be seen from Fig. 5c, for a directed network, the number of driver nodes is equal to the number of source nodes (i.e., \(N_D=N_S\)), suggesting that the controllability for a directed network depends on the number of source nodes. Yuan et al.35 indicated that not all driver nodes are located at the source nodes and geometric multiplicity of zero eigenvalue may provide both number and location of the driver nodes. To explore the actual position of the driver nodes using the formulation by Yuan et al.35, we use a larger size network such as a synthetic RN generated by OCN. Figure 6a shows the location of driver nodes on the synthetic RN. Figure 6b visually confirms that not all the driver nodes are at the source nodes.
In the rest of the manuscript, we aim to address how the multiplicity of zero eigenvalue and driver nodes are related to the geomorphic and climatic properties of the synthetic and natural RNs.
Driver nodes on synthetic RNs
To further investigate the role of branching patterns on network dynamics, i.e., controllability, systematically as a function of physical processes and climate, we compute the number of driver nodes and identify their locations using the Yuan et al. framework35. We generate several synthetic RNs based on OCN model with a fixed number of nodes (\(N= 1513\)), however with varying energy exponent \(\gamma\) = 0.1 to 0.9. Figure 6a shows, as an example, a generated synthetic RN. The locations of the superimposed driver nodes on the network can be seen in Figs. 6a and 6b. Note that for each \(\gamma\), 15 independent networks were generated, and their ensemble averaged metrics were computed in order to minimize the effect of random network initialization.
Percentage of driver node as a function of \(\gamma\) for synthetic RNs (a). For undirected network driver nodes, \(N_D\) (%); Zero eigenvalue \(\lambda _0\) (%) as a function of \(\gamma\) (a). For directed network number of driver node (\(N_D=M_G\left( \lambda _0\right)\)); number of source node \(N_S\) as a function of \(\gamma\) (a), driver node (%) located on source node, branching and side-branching node (b). Networks were generated using OCN for \(\gamma\) = 0.1 to \(\gamma\) = 0.9.
An example of a hypothetical 4th-order RN with branching and side-branching nodes (a) and corresponding Tokunaga self-similarity model along with the computation of parameters a and c (b); different colors indicate different order (see41).
Figure 7a illustrates a comparative analysis of \(N_D\) between undirected and directed versions of synthetic RNs. In particular, Figure 7a shows the algebraic multiplicity of zero eigenvalues \(M_A\left( \lambda _0\right)\) in the eigenvalue spectra of the connectivity matrix, which is equal to the number of unmatched nodes (driver nodes) for undirected networks, as a function of \(\gamma\). As the percentage of zero eigenvalue represents controllability, C (\(C=N_D/N\), representing the ratio of number zero eigenvalues to the total number of eigenvalues) of a network, from Fig. 7a, it can be observed that controllability decreases with increasing \(\gamma\) where the drainage pattern changes drastically from an intertwisted RN to a relatively straightened pattern28. Note that controllability C for an undirected RN has been observed to be independent of the RN size and basin shape28. Likewise, for the case of directed RN, controllability also decreases with increasing \(\gamma\) with a similar trend but is higher than for the undirected RN (Fig. 7a). As discussed above, for a directed network, the number of driver nodes (\(N_D\)) computed based on geometric multiplicity of zero eigenvalues \(M_G\left( \lambda _0\right)\) is equal to the number of source nodes (\(N_S\)). However, a significant number of driver nodes are not located at the source nodes (Fig. 7b). We observed that \(\sim 4.5\%-7\%\) driver nodes are located at branching nodes and \(\sim 12.5\% -16 \%\) driver nodes are located at side-branching nodes. For detailed information on branching and side-branching nodes, refer to the Methods section and Fig. 8(a), which provides a schematic representation of a RN with branching and side-branching junctions for illustration. It is also observed that as the \(\gamma\) increases, the percentage of driver nodes located on side-branching nodes increases slightly, despite the fact that the total percentage of driver nodes decreases. This essentially indicates that as the drainage pattern changes from an intertwisted channel network to a relatively straightened pattern (as observed in steeper topographies), besides the source node, the controllability is mostly governed by the side branching nodes and vice versa.
Heterogeneity of synthetic RNs
Network heterogeneity is a metric that describes the diversity of characteristics within a network, often related to the uniformity of its organization (i.e., the arrangement of links and nodes) and is critical for understanding the emergence and evolution of RNs44,45. In this study, we utilize the Tokunaga self-similarity model (see, for details, Methods and Fig. 8(b), which illustrates the computation of Tokunaga model parameters for the RN example shown in Fig. 8a) to quantify the heterogeneity of the network. This model is frequently used to characterize the branching patterns of RNs11,46,47. It offers advantages over the Horton-Strahler41,42,43 indexing system by accounting for side-branching channels (see Methods and Fig. 8 for details), which frequently occur in natural RNs43. The Tokunaga parameter c-value, describes the varying degree of side-branching and has been suggested to provide deeper insights into the structure and function of networks47, as structural changes can indicate functional changes in a RN7,48. Figure 9 illustrates the relationship between the c-value and controllability obtained from synthetic RNs simulated using OCN approach for various \(\gamma\) values. The observed increasing trend from this figure suggests that larger heterogeneity in RNs is associated with increased controllability. In the next section, we explore the relationship between controllability and heterogeneity in natural RNs.
Controllability and heterogeneity of natural RNs under varying climate
In this section, we investigate the controllability computed based on the multiplicity of zero eigenvalues on the eigenvalue spectrum of natural RNs. For this, we extracted RNs from the digital elevation model (DEM) of 56 natural basins across the United States, based on the availability of LiDAR data. Although the flow of water in RNs is directional, due to biodiversity and ecological considerations (as discussed above), the eigenvalue spectrum was computed on an undirected adjacency matrix considering only the notion of the network connectivity. This connectivity has been shown to play a major role in biological and ecological communities in riverine ecosystems and their processes9,15. Note that, in this study, the networks were extracted using a curvature-based method from high-resolution (1m) topographic data49.
In order to understand the distinct climatic signature on the eigenvalue spectrum, the long-term climate was considered in the form of the climate aridity index (CAI), which is commonly defined as the ratio of mean annual potential evaporation (\(E_p\)) to precipitation (P)50,51,52,53,54,55,56. According to Budyko, regions where the aridity index is higher than 1, are generally classified as dry, and regions with aridity index less than 1 are classified as humid regions. In addition, CAI has also been associated with a broader range of climatic regimes, such as arid \(12>CAI\ge 5\), semi-arid \(5>CAI\ge 2\), subhumid \(2>CAI\ge 0.75\), and humid \(0.75>CAI\ge 0.375\)50,53,55.
The eigenvalue spectrum computed from the adjacency matrix of natural RNs for the different climatic regions exhibits a distinct range of zero eigenvalue. Similarly, to the synthetic RN, the eigenvalue range for natural RN’s can be explained by the algebraic multiplicity \(M_A\left( \lambda _0\right)\) and the geometric multiplicity \(M_G\left( \lambda _M\right)\).
Example of DEMs and superimposed RNs for basins in humid (a) and arid (b) climate; the channel order based on the Horton-Strahler (41) ordering scheme is shown with different colors. Controllability C (%) (c) and heterogeneity represented by c-value (d) for natural basins as a function of climate aridity index, c-value plotted against Controllability C (%) (e) for natural basins. Note that the significance of these relationships was tested using t test which showed a \(p-value\ <\ 0.05\), within a 95% confidence interval.
Figure 10(a-b) show two sample examples of natural basins with superimposed RN for humid (a) and dry (b) climates. Figure 10c show controllability as a function of CAI for all the extracted natural RNs considered here. As can be seen, controllability shows a decreasing trend with CAI, indicating that controllability is higher for RNs in humid climates as opposed to dry climates.
The dependence of controllability on climate underscores the significant influence of climate on the branching patterns of RNs. Zanardo et al.57 explored the relationship between the Tokunaga c-value and climatic variables such as rainfall, storm frequency, and duration and argued that the c-value increases with increasing precipitation. More recently, Ranjbar et al.47 demonstrated a positive correlation between the c-value and the complexity of RNs. Figure 10d shows the c-value as a function of CAI for the basins analyzed here, suggesting that side-branching decreases with increasing CAI. This observation highlights the role of side-branching in the network, indicating that both controllability and heterogeneity increase as CAI decreases.
To further explore whether the controllability metric depends on the branching pattern, we plot the c-value as a function of controllability in Figure 10e. We observe a general increasing trend, where higher c-values correspond to greater controllability, indicating that networks with more side-branching tend to exhibit enhanced controllability. Although the correlation is not as high as observed in synthetic RNs, the computed \(p-value\) suggests the relationship is significant. The lower correlation in the case of natural RNs (as compared to synthetic RNs, see Fig. 9) can be attributed to varying basin size, hydrology, geology and soil heterogeneity, among others. Our results suggest that intricate, inhomogeneous branching structures with more side-branching, commonly found in humid environments57, may produce RNs less prone to change under increasing precipitation. In contrast, arid climate basins, with more homogeneous branching structures, may exhibit more dynamic behavior under similar precipitation conditions. These quantitative insights offer valuable tools for a comparative and comprehensive understanding of humid versus dry basins.
Relation with the critical nodes and network vulnerability
Remaining pairwise connectivity as a function of the number of critical nodes (CN) removed, with numerical values indicating the power-law fitted slope (a), where red and blue colors correspond to basins represented by red and blue squares in panel (b). Vulnerability as a function of the climate aridity index (b); here different colors indicate a broader range of climatic regimes50,53, categorized as humid (\(0.375 \le CAI < 0.75\)), subhumid (\(0.75 \le CAI < 2\)), semiarid (\(2 \le CAI < 5\)), and arid (\(5 \le CAI < 12\)). The black solid star in (b) indicates the mean value within each climate category.
Critical nodes are the set of nodes whose removal result in the maximum fragmentation of the network27,58,59,60. A network structure or connection which is more vulnerable may create more fragmentation under the extreme external influence (see Methods). Therefore, we compute the vulnerability (defined by the power-law exponent of pairwise connectivity; see methods for details) of RNs for varying climate (see Fig. 11(a)). For example, in Fig. 11a we show pairwise connectivity as a function of number of critical nodes (CN) removed and the computed power-law slopes for two natural basins (shown in Fig. 10) from humid and arid environments (see also Fig. 11b). As can be seen, the computed power-law slopes, indicating network vulnerability, differ for different climatic conditions.
Figure 11(b) shows that vulnerability increases with increasing CAI indicating dry basins are more vulnerable than humid basins. In addition, Fig. 11(b) also illustrates the distribution of vulnerability across different basin types classified according to the CAI: Humid, Subhumid, Semiarid, and Arid. The black solid star in each climatic regime indicates the mean vulnerability value of each category. In particular, vulnerability tends to increase from the humid to arid basins, suggesting a potential relationship between aridity and vulnerability. The observed higher variability, particularly within the Subhumid category, indicates that the vulnerability may be influenced by additional underlying factors specific to each basin type. This quantification of vulnerability has important implications in the context of ecological and biological considerations. Specifically, riverine ecosystems often occur in spatially structured habitats where fragmentation directly plays a key role in their processes such as diversity, productivity and even resilience of the metapopulation9,10,15.
As discussed above, the controllability framework allows us to identify the driver nodes which correspond to the set of nodes that controls the dynamic response of a system. In the context of RNs, it is worth pointing out that most of those nodes correspond to the headwater locations (source nodes). A question of particular interest is to explore whether those junction driver nodes coincide with critical nodes as critical nodes result in maximum fragmentation of the network.
To investigate the characteristics of driver nodes and their relationship with the critical nodes, we compute the number of driver nodes that are located on the junction nodes (\(N_{DJ}\)). In addition to that, we also identify critical nodes which coincide with the driver junction nodes (\(N_{CN\cap N_{DJ}}\)) for specific conditions where the number of critical nodes (k) i) is equal to the number of junction driver nodes, i.e., \(k=N_{DJ}\), ii) is equal to the total number of driver nodes in that network, i.e. \(k=N_D\), and iii) for two different fixed values of k, i.e. \(k=5\) and \(k=10\). It is observed that when \(k=N_{DJ}\), no driver nodes and critical nodes coincide for any RN, indicating that driver nodes tend to avoid critical nodes. For \(k=N_D\), we observe a few common nodes. For example, Table 1 shows that out of 56 natural basins, 40 basins exhibit a very small amount (\(\sim\) 1-3.5 %) of junction driver nodes (\(N_{DJ}\)). 37 out of those 40 basins show common critical and driver nodes (i.e., \(N_{CN\cap N_{DJ}}\)). In addition, based on fixed values of k, we observed percentage of common nodes increases as k increases. Figure 12(a-b) shows an example of a natural basin along with superimposed critical nodes and driver nodes. Out of 238 nodes, 34 nodes are driver nodes (\(N_D\)), while only 3 nodes are located on the junction. In other words, controllability is mainly governed by the headwater locations (source nodes). In addition, for \(k=N_D\), although, the 3 junction driver nodes coincide with critical nodes found from a set of 34 critical nodes group, Fig. 12(c-d) (percentage of \(N_{DJ}\) and \(N_{CN\cap N_{DJ}}\) as a function of CAI) indicates that as the climate changes from humid to arid, the amount of \(N_{CN\cap N_{DJ}}\) increases. It is also worth pointing out that the relation between \(N_{DJ}\) and \(N_{CN\cap N_{DJ}}\) were obtained from 37 basins out of 56 basins as the 19 basins did not have common critical and driver junction nodes.
Summary and Concluding remarks
In this paper, we investigated the eigenvalue spectrum of the adjacency matrix associated with both synthetic and natural river networks (RNs). Synthetic RNs were generated using the optimal channel network (OCN) approach, while natural RNs were extracted from a 1-meter resolution digital elevation model. We explored the spectral properties of these networks in the context of controllability and heterogeneity, with particular focus on the range of zero eigenvalues by analyzing their multiplicity. Our findings suggest that the multiplicity of zero eigenvalues has the potential to quantify the dynamical properties (e.g., controllability) of RNs. For synthetic RNs, we observed a significant correlation between the c-value (characterizing RNs heterogeneity) and controllability, indicating that networks with higher heterogeneity exhibit greater controllability, i.e., a higher capacity to regulate dynamics under external influences, compared to those with lower heterogeneity. Similar observations were made in natural RNs, where a positive correlation between controllability and the c-value further suggested that humid basins are more heterogeneous and exhibit higher controllability than dry basins. Our comparative analysis between driver and critical nodes indicates that the vulnerability of an RN increases with increasing climate aridity index. In other words, RNs in arid climates tend to be more vulnerable (dynamically unstable) than those in humid climates. Additionally, our results suggest that driver nodes tend to avoid critical nodes and are primarily located at source nodes. The primary contribution of our research lies in the novel integration of network theory principles with hydrologically relevant connectivity, providing new insights into the structural robustness and functional dynamics of river systems. Unlike conventional hydrological approaches that mainly focus on physical and empirical modeling, our study introduces a spectral analysis framework to quantify network heterogeneity and assess the influence of critical nodes on system vulnerability. Overall, our findings highlight the relevance of the proposed controllability and heterogeneity metrics for understanding natural RNs and their response to climate variability.
Methods
Synthetic network generation
In this study, we generate synthetic RNs using optimal channel network (OCN) approach, which has been extensively explored from various hydrologic and geomorphic perspectives11,14,61,62,63,64. Previous studies found that OCNs can mimic commonly observed topologic and geometric properties of real RNs42. OCNs can also be used to generate DEMs which have been argued to represent properties of natural landscapes11. In general, OCN modeling relies on the local minimization of the topologic energy defined as \(E = \sum _{i=1}^{N-1} L_i Q_i^\gamma\), where \(L_iQ_i^\gamma\) represents the energy dissipated in the \(i^{th}\) link of the network. Here, \(L_i\) and \(Q_i\) are the length and discharge of the link, respectively42. The energy exponent \(\gamma\), which varies between 0 and 1, characterizes the mechanics of erosional processes and defines the branching pattern of the channel network42. The simulations were conducted on an arbitrarily shaped area designed to mimic real basins, using an initial grid of size \(50\times 50\) nodes (Fig. 1). We used a lobe-like shape that mimics a natural basin boundary. While the square grid consisted of 2,500 nodes, the lobe-like shape contained 1,513 nodes. For simplification, the topologic energy was computed assuming a unit distance between adjacent nodes and uniform precipitation over the entire simulated basin11,27,28. We also assumed that the network drains the entire amount of precipitated unit rainfall from every node within the network28,61,62. Here, we generate networks for nine different \(\gamma\) values, ranging from 0.1 to 0.9, while keeping the initial random tree network the same for each simulation (Fig. 1). The generated networks reproduce several topological properties, which are analyzed using a graph-theoretical framework (see also, Abed-Elmdoust et al.11, for details).
Matrices adopted to calculate spectral properties
Previous studies have suggested that the flow path in a RN can be defined through a directed graph which can be denoted by a \(N\times N\) adjacency matrix A, where:
Adjacency matrix (also referred to as connectivity matrix31) defines the connection between nodes and the flow direction in RNs. To compute the real eigenvalue spectrum of drainage networks, undirected adjacency matrix can be considered by ignoring the flow direction28 which can be represented as:
In addition, the relation between matrices A and B can be expressed as \(B=A+A^T\), where \(A^T\) is the transpose of A.
We also employ the degree matrix, which is a diagonal matrix of the network and contains information about the degree of each node, i.e., the number of links connected to each node. The degree matrix D can be expressed as:
where \(deg(N_i)\) of a node \(N_i\) represents the number of links terminates at that node. For a directed graph, the term degree of a node defines either in-degree (number of incoming links) or out-degree (number of outgoing links).
Identification of Driver nodes: network controllability
In this study, we identify the driver nodes (\(N_D\)) on a network to understand the controllability of RNs and investigate the dependency of controllability on RN topological structure, specifically the branching pattern. Controllability is a measure of the ability of a network to guide a dynamic system and the knowledge of the driver nodes offer the full control over the entire network40. In particular, the control of these driver nodes is enough to achieve full control of the network system dynamics. Liu et al.40 proposed a framework to identify a set of driver nodes for a directed network using the maximum matching in the network. However, Yuan et al.35 arguably provides an accurate framework to identify the driver nodes for both directed and undirected networks based on the spectral graph theory. Their framework was based on the multiplicity of eigenvalues and suggested that controllability is simply a measure of ratio between minimum number of driver nodes and the total number of nodes. Additionally, Yuan et al. depicted how geometric multiplicity of eigenvalue, \(M_G\left( \lambda \right)\), relates to the position of the driver nodes. More specifically, geometric multiplicity for a network can be calculated as:
where N is the dimension (i.e., number of nodes in a network) of square matrix A (adjacency matrix), \(\lambda\) is any eigenvalue from the eigenvalue spectrum, \(I_N\) is the N dimensional identity matrix and \(A-\lambda I_N\) matrix is the column canonical form of \(\lambda\) eigenvalue of the adjacency matrix. We use Yuan et al. (2013) framework35 to detect driver nodes for both synthetic and natural RNs and map them based on the multiplicity of the eigenvalue spectrum. We explore the use of geometric multiplicity of eigenvalue as a spectral metric to locate the driver nodes on the RN adjacency matrix and eigenvalue spectra. We further quantify the controllability of RNs and investigate the dependency of controllability on branching structure of the RN. The controllability C of a network can be expressed as:
where \(N=\) is the total number of nodes on a RN and, \(N_D=\) is the number of driver nodes, Here, \(\lambda =\lambda _M\) the eigenvalues that appear most frequently in an RN’s eigenvalue spectrum, (in this case \(\lambda =0\)) (see Figs. 2-3).
Equation (6) has a significant advantage over the maximum matching framework proposed by Liu et al.40 to detect driver nodes as it provides the exact number and position of driver nodes on a network topology based on the geometric multiplicity and can be applied to both directed and undirected networks. In that sense, for any RN, geometric multiplicity \(M_G\) is sufficient to understand dynamics and quantify controllability under external influence.
Network heterogeneity
As discussed above, network heterogeneity is a metric that describes the variability in the arrangement of links and nodes and is important for understanding emergence and evolution of a RN44,45. We used Horton-Strahler stream ordering system to quantitatively describe the branching structure of a RN by classifying streams based on their hierarchical connectivity41. In this system, first-order streams, which have no tributaries, serve as the fundamental units. When two streams of the same order merge, they form a stream of the next higher order (see Fig. 8(a), as an example). However, when streams of different orders merge, the resulting stream retains the higher order. Higher-order streams generally exhibit greater discharge and drainage area. Additionally, branching nodes occur when two streams of the same order meet to form a higher-order channel, whereas side-branching nodes form when streams of different orders merge11,41,65. In this study, we employ the Tokunaga self-similarity model46 to quantify network heterogeneity and characterize the branching patterns of RNs. The Tokunaga relation (see Fig. 8(b)) can be expressed as
where Tokunaga indices \(T_{ij}=T_{i\left( i+k\right) }=T_k\) and \(k\ =\ j-i\). The matrix \(T_{ij}\) can be computed as \(T_{ij}=\frac{N_{ij}}{N_j}\), where \(N_{ij}\) denotes the average number of streams of order i connected to streams of order j, and \(i<j\) (see43,47,66). This framework assumes that the mean of branches of order i connecting to randomly selected branch of order j, \(T_{ij}\), is independent of the branch orders and only depends on the difference \(k = j-i\) and follow exponential relationship with k (Eq. 7). a is a constant and c describes the degree of side-branching and can be seen as a measure of heterogeneity since it indicates the connectivity of low-order channels with respect to the high-order channels of the RN47,57. The parameter c is known as the Tokunaga parameter and is referred to as the c-value in the current manuscript (see Fig. 8(b)).
Critical nodes and vulnerability
Critical nodes are the set of nodes whose deletion maximizes the network fragmentation27,58,59,60. This can be achieved by minimizing the size of the largest remaining connected components or pairwise connectivity, i.e., the total number of node pairs connected by a path. This disruption metric can be used for understanding network vulnerability (i.e., optimal response of a network to an external attack) and protection (i.e., network defense) purposes. For this purpose, we have adopted the recently proposed critical node identification (CNI) framework27 on a suite of natural RNs obtained from digital elevation model (DEM) data to understand its vulnerability.
In order to identify critical nodes, a RN can be represented by a simple undirected graph \(G=\left( V,E\right)\) with a set of nodes V and links E. The links connecting node \(i\in V\) and \(j\in V\) are represented by a pair \(\left( i,j\right) \in E\). Let \(\mathbb {N}(i)=\{j:(i,j)\in E)\}\) denote the neighborhood of node i. We assume that up to k nodes in this graph are deleted as critical nodes. For any node \(i\in V\), we define the indicator variable \(v_i\) as
Then, for each pair of nodes \(i,j\in V\) \((i\ne j)\), we define the indicator variable \(u_{ij}\) as
The objective function, which quantifies the number of connected node pairs in the remaining graph, and the limit on the number of removed nodes can be expressed as \(\sum \limits _{i,j\in V}u_{ij}\) and \(\sum \limits _{i\in V}v_i\le K\), respectively. Critical nodes can then be determined via minimizing the objective function using linear integer programming. For more details on method to identify critical nodes see27,58,59,60,67.
In this study, the term vulnerability (i.e., the ability to hold integrity under external influence) is used to explain the dis-connectivity of the RN. We employ the fitted power-law exponent of the remaining pairwise connectivity vs removed critical node as a measure of vulnerability27.
Extraction of natural River Networks
We use RNs obtained from 56 natural basins located across the United States corresponding to different hydrologic and geomorphic conditions. These RNs were extracted from 1 m resolution digital elevation models (DEMs) using a curvature-based method49. The DEMs were obtained from National Elevation dataset website (https://apps.nationalmap.gov/viewer/). ArcGIS 10.4 and Python 3.1 were used to analyze the data and generate the figures (see Figs. 10 and 12).
Data availability
We utilized USGS LiDAR data to extract natural river networks. The processed DEM datasets of the natural basins are available in the Zenodo repository: https://doi.org/10.5281/zenodo.14799822.
References
Tucker, G. E. & Slingerland, R. Drainage basin responses to climate change. Water Resources Research 33, 2031–2047 (1997).
Tucker, G. E. Drainage basin sensitivity to tectonic and climatic forcing: Implications of a stochastic model for the role of entrainment and erosion thresholds. Earth Surface Processes and Landforms 29, 185–205 (2004).
Rodriguez-Iturbe, I., Muneepeerakul, R., Bertuzzo, E., Levin, S. A. & Rinaldo, A. River networks as ecological corridors: A complex systems perspective for integrating hydrologic, geomorphologic, and ecologic dynamics. Water Resources Research 45 (2009).
Hooshyar, M., Singh, A. & Wang, D. Hydrologic controls on junction angle of river networks. Water Resources Research 53, 4073–4083 (2017).
Bertuzzo, E. et al. On the space-time evolution of a cholera epidemic. Water Resources Research 44 (2008).
Czuba, J. A. & Foufoula-Georgiou, E. Dynamic connectivity in a fluvial network for identifying hotspots of geomorphic change. Water Resources Research 51, 1401–1421 (2015).
Zaliapin, I., Foufoula-Georgiou, E. & Ghil, M. Transport on river networks: A dynamic tree approach. Journal of Geophysical Research: Earth Surface 115 (2010).
Hansen, A. & Singh, A. High-frequency sensor data reveal across-scale nitrate dynamics in response to hydrology and biogeochemistry in intensively managed agricultural basins. Journal of Geophysical Research: Biogeosciences 123, 2168–2182 (2018).
Carrara, F., Altermatt, F., Rodriguez-Iturbe, I. & Rinaldo, A. Dendritic connectivity controls biodiversity patterns in experimental metacommunities. Proceedings of the National Academy of Sciences 109, 5761–5766 (2012).
Terui, A. et al. Metapopulation stability in branching river networks. Proceedings of the National Academy of Sciences 115, E5963–E5969 (2018).
Abed-Elmdoust, A., Miri, M.-A. & Singh, A. Reorganization of river networks under changing spatiotemporal precipitation patterns: An optimal channel network approach. Water Resources Research 52, 8845–8860 (2016).
Ranjbar, S., Hooshyar, M., Singh, A. & Wang, D. Quantifying climatic controls on river network branching structure across scales. Water Resources Research 54, 7347–7360 (2018).
Smith, T. R. & Bretherton, F. P. Stability and the conservation of mass in drainage basin evolution. Water Resources Research 8, 1506–1529 (1972).
Molnár, P. & Ramírez, J. A. Energy dissipation theories and optimal channel characteristics of river networks. Water Resources Research 34, 1809–1818 (1998).
Benda, L. et al. The network dynamics hypothesis: how channel networks structure riverine habitats. BioScience 54, 413–427 (2004).
Rinaldo, A., Rigon, R., Banavar, J. R., Maritan, A. & Rodriguez-Iturbe, I. Evolution and selection of river networks: Statics, dynamics, and complexity. Proceedings of the National Academy of Sciences 111, 2417–2424 (2014).
Hooshyar, M., Singh, A., Wang, D. & Foufoula-Georgiou, E. Climatic controls on landscape dissection and network structure in the absence of vegetation. Geophysical Research Letters 46, 3216–3224 (2019).
Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D.-U. Complex networks: Structure and dynamics. Physics reports 424, 175–308 (2006).
Chung, F., Lu, L. & Vu, V. The spectra of random graphs with given expected degrees. Internet Mathematics 1, 257–275 (2004).
Cvetkovic, D. M. & Rowlinson, P. Spectral Graph Theory. Topics in algebraic graph theory, eds. L. W. Beinke, R. J. Wilson, Cambridge University Press 88–112 (2004).
Gutman, I. Chemical graph theory–the mathematical connection. Advances in Quantum Chemistry 51, 125–138 (2006).
Van Mieghem, P. Performance analysis of communications networks and systems (Cambridge University Press, 2009).
Mohar, B. & Poljak, S. Eigenvalues in combinatorial optimization. In Combinatorial and graph-theoretical problems in linear algebra, 107–151 (Springer, 1993).
Bonetti, S., Bragg, A. D. & Porporato, A. On the theory of drainage area for regular and non-regular points. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 474, 20170693 (2018).
Tejedor, A., Longjas, A., Zaliapin, I. & Foufoula-Georgiou, E. Delta channel networks: 1. a graph-theoretic approach for studying connectivity and steady state transport on deltaic surfaces. Water Resources Research 51, 3998–4018 (2015).
Tejedor, A., Longjas, A., Zaliapin, I. & Foufoula-Georgiou, E. Delta channel networks: 2. metrics of topologic and dynamic complexity for delta comparison, physical inference, and vulnerability assessment. Water Resources Research 51, 4019–4045 (2015).
Sarker, S., Veremyev, A., Boginski, V. & Singh, A. critical nodes in river networks. Scientific reports 9, 1–11 (2019).
Abed-Elmdoust, A., Singh, A. & Yang, Z.-L. Emergent spectral properties of river network topology: An optimal channel network approach. Scientific reports 7, 11486 (2017).
Golub, G. & Van Loan, C. Matrix computations, vol. 3 baltimore. MD: JHU Press.[Google Scholar] (2012).
Nering, E. D. Linear algebra and matrix theory (University of Michigan, Tech. Rep., 1970).
Biggs, N., Biggs, N. L. & Norman, B. Algebraic graph theory. 67 (Cambridge university press, 1993).
Chun, F. Spectral graph theory. cbms regional conference series in mathematics. American Mathematical Society (1997).
Rai, A. et al. Understanding cancer complexome using networks, spectral graph theory and multilayer framework. Scientific reports 7, 1–16 (2017).
Fraleigh, J. B. A first course in abstract algebra (Pearson Education India, 2003).
Yuan, Z., Zhao, C., Di, Z., Wang, W.-X. & Lai, Y.-C. Exact controllability of complex networks. Nature communications 4, 2447 (2013).
Lovász, L. & Plummer, M. D. Matching theory, vol. 367 (American Mathematical Soc., 2009).
Istalkar, P. & Biswal, B. Streamflow prediction in ungauged basins: How dissimilar are drainage basins?. Journal of Hydrology 637, 131357 (2024).
Cvetković, D. M. & Gutman, I. M. The algebraic multiplicity of the number zero in the spectrum of a bipartite graph. Matematički vesnik 9, 141–150 (1972).
Gutman, I. & Borovicanin, B. Nullity of graphs: an updated survey. Zbornik Radova 14, 137–154 (2011).
Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Controllability of complex networks. nature 473, 167 (2011).
Horton, R. E. Erosional development of streams and their drainage basins; hydrophysical approach to quantitative morphology. Geological society of America bulletin 56, 275–370 (1945).
Rodriguez-Iturbe, I. & Rinaldo, A. Fractal river basins: chance and self-organization (Cambridge University Press, 2001).
Tarboton, D. G. A new method for the determination of flow directions and upslope areas in grid digital elevation models. Water resources research 33, 309–319 (1997).
Snijders, T. A. The degree variance: an index of graph heterogeneity. Social networks 3, 163–174 (1981).
Bell, F. K. A note on the irregularity of graphs. Linear Algebra and its Applications 161, 45–54 (1992).
Tokunaga, E. Consideration on the composition of drainage networks and their evolution. Geogr. Rep. Tokyo Metrop. Univ. 13, 1–27 (1978).
Ranjbar, S., Singh, A. & Wang, D. Controls of the topological connectivity on the structural and functional complexity of river networks. Geophysical Research Letters 47, e2020GL087737 (2020).
Roy, J., Tejedor, A. & Singh, A. Dynamic clusters to infer topologic controls on environmental transport of river networks. Geophysical Research Letters 47, e2021GL096957 (2022).
Hooshyar, M., Wang, D., Kim, S., Medeiros, S. C. & Hagen, S. C. Valley and channel networks extraction based on local topographic curvature and k-means clustering of contours. Water Resources Research 52, 8081–8102 (2016).
Arora, V. K. The use of the aridity index to assess climate change effect on annual runoff. Journal of hydrology 265, 164–177 (2002).
Budyko, M. I., Miller, D. H. & Miller, D. H. Climate and life, vol. 508 (Academic press New York, 1974).
Henning, D. & Flohn, H. Climate Aridity Index (Budyko-Ratio) (UN Environmental Project, 1977).
Ponce, V. M., Pandey, R. P. & Ercan, S. Characterization of drought across climatic spectrum. Journal of Hydrologic Engineering 5, 222–224 (2000).
Chen, X., Alimohammadi, N. & Wang, D. Modeling interannual variability of seasonal evaporation and storage change based on the extended budyko framework. Water Resources Research 49, 6067–6078 (2013).
Chen, X. & Sivapalan, M. Hydrological basis of the budyko curve: Data-guided exploration of the mediating role of soil moisture. Water Resources Research 56, e2020WR028221 (2020).
Biswal, B. Dynamic hydrologic modeling using the zero-parameter budyko model with instantaneous dryness index. Geophysical Research Letters 43, 9696–9703 (2016).
Zanardo, S., Zaliapin, I. & Foufoula-Georgiou, E. Are american rivers tokunaga self-similar? new results on fluvial network topology and its climatic dependence. Journal of Geophysical Research: Earth Surface 118, 166–183 (2013).
Veremyev, A., Prokopyev, O. A. & Pasiliao, E. L. An integer programming framework for critical elements detection in graphs. Journal of Combinatorial Optimization 28, 233–273 (2014).
Veremyev, A., Prokopyev, O. A. & Pasiliao, E. L. Finding groups with maximum betweenness centrality. Optimization Methods and Software 32, 369–399 (2017).
Arulselvan, A., Commander, C. W., Elefteriadou, L. & Pardalos, P. M. Detecting critical nodes in sparse graphs. Computers and Operations Research 36, 2193–2200 (2009).
Rigon, R., Rinaldo, A., Rodriguez-Iturbe, I., Bras, R. L. & Ijjasz-Vasquez, E. Optimal channel networks: a framework for the study of river basin morphology. Water Resources Research 29, 1635–1646 (1993).
Rodríguez-Iturbe, I. et al. Energy dissipation, runoff production, and the three-dimensional structure of river basins. Water Resources Research 28, 1095–1103 (1992).
Paik, K. & Kumar, P. Emergence of self-similar tree network organization. Complexity 13, 30–37 (2008).
Rinaldo, A., Rodriguez-Iturbe, I., Rigon, R., Ijjasz-Vasquez, E. & Bras, R. Self-organized fractal river networks. Physical review letters 70, 822 (1993).
Peckham, S. D. New results for self-similar trees with applications to river networks. Water Resources Research 31, 1023–1029 (1995).
Wang, K. et al. Side tributary distribution of quasi-uniform iterative binary tree networks for river networks. Frontiers in Environmental Science 9, 792289 (2022).
Pavlikov, K. Improved formulations for minimum connectivity network interdiction problems. Computers & Operations Research 97, 48–57 (2018).
Fiorini, S., Gutman, I. & Sciriha, I. Trees with maximum nullity. Linear algebra and its applications 397, 245–251 (2005).
Rojo, O. & Robbiano, M. An explicit formula for eigenvalues of bethe trees and upper bounds on the largest eigenvalue of any tree. Linear algebra and its applications 427, 138–150 (2007).
Liu, L. The spectral radius of the adjacency matrix of a complete binary tree as the number of nodes approaches infinity. In 2nd International Conference on Applied Mathematics, Modelling, and Intelligent Computing (CAMMIC 2022), vol. 12259, 322–328 (SPIE, 2022).
Acknowledgements
The authors acknowledge the support of STOKES advanced research computing center (ARCC) (webstokes.ist.ucf.edu) for providing computational resources. A.S. acknowledges partial support from NSF EAR-1854452 and EAR- 2342936. The LiDAR data were collected from the US Geological Survey website (https://lta.cr.usgs.gov/lidar_digitalelevation).
Author information
Authors and Affiliations
Contributions
S.S. conducted the simulations and wrote the first draft. S.S. and A.S. analyzed the results. A.V., V.B., S.P. and A.S. provided feedback in interpreting results. All authors contributed in preparing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: \(M_A\left( \lambda _0\right)\) for perfect binary tree networks
Appendix: \(M_A\left( \lambda _0\right)\) for perfect binary tree networks
A perfect binary tree network (PBTN) is one in which every node at the same height has 2 branches (degree 3), except for the leaf nodes at the maximum height (h). The total number of nodes for these tree graphs is given by \(2^h\) for trees with a root node (e.g., RNs) and \(2^h - 1\) otherwise. By utilizing Eq. (1) in conjunction with the characteristic polynomial attributes of matrix A, one can represent \(z_h + e_h = 2^h\), where \(z_h=M_A\left( \lambda _0\right)\) and \(e_h\) indicates the number of nonzero eigenvalues of A. In this case, h can be considered the highest order of RN. Therefore, we obtain the following recurrence relation for \(z_h\) based on the observation \(z_{h+1}=e_h\).
The general solution to this recurrence is
Therefore, for a rooted PBTN (e.g., RN), we have the values of \(z_h\) where \(C_1=-1\). Similarly, for \(h=2,3,4,5, \ldots\), the corresponding values of \(z_h\) are \(2,2,6,10, \ldots\).
For an unrooted, PBTN a slightly different recurrence holds
Hence, the general solution can be expressed as
As a result, for \(h=2,3,4,5, \ldots\), the corresponding values of \(z_h\) are \(3,5,11,21, \ldots\). Fiorini et al.68 have shown that for any tree graph with N nodes and maximum degree of D, the maximum \(z_h\) is
For binary trees, with \(D = 3\), this implies that
Applying this inequality to our rooted PBTN, we have
This is consistent with Eq. (11), and, in fact, this upper bound is equal to Eq. (11) when h is an even number. Rojo and Robbiano69 derived an explicit formula for the eigenvalues (and their multiplicities) of A for any Bethe tree. The unrooted PBTN is a special case with \(D=2\). In their notation, \(\kappa\) represents the number of levels in the tree, with the tree height given by \(h=\kappa - 1\). They demonstrated that all eigenvalues are expressed as:
where s ranges from 1 to \(h + 1\) and q ranges from 1 up to s. The largest eigenvalue (or spectral radius) occurs when the argument to the cosine is smallest, or when \(q = 1\) and \(s = h + 1\). Note that the limiting value of the spectral radius as h approaches infinity is therefore \(2\sqrt{2}\); also found by Liu70. The smallest eigenvalue occurs for an (q, s) pair such that \(\frac{q}{(s + 1)}\) is less than, but as close as possible, to 1/2. Rojo and Robbiano69 state that the multiplicity of an eigenvalue given by Eq. (17) is given by \(2^{h-s}\). Note that this is independent of q. Equation (17) give the same eigenvalue for different (q, s) pairs. In particular, the eigenvalue of 0 is attained whenever \(\frac{q}{(s + 1)} = \frac{1}{2}\), or \(q=\frac{(s + 1)}{2}\). So to obtain the multiplicity of the eigenvalue 0, it is necessary to compute the sum
where N(s) is the number of integer q-values for a given s such that \(q=\frac{(s + 1)}{2}\). We therefore find that \(N(s) = 1\) if s is odd, and \(N(s) = 0\) if s is even. For example, when \(h = 3\), we have \(N(1) = 1\), \(N(2) = 0\) and \(N(3) = 1\). Equation (18) gives the correct result when h is odd, however, requires the addition of 1 when h is even69.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Sarker, S., Singh, A., Veremyev, A. et al. Controllability and heterogeneity of river networks using spectral graph theory approach. Sci Rep 15, 13196 (2025). https://doi.org/10.1038/s41598-025-94886-2
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-94886-2














