Introduction

Let G = (V, E) denote a finite, simple, undirected graph of order n. A dominating set for G is a subset D of V with the property that every vertex in V − D has a neighbor in D. A minimum dominating set (MDS) is of course one of smallest cardinality. Its size is usually denoted by γ(G). Deciding MDS is both NP-complete1 and W[2]-complete2. It is easy to see that G may have as many as 3^(n/3) distinct MDS solutions, as is demonstrated by the union of n/3 disjoint triangles. A common strategy is therefore to concentrate on significance and classify a vertex as “essential” (aka “critical”) if it is used in every MDS, as “intermittent” if it is used in some but not every MDS, and as “redundant” if it is never used in any MDS.
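The 3^(n/3) bound can be checked directly on a toy instance. The following Python sketch (all names are ours; brute force stands in for a real solver) enumerates every minimum dominating set of two disjoint triangles, yielding 3^2 = 9 solutions in which every vertex is intermittent:

```python
from itertools import combinations

# Adjacency sets for two disjoint triangles (n = 6, so n/3 = 2).
tri2 = {0: {1, 2}, 1: {0, 2}, 2: {0, 1},
        3: {4, 5}, 4: {3, 5}, 5: {3, 4}}

def is_dominating(graph, d):
    """True if every vertex is in d or has a neighbor in d."""
    covered = set(d)
    for u in d:
        covered |= graph[u]
    return covered == set(graph)

def all_minimum_dominating_sets(graph):
    """Enumerate every MDS by brute force (exponential; small graphs only)."""
    for k in range(len(graph) + 1):
        sols = [set(c) for c in combinations(graph, k)
                if is_dominating(graph, set(c))]
        if sols:
            return sols
    return []

# One vertex per triangle suffices, and any of the 3 works in each: 3^2 = 9.
solutions = all_minimum_dominating_sets(tri2)
```

Each vertex appears in some but not all of the nine solutions, so all six vertices are intermittent.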

MDS has found purpose in a wide variety of application domains, spanning topics from network science3,4,5,6 to sensor placement7 to transportation streaming8. Compelling utility has been cast in the realm of systems biology, where MDS has been used to model the controllability of biological networks in research fields as diverse as cancer9,10,11, drug discovery12, gene regulation13, neuroscience14, protein interaction15,16,17, and viral infection18. Vertex classifications under MDS have even been used in the search for ncRNA’s latent regulatory role in polygenic human disease19.

Previous classification strategies examine vertices one by one, and thus invoke an MDS algorithm n or more times in the worst case. Efficiency may be achieved in the average case, however, by observing that a vertex is essential should it have two or more pendant vertices20 and redundant should all of its neighbors be essential21. The main results of this paper generalize and greatly extend these pioneering observations, providing five novel vertex classification rules that further decrease the number of times MDS must be solved. To accomplish this, we devise highly efficient techniques that can take advantage of neighborhood structure and, if desired, adjacency-preserving vertex permutations. Additionally, we report on experiments conducted over a variety of biological application domain graphs that help demonstrate the relative effectiveness of these innovative new methods.

The remainder of this paper is organized as follows. In the next section, we define some needed notation and briefly review prior work. In a third section, we devise four reduction rules, provide arguments for their soundness, and show how they can be employed to speed the task of vertex classification. In a fourth section, we introduce a fifth rule based on algebraic symmetry and discuss its practical potential. In a fifth section, we evaluate the effectiveness of these inventive rules on a variety of real-world biological problem instances. In a final section, we draw conclusions and consider a few directions for future research.

Preliminaries

Notation

Let u and v denote elements of V. The distance between u and v is the number of edges in a shortest path between them. The neighborhood of u, denoted by N[u], comprises u and its neighbors or, equivalently, those vertices within distance one from u. (This is sometimes called the closed neighborhood of u, in order to distinguish it from the open neighborhood N[u] − {u}.) Neighborhoods are extended to sets in a straightforward fashion. Thus, for a set S of vertices, N[S] denotes S and the neighbors of all its elements. An orbit is an equivalence class of vertices under the action of an automorphism group. That is, u and v belong to the same orbit if and only if there exists a relabeling of V that results in an isomorphic graph for which u and v have exchanged labels22. Finally, given an MDS, D, we say that u dominates v whenever u and v are adjacent, and u but not v is an element of D.
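With graphs stored as adjacency sets, the closed neighborhood operations above reduce to set unions. A minimal Python sketch (function names are ours):

```python
def closed_nbhd(graph, u):
    """N[u]: u together with its neighbors."""
    return {u} | graph[u]

def closed_nbhd_set(graph, s):
    """N[S]: the union of N[u] over all u in S."""
    out = set()
    for u in s:
        out |= closed_nbhd(graph, u)
    return out

# Path 0 - 1 - 2 as adjacency sets.
path3 = {0: {1}, 1: {0, 2}, 2: {1}}
```

For the path above, closed_nbhd(path3, 1) is the whole vertex set, while closed_nbhd(path3, 0) is {0, 1}.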

Prior work

The vertex classification problem has been studied20,21 using the two previously-mentioned observations coupled with an MDS algorithm that employs Integer Linear Programming (ILP). Despite the fact that known ILP methods can in principle require exponential time, a major appeal of this approach relies on the existence of powerful commercial ILP solvers that tend to work extremely well in practice. Thus, once an initial MDS, D, has been computed, one needs only to consider each vertex, u, in turn.

  • If u ∈ D, then construct an ILP instance of MDS with a constraint to exclude u. We refer to the resultant procedure as ILP-exclude, with parameters G and u. If γ(ILP-exclude(G,u)) exceeds γ(G), then u is essential, otherwise it is intermittent.

  • And if u ∉ D, then construct an ILP instance of MDS with a constraint to include u. We refer to the resultant procedure as ILP-include, also with parameters G and u. If γ(ILP-include(G,u)) exceeds γ(G), then u is redundant, otherwise it is intermittent.
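The overall loop can be sketched as follows. The paper formulates the constrained calls as ILPs solved by a commercial solver; here a brute-force search stands in, and for simplicity this sketch issues both constrained calls per vertex rather than choosing one based on membership in the initial MDS (all names are ours):

```python
from itertools import combinations

def is_dominating(graph, d):
    """True if every vertex is in d or has a neighbor in d."""
    covered = set(d)
    for u in d:
        covered |= graph[u]
    return covered == set(graph)

def min_ds(graph, include=frozenset(), exclude=frozenset()):
    """One smallest dominating set honoring the constraints (None if none
    exists). Brute force stands in for the ILP-exclude/include calls."""
    free = [v for v in graph if v not in include and v not in exclude]
    for k in range(len(free) + 1):
        for extra in combinations(free, k):
            d = set(include) | set(extra)
            if is_dominating(graph, d):
                return d
    return None

def classify(graph):
    """Label each vertex essential / intermittent / redundant."""
    base = len(min_ds(graph))
    labels = {}
    for u in graph:
        if len(min_ds(graph, include={u})) > base:
            labels[u] = 'redundant'    # forcing u in raises gamma: no MDS has u
        elif (d := min_ds(graph, exclude={u})) is None or len(d) > base:
            labels[u] = 'essential'    # forcing u out raises gamma: all do
        else:
            labels[u] = 'intermittent'
    return labels

# Star K_{1,3}: its unique MDS is the center.
star = {'c': {'a', 'b', 'd'}, 'a': {'c'}, 'b': {'c'}, 'd': {'c'}}
```

On the star, the center is classified essential and all three leaves redundant.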

Classifier A

For the sake of clarity and exposition, and to help explicate algorithmic comparisons, this procedure (previously unnamed in20,21) is presented here in pidgin code and dubbed Classifier A. We note that the exploitation of pendant vertices can be employed at start-up, while the examination of neighbors is best applied only after all essential vertices have been identified.

figure a

Classifier A requires low-order polynomial time to initialize C and R (an exact upper bound depends on graph density and the data structures used), exponential time for a call to an ILP solver to answer a single instance of MDS, and time for at most n exponential-time calls to ILP-exclude/include. Classifier A’s needs for extra space are negligible.

In search of a better classifier

Classification rules

Classifier A’s most time-consuming operations are its multitude of calls to ILP-exclude/include. We therefore propose, scrutinize, and employ a series of pre-processing rules in hopes that we can reduce the total number of calls required, thereby increasing the scalability of MDS-based biological network analytics.

Rule 1. Suppose u and v are adjacent, and the neighborhood of u is a proper subset of the neighborhood of v. If v is essential, then u is redundant.

Soundness. If an MDS contains v, then it cannot contain u, since otherwise the MDS would not be minimum. Thus, if every MDS contains v, then none can contain u. (Note the need for proper containment. If N[u] = N[v], then neither u nor v can be essential, and both must be redundant or both intermittent.)

Rule 2. If u is not essential, and if every element in u’s neighborhood is either essential or adjacent to an essential vertex, then u is redundant.

Soundness. This is a generalization of Rule 1, in which vertices in the neighborhood of u may be dominated by more than just a single essential vertex. Since every MDS contains every essential vertex, all of N[u] is dominated in every MDS even without u; an MDS containing u could therefore discard u and remain dominating, contradicting minimality.
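Both of these conditions reduce to comparisons of closed neighborhoods. A Python sketch (function names are ours):

```python
def rule1_pairs(graph):
    """Rule 1 candidates: adjacent (u, v) with N[u] a proper subset of N[v].
    If v is later shown essential, u is redundant."""
    return [(u, v) for u in graph for v in graph[u]
            if ({u} | graph[u]) < ({v} | graph[v])]

def rule2_redundant(graph, u, essential):
    """Rule 2: given u not essential, u is redundant when every vertex of
    N[u] is already dominated by the essential set."""
    dominated = set(essential)
    for e in essential:
        dominated |= graph[e]
    return ({u} | graph[u]) <= dominated

# Pendant u attached to v, which also neighbors a and b.
g = {'u': {'v'}, 'v': {'u', 'a', 'b'}, 'a': {'v'}, 'b': {'v'}}
```

Here N['u'] = {u, v} is a proper subset of N['v'], so (u, v) is a Rule 1 candidate; once v is known essential, Rule 2 confirms u redundant.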

Rule 3. Suppose u but not v is contained in an MDS for which those vertices dominated only by u are in the neighborhood of v. Then both u and v are intermittent.

Soundness. Replacing u with v produces a distinct but equivalent MDS: the vertices dominated only by u are now dominated by v, every other vertex remains dominated as before, and the cardinality is unchanged.
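The swap test amounts to computing u's private neighborhood with respect to the given MDS and checking containment in N[v]. A Python sketch (names are ours; we use closed neighborhoods throughout, which guarantees the swapped set remains dominating):

```python
def rule3_swap(graph, mds, u, v):
    """Rule 3: with u in the MDS and v outside it, check that every vertex
    dominated only by u lies in N[v]; if so, swapping u for v yields another
    MDS and both vertices are intermittent."""
    covered_by_others = set()
    for d in mds:
        if d != u:
            covered_by_others |= {d} | graph[d]
    private = ({u} | graph[u]) - covered_by_others   # only u covers these
    return u in mds and v not in mds and private <= ({v} | graph[v])

# Path 0 - 1 - 2 - 3 with MDS {1, 3}: vertex 3's private set is just {3}.
path4 = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
```

For the path, swapping 3 for 2 succeeds ({1, 2} is another MDS), while swapping 3 for 0 fails since 3 would go undominated.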

Rule 4. If u has neighbors v and w whose only common neighbor is u and for which (N[N[v]] ∩ N[N[w]]) ⊆ N[u], then u is essential.

Soundness. Because N[v] ∩ N[w] = {u}, and because u dominates every vertex in N[N[v]] ∩ N[N[w]], it follows that u is required in any MDS, since otherwise at least two vertices from N[v] ∪ N[w] would be required in its place to dominate v and w.
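Rule 4, too, is a pair of neighborhood comparisons over the neighbors of u. A Python sketch (names are ours):

```python
from itertools import combinations

def rule4_essential(graph, u):
    """Rule 4: u is essential if it has neighbors v, w with
    N[v] & N[w] == {u} and N[N[v]] & N[N[w]] a subset of N[u]."""
    def N(x):
        return {x} | graph[x]
    def NN(x):
        out = set()
        for y in N(x):
            out |= N(y)
        return out
    return any(N(v) & N(w) == {u} and NN(v) & NN(w) <= N(u)
               for v, w in combinations(graph[u], 2))

# Center of a path on three vertices must be in every MDS.
path3 = {0: {1}, 1: {0, 2}, 2: {1}}
```

On the three-vertex path, the rule fires for the center (its unique MDS is {1}) and for neither endpoint.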

These rules require only neighborhood explorations, and are thus amenable to illustration. Sample subgraph configurations are depicted in Figs. 1, 2, 3 and 4.

Figure 1

A sample subgraph subject to Rule 1. If v is essential, then u is redundant.

Figure 2

A sample subgraph subject to Rule 2. If every element in the set {u, v, w, x, y} is either essential or adjacent to an essential vertex, then u is either essential or redundant.

Figure 3

A sample subgraph subject to Rule 3. If u but not v is contained in an MDS, then v but not u is contained in some other MDS, and so both vertices are intermittent.

Figure 4

A sample subgraph subject to Rule 4. Vertex u must be essential.

Classifier B

We make use of these four rules in a procedure we name Classifier B. This new classifier need not invoke Classifier A, because the aforementioned observations upon which Classifier A relies are subsumed by Rules 2 and 4. On the other hand, the order in which rules are applied by Classifier B is important if we are to avoid calling MDS multiple times.

figure b

Classifier B’s resource requirements are similar to those of Classifier A. It needs low-order polynomial time to apply Rules 1–4 in the computation of C, I and R (an exact upper bound again depends on graph density and the data structures used), exponential time for a call to an ILP solver to answer a single instance of MDS, and time for at most n exponential-time calls to ILP-exclude/include. Classifier B’s needs for extra space are negligible.

Bolstered by Rules 1–4, it should come as no surprise that Classifier B provides a considerable improvement over Classifier A. We will demonstrate this convincingly in the sequel. But first we consider the possible utility of a more computationally demanding rule.

The use of algebraic symmetry

Orbits and automorphisms

In an effort to provide additional reductions in the number of ILP-exclude/include calls required, we turn to notions of graph structure, neighborhood symmetry, and adjacency-preserving vertex permutations.

Rule 5. If V is partitioned into a set of vertex orbits, then vertices within the same orbit must possess the same classification.

Soundness. Vertices within the same orbit are indistinguishable under automorphic transformation, and so their classifications will be identical.
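For intuition, orbits can be computed on small graphs by exhaustively testing every vertex permutation for edge preservation; the orbit of v is then its set of images under the accepted permutations. This brute-force Python sketch (names are ours) is exponential and intended only as illustration — practical instances require tools such as those discussed below:

```python
from itertools import permutations

def orbits(graph):
    """Vertex orbits under the automorphism group, by brute-force
    enumeration of all vertex permutations (exponential)."""
    verts = sorted(graph)
    edges = {frozenset((u, v)) for u in graph for v in graph[u]}
    autos = []
    for perm in permutations(verts):
        m = dict(zip(verts, perm))
        # Keep m only if it maps every edge onto an edge.
        if all(frozenset((m[a], m[b])) in edges for a, b in map(tuple, edges)):
            autos.append(m)
    # The orbit of v is the set of images of v over all automorphisms.
    return {frozenset(m[v] for m in autos) for v in verts}

# Path 0 - 1 - 2: the endpoints form one orbit, the center another.
path3 = {0: {1}, 1: {0, 2}, 2: {1}}
```

Under Rule 5, one ILP call classifying either endpoint of the path classifies both.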

Classifier C

We therefore study yet a third procedure, which we christen Classifier C. This new classifier operates as does Classifier B, except that it incorporates Rule 5 by first computing all orbits and then, whenever a vertex is classified, any unclassified vertices in its orbit are assigned the same classification.

Classifier C, like Classifier B, requires low-order polynomial time to apply Rules 1–4, exponential time to solve a single instance of MDS, and time for at most n exponential-time calls to ILP-exclude/include. Classifier C also needs low-order polynomial time to update orbit classifications. More significantly, it requires exponential time to determine the orbits themselves with known practical methods23. These orbits can be found using bliss24, nauty25, and a variety of other popular, well documented, easy-to-use tools. From these we chose saucy26,27, by virtue of the fact that it has been tuned for sparse graphs, which are overwhelmingly representative of large-scale biological data. And indeed, saucy was roughly 10–20 times faster than bliss and over 1000 times faster than nauty across our test suite. We hasten to add, however, that saucy requires a bit more effort to implement than does nauty or bliss. This is because saucy only returns vertex pairs that occupy the same orbit. The user must then merge these pairs to form a complete orbit set. Classifier C’s needs for extra space are negligible.
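The pair-merging step just described is a textbook union-find computation. A Python sketch (function name is ours; the pair list mimics the pairwise same-orbit output described above):

```python
def merge_orbit_pairs(n, pairs):
    """Merge same-orbit vertex pairs (reported pairwise, as by a tool such
    as saucy) into complete orbits, via union-find with path halving."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]   # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb
    orbits = {}
    for v in range(n):
        orbits.setdefault(find(v), []).append(v)
    return sorted(orbits.values())

# Pairs (0,1) and (1,2) chain into one orbit; 3 and 4 form another; 5 is fixed.
groups = merge_orbit_pairs(6, [(0, 1), (1, 2), (3, 4)])
```

Note that pairs need not be given transitively closed: (0, 1) and (1, 2) suffice to place 0, 1, and 2 in a single orbit.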

Classifier comparisons

Computational milieu

Classifiers A, B, and C were implemented in C++ and compiled using the g++ (GCC) version 4.8.5 compiler under the CentOS Linux 7 x86-64 operating system. Various mathematical optimization software packages were considered, including notable options such as CPLEX28 and Xpress29. From these we chose Gurobi30 for our ILP solver. It is a hugely successful, widely used, state-of-the-art commercial product. Moreover, Gurobi is freely available to many in the research community via academic site license. As in previous work, we used ILP to satisfy each classifier’s initial MDS requirement. Possible alternatives include the measure and conquer method of31, which runs in O(1.4864^n) time and polynomial space. We were careful to avoid reproducibility problems that might arise from complex parameter settings. Our classifiers take as input only finite simple graphs, and Gurobi was run with its default settings throughout.

In order to provide empirical comparisons at scale, all tests were executed on the Advanced Computing Facility (ACF) computational cluster maintained by the National Institute for Computational Sciences32. Timings were performed on a single core of ACF’s monster (big memory) node using a Dell PowerEdge R630 server, an Intel Xeon E5-2687W v4 processor (3.00 GHz, 30 MB Intel Smart Cache), 1,024 GB of DDR4 memory, and ACF's read/write Network File System.

Three dozen challenging graphs were assembled to form a comprehensive classifier test suite. Graphs that populate this suite were obtained from well-known repositories and derived from transcriptomic, proteomic, epigenetic, and a variety of other sorts of biological data. We excluded from this suite any graph on which a classifier failed to finish within 24 h, which generally seemed to result from exceptional size or, less frequently, from unusual density. The graphs thus selected are described in Table 1. Runtimes per instance and classifier are displayed in Table 2.

Table 1 A test suite of real-world biological graphs.
Table 2 Run times for each test suite instance and each classifier, measured in seconds.

Empirical results

We first studied preprocessing, with success measured as a percentage of vertices classified without an ILP-exclude/include call. Over our test suite, Classifier A had an average success rate of only 14.1%. In contrast, Classifier B had an average success rate of 67.2%, while Classifier C had an average success rate of 72.5%. As expected, Rules 1–5 thus seem to place Classifiers B and C at an enormous computational advantage. See Fig. 5.

Figure 5

Percent of vertices classified without ILP-exclude/include calls by Classifiers A (in green), B (in red), and C (in blue). Dashed lines represent averages, which were 14.1%, 67.2%, and 72.5% for Classifiers A, B, and C, respectively.

We then turned to overall processing times. Unsurprisingly, we found that Classifier A was simply not competitive. Its meager preprocessing success rate placed too great a burden on mathematical optimization software. The computational demands of Rule 5, however, posed a pivotal question: is Classifier C’s modest reduction in ILP-exclude/include invocations a smart investment? In other words, do Classifier C’s time-consuming orbit computations translate into runtimes that are better than those of Classifier B? The answer is hardly obvious. Even with a leading-edge graph automorphism package such as saucy, it can be exceedingly difficult to compete against ILP computations performed by a well-honed commercial product like Gurobi. Because runtimes varied greatly over the graphs in our test suite, we normalized all completion times to that of Classifier A. Resultant calculations revealed that, on average, Classifiers B and C were more or less in a dead heat. Classifier B took roughly 38.2% as long as Classifier A, while Classifier C took some 37.9% as long. Thus, under these experimental conditions, the overall impact made by adding Rule 5 was positive but barely noticeable. See Fig. 6.

Figure 6

Overall runtimes of Classifiers B (in red) and C (in blue), normalized to that of Classifier A (in green). Dashed lines are almost collinear and represent averages, which were 38.2% and 37.9% for Classifiers B and C, respectively.

It is difficult from these results to argue against the use of either Classifier B or Classifier C. Both are vastly more effective than Classifier A. And while Classifier B is the simpler of the two, Classifier C was able to eke out a slight gain in speed. Having said that, we must remember that this endorsement is dependent on both our test suite and the computational resources available. Classifiers B and C were highly competitive. Different datasets, alternative applications, or a change in automorphism software may cause the added overhead and complexity of Classifier C to have a much greater effect, either positive or negative, than was observed here. These experiments in fact prompt a few serendipitous dataset observations, which we will discuss in the final section.

Discussion

Conclusions

Major contributions of this paper include the development, analysis, implementation, and testing of five novel classification rules and two highly innovative classifier algorithms with which vertex significance can be gauged in a network domination setting. Extensive empirical evidence of the practical usefulness of these powerful new rules and classifiers was also generated using a comprehensive test suite centering on life science applications and biological data.

Classifiers B and C turn out to be huge improvements over Classifier A in terms of both preprocessing rates and overall runtimes. Their relative effectiveness would have been even more pronounced had we not had access to a commercial ILP solver with the exceptional efficiency of Gurobi. Results from our extensive test suite suggest that Classifiers B and C are very nearly equal in performance. Although Classifier C was faster by a narrow margin, users may wish to give Classifier B a slight nod for its comparative simplicity.

Patterns seen in results and data may be of additional interest. We observe, for example, the modest MDS size of chromatin interaction data (test graphs 1–9). Concomitantly, these are the only graphs for which the preprocessing performed by Classifier C is significantly better than that of Classifier B. It seems plausible that this rather curious situation might be attributable to graph density, but most biological data is sparse, and indeed these graphs are roughly as sparse as all others in our test suite. We therefore turned to degree distributions and found that the chromatin interaction histograms appear normalesque and not scale-free like histograms for the rest of our test suite. Whether this is causative is unknown. We found it interesting too that all classifiers were unusually successful in preprocessing graph 25 (bio-grid-worm). Upon investigation, we discovered that this graph has an extremely high number of redundant vertices. Whether this attribute relates to better preprocessing is unclear. And finally, graph 36 (bn-mouse-retina-1) caught our attention because it was especially difficult for all classifiers, and yet its MDS is about the same size as those of the chromatin interaction graphs. Other than idiosyncrasies of data capture (neuronal connections imaged by electron microscopy), we can posit no particular basis for its computational recalcitrance.

Directions for future research

The rules we have devised assign a single MDS classification to any vertex. It is sometimes possible, however, to eliminate one classification option, making it reasonable to envisage more convoluted rules that assign a pair of classification choices to some vertices. As we have seen with Rule 5, however, the overhead and complexity of such a strategy must not be so high that it negates any meaningful gains.

MDS vertex classifications may find additional utility among problem variants. The study of independent dominating set, for instance, is a restatement of maximal independent set, and can be traced back roughly 60 years33. Other classic examples include connected dominating set34 and total dominating set35. Vertex classification strategies may also be of interest when data is drawn from reduced graph families. Limiting inputs to planar graphs, for example, is a popular restriction in circuit layout and many other engineering applications, although in our opinion this sort of limitation would be difficult to motivate from a biological perspective.

It might also be instructive to consider the relationship between orbit distributions and graph structure. For example, those who embrace the once-popular scale-free hypothesis36 might predict that orbits would be found primarily among leaves that share a common neighbor. As a simple test, we therefore scanned the non-singleton orbit lists and computed the percentage of these lists that contained non-leaf vertices for each graph in our test suite. These values turned out to range more or less uniformly between 4% and 100%. Unsurprisingly, it thus appears that the utility of automorphic transformation is highly data dependent, and that the extent to which Rule 5 applies is primarily a function of the particular graph under examination. This would seem to suggest that the relationship between orbits and the topology of graphs derived from biological data might warrant future study.

Finally, while our focus has been on practical applications, numerous theoretical questions beckon. We think it highly probable, for example, that classification strategies such as those we have developed here may prove useful for combinatorial problems other than MDS. Rule 5, in particular, seems to have something of a universal appeal. Another good example rests with worst-case classifier behavior. Each method we have considered could in principle invoke an MDS solver as many as n + 1 times. Classifier A in fact did exactly this, for instance, on test graph 5 (HiC-Net-10). Classifiers B and C, on the other hand, never even came close to this sort of pathology. We think it is highly unlikely that real-world biological data of sufficient size would cause either of these classifiers to be so completely ineffective. To the best of our knowledge, however, the sort of worst-case performance that might be attained with highly contrived data remains unknown.