Novel drug target identification for the treatment of dementia using multi-relational association mining

Nguyen, Thanh-Phuong; Priami, Corrado; Caberlotto, Laura

doi:10.1038/srep11104

Download PDF

Article
Open access
Published: 08 July 2015

Novel drug target identification for the treatment of dementia using multi-relational association mining

Thanh-Phuong Nguyen^1,2,
Corrado Priami^1,3 &
Laura Caberlotto¹

Scientific Reports volume 5, Article number: 11104 (2015) Cite this article

5713 Accesses
12 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Dementia is a neurodegenerative condition of the brain in which there is a progressive and permanent loss of cognitive and mental performance. Despite the fact that the number of people with dementia worldwide is steadily increasing and regardless of the advances in the molecular characterization of the disease, current medical treatments for dementia are purely symptomatic and hardly effective. We present a novel multi-relational association mining method that integrates the huge amount of scientific data accumulated in recent years to predict potential novel targets for innovative therapeutic treatment of dementia. Owing to the ability of processing large volumes of heterogeneous data, our method achieves a high performance and predicts numerous drug targets including several serine threonine kinase and a G-protein coupled receptor. The predicted drug targets are mainly functionally related to metabolism, cell surface receptor signaling pathways, immune response, apoptosis and long-term memory. Among the highly represented kinase family and among the G-protein coupled receptors, DLG4 (PSD-95) and the bradikynin receptor 2 are highlighted also for their proposed role in memory and cognition, as described in previous studies. These novel putative targets hold promises for the development of novel therapeutic approaches for the treatment of dementia.

Dementia incidence varied by anticancer drugs and molecular targeted therapy in a population-based cohort study

Article Open access 30 July 2024

Recent advances in Alzheimer’s disease: mechanisms, clinical trials and new drug development strategies

Article Open access 23 August 2024

AI-based differential diagnosis of dementia etiologies on multimodal data

Article Open access 04 July 2024

Introduction

Neurodegenerative dementia (ND) is a multi-faceted cognitive impairment that is progressive and irreversible due to deterioration of brain cells and their interconnections. It involves multiple cognitive deficits manifested by memory impairment and cognitive disturbances. The understanding of the genetic basis of ND has advanced in recent years, giving some insights into disease pathophysiology, but there are still major knowledge gaps in understanding the molecular mechanism underlying dementia. Dementia can be caused by a wide variety of diseases including more frequent pathologies such as Alzheimer’s disease, but also rare ones including Pick’s disease. Despite the high prevalence of dementia in the population, no drug treatments are available that can provide a cure. The two main classes of drugs available to treat Alzheimer’s disease, cholinesterase inhibitors and NMDA receptor antagonists, can only ameliorate the symptoms, or temporarily slow down the disease progression¹, but they are not efficacious in treating the disease. Thus, due to the constant and rapid increase of life expectancy with an epidemic progression of neurodegenerative disorders, particularly Alzheimer’s disease², it becomes very urgent to understand the molecular basis of dementia and to develop novel efficacious treatments.

The identification of novel drug targets (DTs) is of great importance for the development of new pharmaceutical products³, but the traditional drug discovery process is often laborious and expensive⁴. Systems biology can contribute to this field of research through an integrated view, capturing the complexity of the systems and integrating the huge amount of scientific data accumulated and archived in recent years. In such a situation, computational methods have become more and more essential to mine high-throughput data and discover useful knowledge for drug discovery in general and drug target identification in particular^3,5,6,7,8,9. Among a wide range of approaches, the molecular network-based approach has the potential for the identification of DTs^8,10. Molecular networks are very informative in studying human diseases and drugs because it is well-known that most molecular components do not perform their biological function in isolation, but interact with other cellular components in an intricate interaction network^11,12,13. Emig et al. employed the network propagation and random walk method to predict DTs¹⁴. The domain-tuned-hybrid method was proposed to infer the network of drug-target interactions¹⁵. By analyzing human protein-protein interaction network, Milenković et al. developed a graphlet-based measure of network topology to predict potential drug targets¹⁶. Although previous works have been paving the way to the prediction of DTs, there exists a limiting factor in such data-intensive work due to the use of a single data source. Instead, it is essential to integrate the rich sources of -omic data (from the molecular to the network level) to acquire a comprehensive coverage of biomedical properties relevant to drug discovery.

In this study, we present a novel integrative approach to predict potential new drug targets for dementia based on multi-relational association mining (MRAM), an advanced data mining technique able to manipulate heterogeneous data without any information loss. The diseases studied are: Frontotemporal dementia (FTD), Alzheimer disease (AD), Lewy bodies disease (LBD), Progressive supranuclear palsy (PSP), Corticobasal dementia (CBD), Pick’s disease, Prion disease, Huntington’s disease and Amyotrophic lateral sclerosis-Parkinsonism/dementia complex. The investigation was based on the list of known dementia DTs curated in¹⁷ with the integration of protein interaction network (PIN) and biological data from the Reactome, Gene Ontology and InterPro databases. MRAM combined multiple relational data and achieved a better computational performance than other data mining techniques. Our method was able to predict novel DTs by inferring predictive association rules that were used to run testing experiments on the set of putative DTs that have direct interactions with both dementia-related genes and dementia DTs in the PIN described in¹⁷.

Our systems biology approach identified a series of potential novel DTs functionally associated to metabolism, cell surface receptor signaling pathways, immune response, apoptosis and long-term memory. Among the predicted DTs, numerous serine threonine kinases, such as DLG4 (PSD-95) and a G-protein coupled receptor, Bradykinin receptors 2, were highlighted which could be considered for the development of innovative therapeutic approaches for the treatment of dementia.

Materials and Methods

The pipeline that we applied is presented in Fig. 1 and consists of five steps as follows.

1
Extraction of molecular targets of drugs in different phases of the drug discovery process (from preclinical to marketed drugs);
2
Construction of a protein interaction network including the 1-step neighbors of DTs;
3
Integration of heterogeneous data from multiple databases (listed in Table 1);
Table 1 Reference databases used for data retrieval during the investigation.
Full size table
4
Induction of association rules for DT prediction by using the MRAM algorithm;
5
Biological interpretation of the predicted DTs.

Curation of dementia-related drug targets

Drug molecular targets were obtained by collecting information from different pharmaceutical company websites, from a clinical trial database (www.clinicaltrials.gov) and from the DrugBank database¹⁸. Drugs for the treatment of dementia in all phases of the drug discovery process, from preclinical to marketed drugs, were included. Although this approach is considering targets with lower (drugs in preclinical phases) and higher (marketed drug) level of confidence, it allowed obtaining the broadest coverage of the genes of interest for pharmaceutical drug development to identify the overall key molecular targets of interest for the treatment of dementia. We did not consider the overall pharmacological activity of the compounds, but only the primary targets of the drugs. From the set of DTs, we converted gene symbols to UniProt protein accessions using the identifier mapping scheme provided by the UniProt database¹⁹, obtaining the set of 268 DT proteins reported in Supplementary Material S1.

Construction of the interaction network of drug targets

PINs are becoming increasingly comprehensive and they provide a better way for the understanding of the interaction among molecules than gene networks^11,20. Our PIN was obtained from the Interologous Interaction Database (i2d)²¹. The i2d database stores two types of interactions: the source interactions curated from the majority of well-known data sources such as HRPD, BIND, BioGrid, DIP, IntAct and the predicted interactions obtained by a homology-based approach. To increase the reliability of the protein interaction data we only considered the 183,524 source interactions homo sapiens-related.

Based on the set of mapped DTs, we extracted the PIN by processing raw data of protein-protein interactions (PPI) in the i2d database. The final PIN of interest contained the DTs (nodes) and their direct interactions (edges). In this study, we considered one-step neighbors. The network was undirected and unweighted because we considered the binary interactions. Figure 2 illustrates the resulting PIN of dementia DTs. Protein identifiers are the UniProt ID accession number and gene identifiers are represented by the official gene symbols.

Combination of heterogeneous data from multiple data sources

We considered both topological relationships between the DTs and their network neighbors and functional data representing biological properties of the DTs.

Regarding the topological data features, we calculated the number of a protein’s neighbors in our PIN, i.e. the degree centrality index of proteins formally defined as the cardinality of the set DC(p_i) = {p_j ∈ N |e_ij ∈ E}, where e_ij denotes an interaction connecting p_i and p_j and E is the set of interactions. Degree centrality is one of the main measures used to study hubs of a network. We also considered articulation proteins. A protein is an articulation point in a network iff removing it (and the interactions through it) disconnects the network. These topological properties help elucidating the role of the DTs in the PIN.

Regarding the functional data features, we investigated three different kinds of properties: GO term, biological pathway and protein domain. The GO terms in the Gene Ontology database²² are divided into three categories: molecular function, biological process and cellular component and this information is used for the DT prediction. Since DTs most likely are part of the same cellular pathways, data extracted from the Reactome pathway database²³ were analyzed. Protein domains are defined as structural or functional elements within a protein and affect the way that one protein interacts with one another. The protein domains of the DTs were obtained from the InterPro database²⁴.

A multi-relational scheme was structured in form of tables and relationships between tables in the SQL Server Database Management to store our knowledge base as described above. The multi-relational scheme shows its advantage in the data integration because the data types are heterogeneous: GO terms, pathways and protein domains are categorical free text while the degree centrality indexes are numerical and the articulation point feature is boolean.

Prediction of drug targets using multi-relational association mining

MRAM and its applications

Most of the existing data mining algorithms seek data patterns in single tables. However, many datasets are inherently multi-relational and the information systems that manage them rely on multi-relational databases (MRDs). Multi-relational data mining (MRDM) open the way for handling and mining data in multiple tables (relations) directly in a MRD^25,26,27. In MRDM, data are represented in a relational form where the records of the target table are potentially related to several records in secondary tables in one-to-many or many-to-many relationships. Three popular MRDM techniques are classification, clustering and association. Association techniques (called multi-relational association mining - MRAM) have been successfully applied in bioinformatics, for example the analysis of gene set enrichment²⁸, the prediction of hepatitis patients²⁹, the analysis of different types of cancers based on microarray data³⁰, the detection of potential adverse drug reactions³¹ and the prediction of protein interactions³². MRAM mines the data directly in their original structure of multiple relational tables, not requiring any pre-processing stage to generate a single table as in classical association mining (AM) algorithms like Apriori³³ and FP-growth³⁴. We developed an MRAM approach to exploring multiple data from a wide range of data sources to predict the dementia DTs.

Predicting drug targets using MRAM

The extracted data from GO, i2d, InterPro and Reactome are represented as relational tables in the Microsoft SQL server management system. Later one-to-many or many-to-many relationships among the tables were established. For example, many proteins may have the same degree centrality (one-to-many relationship) and on the other side a protein may belong to many Reactome pathways and one Reactome pathway has many proteins involved (many-to-many relationship). Figure 3 illustrates an example of extracted data in a multi-relational table form where ‘pathway’, ‘protein’ and ‘degree’ are three entity types, shown at the left-hand side of the figure. There is a relationship type between ‘pathway’ and ‘protein’, specifying the pathways that the proteins take part in and between ‘protein’ and ‘degree’, specifying the degree centrality corresponding to a protein. The first relationship type is a many-to-many while the second is one- to-many. Note that the heterogeneity in the data makes the use of classical AM unsuitable in mining multiple relational data and underlines the importance of using an MRAM algorithm for the identification of the DTs.

The MRAM algorithm to explore data in multiple relational tables was employed by using the SQL Server 2012 Analysis Services (SSAS). The MRAM algorithm in the SSAS package uses optimization techniques to save space and make processing faster. Similar to traditional AM, MRAM handles data as items and group of items, called itemset. An association model consists of a series of itemsets and rules of the form X ⇒ Y, where X and Y are disjoint itemsets X ∩ Y = ∅. The algorithm finds rules within a dataset based on two parameters, support and probability/confidence. Support is the occurrence frequency of the targeted item or itemset in a given dataset. Probability is the co-occurrence frequency of items in Y and X.

The inputs of the algorithm are (1) positive training examples as the set of known DTs, denoted S^pos, (2) negative training examples as the set of non-DT selected randomly from the set of proteins which do not belong to S^pos, denoted S^neg and (3) a five-dimension vector representing degree centrality f_deg, articulation point f_art, GO term f_GO, Reactome pathway f_path and protein domain f_dom. The MRAM algorithm traverses the input dataset in multiple tables to find items that appear together in a case. The algorithm then groups into itemsets any associated items that have support greater than a threshold MINIMUM_SUPPORT. Probability is calculated for each rule and the algorithm restricts the number of rules based on a threshold parameter MINIMUM_ PROBABILITY. The resulting rules were used to infer new putative DTs.

Gene ontology and pathway analysis

We run the proposed method to predict putative DTs from the set of connector genes identified in our previous work¹⁷. Connector genes being directly linked to both the dementia disease genes extracted from the OMIM database³⁵ and the DTs extracted in this study are likely to have more chance to be relevant for the disease and, thus, for being potential DTs.

The newly predicted DTs were used to extract the most representative GO biological process terms (i.e., the ones that are over-represented, but that do not refer to most general biological processes). For identifying and visualizing enriched GO terms, we used GOrilla³⁶ and REVIGO web-based tools³⁷. Hypergeometric distribution was applied to test GO term enrichment and a p-value threshold of 0.05 was selected. Pathway enrichment analysis and disease association was performed using DAVID web-based tool³⁸.

Results

221 DTs out of 268 known DTs have at least one interacting neighbor in our PIN. Therefore the positive example S^pos consists of 221 DTs. Since there are no available negative examples (non-DTs), we randomly selected three sets of negative examples with different sizes (221, 500 and 1,000) to build the set S^neg. The approach applied to different size examples proved to be scalable and stable. We obtained a network of 3,112 proteins and 6,541 interactions that is well connected with the average shortest path length equal to 3 and the average number of neighbors equal to 4. We obtained 44,433 GO terms, 11,738 InterPro domains and 4,240 Reactome pathways related to the 3,122 proteins.

To evaluate the performance of our method, we computed the lift charts for MRAM, Decision Tree (D-Tree), Naïve Bayes (NB) and Neural Network (NN) methods. These charts show how the model performs for all states of the predictable attribute. Figure 4A–D shows the lift charts of D-Tree, NB, NN and MRAM with 500 negative examples. The lift chart of MRAM is closer to the ideal model than the other three methods. MRAM also achieved a better accuracy (93%) than D-Tree, NB and NN that achieved 89%, 83% and 88%, respectively. We then performed the 10-fold cross validation to compute Likelihood Log Score, Likelihood Lift, Likelihood Root Mean Square Error (RMSE)³⁹ and area under the curve (AUC)⁴⁰. Recall that the higher AUC, Likelihood Log Score, Likelihood Lift and the lower RMSE, the better performance. The experiments were performed for the four methods on the same set of data. Table 2 presents the average values calculated for 10 experiments corresponding to the 10-fold cross validation of the above-mentioned measures and the standard deviations calculated for the methods with the three sets S^neg of 221, 500 and 1,000 negatives. In all experiments, MRAM performed better than the other methods. To evaluate the contribution of each data feature, we did several experiments by excluding the features one-by-one and then computing the likelihood lift. Table 3 shows that the experiment with all data combined achieved the best result and the next was the one excluding InterPro domain feature. The worst likelihood lift was obtained when excluding topological features. As a result, the topological feature contributes most and the InterPro domain feature contributes least to the method.

Table 2 Computational measures calculated for the four methods with the three sets of negative examples with different sizes n₁, n₂, n₃.

Full size table

Table 3 Performance of MRAM in term of likelihood lift with different subsets of data features and the three sets of negative examples with different sizes n₁, n₂, n₃.

Full size table

Figure 5 shows some of the induced rules with probability equal to 1 in three columns: Probability, Importance and Rule. For example, the rule “GO:0005887 = C:integral to plasma membrane, Degree = 10–77, REACT_111102 = Signal Transduction ⇒DT = Y” has probability = 1 and importance = 0.514. The rule shows that one protein has chance to be a DT if it is integral to the plasma membrane, is central with at least 10 interactions (up to 77 interacting proteins) and takes part in the signal transduction pathway. The probability describes how likely the result of a rule is to occur. The importance measures the significance of a rule. The importance of a rule is calculated by the log likelihood of the right-hand side of the rule, given the left-hand side of the rule. For example, in the rule A ==> B, MRAM calculates the ratio of cases with A and B over cases with B but without A and then normalizes that ratio by using a logarithmic scale. Indeed a rule with high probability might be too general to provide useful information. The greater importance, the more significant the rule is. The top rules with probability = 1 and importance > 0.5 are presented in Supplementary File Table S2.

Functional enrichment analyses of GO biological process terms was performed for the list of predicted DTs, showing metabolic-related terms (regulation of glucose transport and of insulin signaling), cell surface receptor signaling pathways (Wnt, neurotrophin, MAPK cascade and tachykinin receptor signaling), immune response-related terms (innate immune response and toll-like receptor singalling), apoptosis and long-term memory (Fig. 5; Supplementary File Table S3).

Pathway analysis revealed similar results as the GO, but in addition indicates Alzheimer disease-amyloid secretase pathway, type 2 Diabetes and metabotropic glutamate receptor group I pathways as associated with the predicted DTs (Supplementary File Table S4).

Discussion

In recent years, wealth of information has been produced on neurodegenerative dementia and particularly on Alzheimer’s disease and the integration of these data to obtain novel knowledge is one of the big challenges in modern neurobiology. This investigation employed a data mining method on heterogeneous data including categorical free text (i.e. GO terms and pathways), numerical values (i.e. the degree centrality and number of domain-domain interactions) and boolean values (i.e. articulation protein). The data were managed in a multiple relational database for which classical AM methods do not provide a suitable platform. MRAM is the most recent approach which aims to overcome the difficulties in multi-relational data integration. It enables direct pattern extraction from multiple relations, without the necessity of transferring data to a single relation^25,28, thus avoiding computationally expensive joining operations and semantic losses caused by the representation limit of a single table with repetitions of many attributes and data. Because this merged table is large and sparse, the mining process becomes more expensive and time-consuming⁴¹. The experiments on the DT prediction showed that MRAM was the best among other well-known data mining techniques: DT, NB and NN methods. With the rapid growth of public biological databases, MRAM can be widely applied to discover complex patterns through the rich relational structure and the mixed-up types of data.

A significant enrichment in Alzheimer disease-amyloid secretase pathway (Panther:P00003), the hallmark of Alzheimer’s disease⁴² was evidenced by pathway analysis of the predicted DTs giving further support to the relevance of our findings to dementia. In addition, a significant enrichment in biological functions associated to long-term potentiation (Fig. 6), a phenomenon related to synaptic plasticity, one of the most important cellular mechanisms that underlies learning and memory, was also found. Proteins involved are PKC, PKA, ERK1/2, Rsk and CAMK2 (Supplementary File 3). These results are in line with recent studies suggesting that activation of protein kinase C could have potential for the treatment of dementia⁴³.

Interestingly, an association with type 2 diabetes pathway (KEGG:hsa04930) was also evidenced. Type 2 Diabetes is a major risk factor for Alzheimer’s disease and dementia and the concept that Alzheimer’s is fundamentally a metabolic disease that results in progressive impairment in the brain’s capacity to utilize glucose and respond to insulin/insulin like growth factor stimulation has recently gained increasing support^44,45. Thus, in line with ours^17,46 and other groups’^47,48 previous findings, these results further emphasize the strong link between Alzheimer’s disease and, more generally, neurodegenerative dementia, to metabolic disorders and diabetes. This finding may suggest that a dementia drug could be used as treatment platforms for both diseases and their co-morbidities in view of the overlapping molecular pathway.

Considering the most enriched GO terms and pathway, a major role for MAPK (KEGG: hsa04010) is evident (Figs. 6 and 7). The putative role of p38 MAPK as a new Alzheimer’s disease treatment strategy has emerged in recent years. p38 MAPK operates not only in response to stress and inflammatory reactions, but also in other events related to AD, such as excitotoxicity, synaptic plasticity and tau phosphorylation⁴⁹.

Finally, considering the degree index (Table 4), DLG4 (Discs, Large Homolog 4 or PSD-95), a postsynaptic marker playing a basic role in synaptic transmission by anchoring NMDA receptors and interacting with nNOS^50,51, was the protein with the highest degree centrality. Its link to dementia, particularly in cognitive performances is also supported by animal studies: mice lacking PSD-95 have severe spatial memory deficits⁵², while mice exposed to enriched environments with improved learning and memory, have elevated PSD-95⁵³. Finally, DLG4 has been linked to a genetic form of dementia: familial Danish dementia⁵⁴. A high affinity molecule acting on PSD-95 has been previously identified and could possibly be used for Alzheimer’s disease as also suggested by Bach and collaborators.⁵⁵.

Table 4 List of predicted drug targets with UniProt ID, Official gene symbol, Gene name and Degree centrality.

Full size table

Most of the putative DTs are kinases, particularly serine threonine kinases. In drug development, achieving selective inhibition of specific protein kinases is challenging since most small-molecule kinase inhibitors interact with multiple members of the protein kinase family⁵⁶. Thus, among the predicted DTs in the G protein-coupled receptor family, a possible protein of interest could be the bradikynin receptor 2 (BDKRB2). Bradikynin and related kinins are a family of small peptides which act as mediators of inflammation and pain and that transmit their biological effects via G protein-coupled receptors through the action on two bradykinin receptors, the B1 and the B2 subtypes⁵⁷. Previous studies support the proposed effectiveness of an action on BDKRB2 for the treatment of dementia, demonstrating the ability of a BDKRB2 antagonist (HOE 140) in reversing the spatial learning and memory deficits induced by Aβ peptide in an animal model of Alzheimer’s disease⁵⁸. Thus, further studies are needed in this direction to confirm the validity of this target for future development.

Conclusions

Our systems biology approach was able to integrate previous existing knowledge in dementia to identify novel molecular targets for the development of innovative therapeutic intervention. A series of kinase including DLG4 (PSD-95) and BDKRB2, a G protein-coupled receptor of the kinin family were identified, but further studies are needed to confirm this finding and the druggability of the proposed targets.

Additional Information

How to cite this article: Nguyen, T.-P. et al. Novel drug target identification for the treatment of dementia using multi-relational association mining. Sci. Rep. 5, 11104; doi: 10.1038/srep11104 (2015).

References

Di Santo, S. G., Prinelli, F., Adorni, F., Caltagirone, C. & Musicco, M. A meta-analysis of the efficacy of donepezil, rivastigmine, galantamine and memantine in relation to severity of Alzheimer’s disease. J. Alzheimers. Dis. 35, 349–61 (2013).
CAS PubMed Google Scholar
Takizawa, C., Thompson, P. L., van Walsem, A., Faure, C. & Maier, W. C. Epidemiological and Economic Burden of Alzheimer’s Disease: A Systematic Literature Review of Data across Europe and the United States of America. J. Alzheimers. Dis. 43, 1271–84 (2014).
Google Scholar
Rask-Andersen, M., Almén, M. S. & Schiöth, H. B. Trends in the exploitation of novel drug targets. Nat. Rev. Drug Discov. 10, 579–90 (2011).
CAS PubMed Google Scholar
Dickson, M. & Gagnon, J. P. Key factors in the rising cost of new drug discovery and development. Nat. Rev. Drug Discov. 3, 417–29 (2004).
CAS PubMed Google Scholar
Dudley, J. T., Deshpande, T. & Butte, A. J. Exploiting drug-disease relationships for computational drug repositioning. Brief. Bioinform. 12, 303–11 (2011).
CAS PubMed PubMed Central Google Scholar
Jin, G., Zhao, H., Zhou, X. & Wong, S. T. C. An enhanced Petri-net model to predict synergistic effects of pairwise drug combinations from gene microarray data. Bioinformatics 27, i310–6 (2011).
CAS PubMed PubMed Central Google Scholar
Nickel, J. et al. SuperPred: update on drug classification and target prediction. Nucleic Acids Res. 42, W26–31 (2014).
CAS PubMed PubMed Central Google Scholar
Pinto, J. P., Machado, R. S. R., Xavier, J. M. & Futschik, M. E. Targeting molecular networks for drug research. Front. Genet . 5, 160 (2014).
PubMed PubMed Central Google Scholar
Yamanishi, Y. et al. DINIES: drug-target interaction network inference engine based on supervised analysis. Nucleic Acids Res. 42, W39–45 (2014).
CAS PubMed PubMed Central Google Scholar
Csermely, P., Korcsmáros, T., Kiss, H. J. M., London, G. & Nussinov, R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol. Ther. 138, 333–408 (2013).
CAS PubMed PubMed Central Google Scholar
Loscalzo, J. & Barabasi, A.-L. Systems Biology and the Future of Medicine. Wiley Interdiscip Rev Syst Biol Med 3, 619–627 (2011).
PubMed PubMed Central Google Scholar
Yildirim, M. a, Goh, K.-I., Cusick, M. E., Barabási, A.-L. & Vidal, M. Drug-target network. Nat. Biotechnol. 25, 1119–26 (2007).
CAS PubMed Google Scholar
Vidal, M., E. Cusick, M. & Barabási, A.-L. Interactome Networks and Human Disease. Cell 144, 986–998 (2011).
CAS PubMed PubMed Central Google Scholar
Emig, D. et al. Drug target prediction and repositioning using an integrated network-based approach. PLoS One 8, e60618 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Alaimo, S., Pulvirenti, A., Giugno, R. & Ferro, A. Drug-target interaction prediction through domain-tuned network-based inference. Bioinformatics 29, 2004–8 (2013).
CAS PubMed PubMed Central Google Scholar
Milenković, T., Memišević, V., Bonato, A. & Pržulj, N. Dominating biological networks. PLoS One 6, e23016 (2011).
ADS PubMed PubMed Central Google Scholar
Caberlotto, L. & Nguyen, T.-P. A systems biology investigation of neurodegenerative dementia reveals a pivotal role of autophagy. BMC Syst. Biol. 8, 65 (2014).
PubMed PubMed Central Google Scholar
Law, V. et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 42, D1091–7 (2014).
CAS PubMed Google Scholar
Magrane, M. & Consortium, U. UniProt Knowledgebase: a hub of integrated protein data. Database (Oxford). 2011, bar009 (2011).
PubMed PubMed Central Google Scholar
Barabasi, A.-L., Gulbahce, N. & Loscalzo, J. Network Medicine: A Network-based Approach to Human Disease. Nat Rev Genet. 12, 56–68 (2011).
CAS PubMed PubMed Central Google Scholar
Brown, K. R. & Jurisica, I. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biol. 8, R95 (2007).
PubMed PubMed Central Google Scholar
Gene, T. et al. Gene Ontology: tool for the unification of biology. Nat Genet. 25, 25–29 (2000).
Google Scholar
Croft, D. et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 42, D472–7 (2014).
ADS CAS PubMed Google Scholar
Hunter, S. et al. InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 40, D306–12 (2012).
CAS PubMed Google Scholar
Domingos, P. Prospects and challenges for multi-relational data mining. ACM SIGKDD Explor. Newsl. 5, 1–4 (2003).
Google Scholar
Padhy, N. Multi Relational Data Mining Approaches: A Data Mining Technique. Int. J. Comput. Appl. 57, 23–32 (2012).
Google Scholar
Valencio, C. R. et al. MR-Radix: a multi-relational data mining algorithm. Human-centric Comput. Inf. Sci. 2, 1–17 (2012).
Google Scholar
Lavrac, N. Relational and Semantic Data Mining for Biomedical Research. Informatica 37, 35–39 (2013).
Google Scholar
Silva, A. & Antunes, C. Multi-relational pattern mining over data streams. Data Mining and Knowledge Discovery, 2014, 1–32 (2014).
MATH Google Scholar
Trajkovski, I., Zelezny, F., Tolar, J. & Lavrac, N. Relational Subgroup Discovery for Descriptive Analysis of Microarray Data, in Computational Life Sciences II, Vol 4216 (eds Berthold R. et al. ), 86–96 (Springer Berlin Heidelberg, 2006).
Google Scholar
Ji, Y., Shen, F. & Tran, J. A Multi-relational Association Mining Algorithm for Screening Suspected Adverse Drug Reactions, in Proc. 11th Int. Conf. Inf. Technol. New Gener, Las Vegas, NV., 407–412 (IEEE Press, April 2014).
Nguyen, T. P. & Ho, T. B. An Integrative Domain-based Approach to Predicting Protein-protein Interactions. J. Bioinform. Comput. Biol. 6, 1115–1132 (2008).
CAS PubMed Google Scholar
Nichol, M. B. et al. Fast Algorithms for Mining Association Rules. Ann. Pharmacother. 42, 62–70 (2008).
PubMed Google Scholar
Han, J., Pei, J. & Yin, Y. Frequent Pattern Tree: Design and Construction. in Proc. Conf. on the Management of Data (SIGMOD’00, Dallas, TX) 1–12 (ACM Press, 2000).
Hamosh, A., Scott, A. F., Amberger, J. S., Bocchini, C. a & McKusick, V. a. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 33, D514–7 (2005).
CAS PubMed Google Scholar
Eden, E., Navon, R., Steinfeld, I., Lipson, D. & Yakhini, Z. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics 10, 48 (2009).
PubMed PubMed Central Google Scholar
Supek, F., Bošnjak, M., Škunca, N. & Šmuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One 6, e21800 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. a. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1–13 (2009).
Google Scholar
Microsoft Corporation, SQL Server 2012 Tutorials: Analysis Services - Multidimensional Modeling. SQL Server Books Online. (2012). Available at download.microsoft.com/. (Accessed October 2014).
Bekkar, M., Djemaa, H. K. & Alitouche, T. A. Evaluation Measures for Models Assessment over Imbalanced Data Sets. J. Inf. Eng. Appl. 3, 27–38 (2013).
Google Scholar
Silva, A. & Antunes, C. Finding Multi-dimensional Patterns in Healthcare, in Machine Learning and Data Mining in Pattern Recognition, Vol. 8556, (eds Perner. P. ) 361–375 (Springer International Publishing, 2014).
Google Scholar
Vassar, R. Beta-Secretase Cleavage of Alzheimer’s Amyloid Precursor Protein by the Transmembrane Aspartic Protease BACE. Science. 286, 735–741 (1999).
CAS PubMed Google Scholar
Sun, M.-K. & Alkon, D. L. Activation of protein kinase C isozymes for the treatment of dementias. Adv. Pharmacol. 64, 273–302 (2012).
CAS PubMed Google Scholar
Candeias, E. et al. The impairment of insulin signaling in Alzheimer’s disease. IUBMB Life 64, 951–7 (2012).
CAS PubMed Google Scholar
De la Monte, S. M. Triangulated mal-signaling in Alzheimer’s disease: roles of neurotoxic ceramides, ER stress and insulin resistance reviewed. J. Alzheimers. Dis. 30 Suppl 2, S231–49 (2012).
ADS PubMed Google Scholar
Caberlotto, L., Lauria, M., Nguyen, T.-P. & Scotti, M. The central role of AMP-kinase and energy homeostasis impairment in Alzheimer’s disease: a multifactor network analysis. PLoS One 8, e78919 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Steen, E. et al. Impaired insulin and insulin-like growth factor expression and signaling mechanisms in Alzheimer’s disease--is this type 3 diabetes? J. Alzheimers. Dis. 7, 63–80 (2005).
CAS PubMed Google Scholar
Sebastião, I. et al. Insulin as a Bridge between Type 2 Diabetes and Alzheimer Disease - How Anti-Diabetics Could be a Solution for Dementia. Front. Endocrinol. (Lausanne). 5, 110 (2014).
PubMed PubMed Central Google Scholar
Munoz, L. & Ammit, A. J. Targeting p38 MAPK pathway for the treatment of Alzheimer’s disease. Neuropharmacology 58, 561–8 (2010).
CAS PubMed Google Scholar
Christopherson, K. S., Hillier, B. J., Lim, W. A. & Bredt, D. S. PSD-95 assembles a ternary complex with the N-methyl-D-aspartic acid receptor and a bivalent neuronal NO synthase PDZ domain. J. Biol. Chem. 274, 27467–73 (1999).
CAS PubMed Google Scholar
Tezuka, T., Umemori, H., Akiyama, T., Nakanishi, S. & Yamamoto, T. PSD-95 promotes Fyn-mediated tyrosine phosphorylation of the N-methyl-D-aspartate receptor subunit NR2A. Proc. Natl. Acad. Sci. U. S. A. 96, 435–40 (1999).
ADS CAS PubMed PubMed Central Google Scholar
Migaud, M. et al. Enhanced long-term potentiation and impaired learning in mice with mutant postsynaptic density-95 protein. Nature 396, 433–9 (1998).
ADS CAS PubMed Google Scholar
Rampon, C. et al. Effects of environmental enrichment on gene expression in the brain. Proc. Natl. Acad. Sci. U. S. A. 97, 12880–4 (2000).
ADS CAS PubMed PubMed Central Google Scholar
Vitale, M. et al. Proteomic characterization of a mouse model of familial Danish dementia. J. Biomed. Biotechnol. 2012, 1–8 (2012).
Google Scholar
Bach, A. et al. A high-affinity, dimeric inhibitor of PSD-95 bivalently interacts with PDZ1-2 and protects against ischemic brain damage. Proc. Natl. Acad. Sci. U. S. A. 109, 3317–22 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Hopkins, A. L. & Groom, C. R. The druggable genome. Nat. Rev. Drug Discov. 1, 727–30 (2002).
CAS PubMed Google Scholar
Blaukat, A. Structure and signalling pathways of kinin receptors. Andrologia 35, 17–23 (2003).
CAS PubMed Google Scholar
Prediger, R. D. S. et al. Genetic deletion or antagonism of kinin B(1) and B(2) receptors improves cognitive deficits in a mouse model of Alzheimer’s disease. Neuroscience 151, 631–43 (2008).
CAS PubMed Google Scholar
Kanehisa, M. et al. Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 42, D199–D205 (2014).
CAS PubMed Google Scholar

Download references

Acknowledgements

We are grateful to Bianca Baldacci for helping us to improve the quality of the figures.

Author information

Authors and Affiliations

The Microsoft Research, University of Trento Centre for Computational Systems Biology (COSBI), Piazza Manifattura 1, Rovereto, 38068, Italy
Thanh-Phuong Nguyen, Corrado Priami & Laura Caberlotto
Life Sciences Research Unit, University of Luxembourg, 162 A, avenue de la Faïencerie, L-1511, Luxembourg
Thanh-Phuong Nguyen
Department of Mathematics, University of Trento, Via Sommarive, 14-38123, Povo, Italy
Corrado Priami

Authors

Thanh-Phuong Nguyen
View author publications
Search author on:PubMed Google Scholar
Corrado Priami
View author publications
Search author on:PubMed Google Scholar
Laura Caberlotto
View author publications
Search author on:PubMed Google Scholar

Contributions

T.P.N., L.C., C.P. conceived and designed the study. T.P.N. and L.C. collected and analyzed the data. T.P.N., L.C., C.P. wrote the paper. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Tables

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Nguyen, TP., Priami, C. & Caberlotto, L. Novel drug target identification for the treatment of dementia using multi-relational association mining. Sci Rep 5, 11104 (2015). https://doi.org/10.1038/srep11104

Download citation

Received: 07 January 2015
Accepted: 13 May 2015
Published: 08 July 2015
DOI: https://doi.org/10.1038/srep11104

This article is cited by

Investigation of naphthofuran moiety as potential dual inhibitor against BACE-1 and GSK-3β: molecular dynamics simulations, binding energy, and network analysis to identify first-in-class dual inhibitors against Alzheimer’s disease
- Akhil Kumar
- Gaurava Srivastava
- Ashok Sharma
Journal of Molecular Modeling (2017)