Predictive model of castration resistance in advanced prostate cancer by machine learning using genetic and clinical data: KYUCOG-1401-A study

Shiota, Masaki; Nemoto, Shota; Ikegami, Ryo; Tatarano, Shuichi; Kamoto, Toshiyuki; Kobayashi, Keita; Sakai, Hideki; Igawa, Tsukasa; Kamba, Tomomi; Fujimoto, Naohiro; Yokomizo, Akira; Naito, Seiji; Eto, Masatoshi

doi:10.1038/s44276-024-00093-3

Download PDF

Article
Open access
Published: 09 September 2024

Predictive model of castration resistance in advanced prostate cancer by machine learning using genetic and clinical data: KYUCOG-1401-A study

Masaki Shiota¹,
Shota Nemoto²,
Ryo Ikegami²,
Shuichi Tatarano³,
Toshiyuki Kamoto⁴,
Keita Kobayashi⁵,
Hideki Sakai⁶,
Tsukasa Igawa⁷,
Tomomi Kamba⁸,
Naohiro Fujimoto⁹,
Akira Yokomizo¹⁰,
Seiji Naito¹⁰ &
…
Masatoshi Eto¹

BJC Reports volume 2, Article number: 69 (2024) Cite this article

2448 Accesses
3 Citations
Metrics details

Abstract

Background

The predictive power of the treatment efficacy and prognosis in primary androgen deprivation therapy (ADT) for advanced prostate cancer is not satisfactory. The objective of this study was to integrate genetic and clinical data to predict castration resistance in primary ADT for advanced prostate cancer by machine learning (ML).

Methods

Clinical and single nucleotide polymorphisms (SNP) data obtained in the KYUCOG-1401-A study (UMIN000022852) that enrolled Japanese patients with advanced prostate cancer were used. All patients were treated with primary ADT. A point-wise linear (PWL) algorithm, logistic regression with elastic-net regularization, and eXtreme Gradient Boosting were the ML algorithms used in this study. Area under the curve for castration resistance and C-index for prognoses were calculated to evaluate the utility of the models.

Results

Among the three ML algorithms, the area under the curve values to predict castration resistance at 2 years was highest for the PWL algorithm with all the datasets. Three predictive models (clinical model, small SNPs model, and large SNPs model) were created by the PWL algorithm using the clinical data alone, and 2 and 46 SNPs in addition to clinical data. C-indices for overall survival by the clinical, small SNPs, and large SNPs models were 0.636, 0.621, and 0.703, respectively.

Conclusion

The results demonstrated that the SNPs models created by ML produced excellent prediction of castration resistance and prognosis in primary ADT for advanced prostate cancer, and will be helpful in treatment choice.

Integrative single-cell and spatial transcriptomics with explainable AI reveal lethal prognostic axis in prostate cancer

Article Open access 06 January 2026

Development of machine learning prognostic models for overall survival of prostate cancer patients with lymph node-positive

Article Open access 27 October 2023

Machine-learning predicts time-series prognosis factors in metastatic prostate cancer patients treated with androgen deprivation therapy

Article Open access 18 April 2023

Background

Androgen deprivation therapy (ADT) is widely used as the backbone therapy for advanced prostate cancer [1]. Current intensive therapies for metastatic prostate cancer include radiation, docetaxel, and novel androgen receptor signaling inhibitors, such as abiraterone, darolutamide, enzalutamide, and apalutamide, in addition to ADT [2]. Furthermore, triplet combination therapy, which adds such novel androgen receptor signaling inhibitors to ADT plus docetaxel, has recently been shown to prolong survival in metastatic prostate cancer [3, 4]. Therefore, prognostic estimation in ADT can help in choosing the best treatment. However, prognostic estimation in ADT using clinical parameters such as prostate-specific antigen (PSA), Gleason score, and TNM category is not satisfactory for producing C-indices of >0.7 [5,6,7]. Therefore, novel predictive models to more precisely estimate the response to ADT for advanced prostate cancer are needed.

Genetic background has been suggested to affect the efficacy and prognosis in ADT for prostate cancer, as indicated by different outcome among different ethnicities and consistent outcome within families [8,9,10]. Over the past few decades, genome wide association studies (GWAS) have discovered associations between single nucleotide polymorphisms (SNPs) and various features [11]. In a previous study, we investigated the association between the SNPs and prognosis in Japanese patients undergoing primary ADT for advanced prostate cancer by GWAS [12]. In that study, two SNPs, rs76237622 in PRR27 and rs117573572 in MTAP, were validated to be associated with prognosis in ADT, but their predictive ability was not satisfactory [12].

Machine learning (ML) is a statistics-free approach that uses algorithms to identify patterns in rich and unwieldy data [13]. ML can resolve complex datasets of high dimensionality such as genomic data [14, 15]. In this study, we aimed to integrate genetic and clinical data that were obtained in the previous study [12], to predict castration resistance in primary ADT for advanced prostate cancer by ML.

Methods

Study population

Japanese patients with de novo advanced prostate cancer (TanyN1M0 or TanyNanyM1) enrolled in the KYUCOG-1401-A study (UMIN000022852) that was conducted in conjunction with a prospective multi-institutional clinical trial (KYUCOG-1401; UMIN000014243, jRCTs071180035) were included in this study. Inclusion and exclusion criteria for the KYUCOG-1401-A study have been described previously [12]. Patients (n = 8) censored before 2 years were excluded from this study. In the KYUCOG-1401 study, patients were randomized to receive gonadotropin-releasing hormone (GnRH) antagonist (degarelix) or agonist (leuprorelin or goserelin) plus the antiandrogen bicalutamide [16]. This study was conducted in accordance with the Declaration of Helsinki and the Japanese Ethical Guidelines for Medical and Health Research Involving Human Subjects. Eligible patients provided written informed consent. This study was approved by the Kyushu University review board (23087-00).

Clinical data

Clinicopathological information and efficacy in treatment data were collected prospectively using an electronic data capture system, as described previously [12]. Progression was defined as PSA progression (defined as PSA level of 2.0 ng/mL or higher, a rise of 50% or more from the lowest value, and three consecutive increases in PSA measured at least one week apart) or radiographic progression, as described previously [12, 17, 18]. For the analysis of progression-free survival (PFS), cancer-specific survival (CSS), and overall survival (OS), progression or death from any cause, death from prostate cancer, and death from any cause were defined as events, respectively. Patients who did not experience any of these events were censored at the last follow-up visit. For the survival analysis, the number of days from enrollment to the earliest event or censoring date was calculated. Patients who progressed to castration resistance at 2 years were defined as non-responders and patients who did not were defined as responders. Risk stratification by J-CAPRA risk score was performed as described previously [19].

Genetic data

Genetic data were obtained as described previously [12]. Genomic DNA was genotyped using a Japonica Array v2 according to the manufacturer’s instructions (Thermo Fisher Scientific, Waltham, MA, USA) [20,21,22]. This Axiom Array was customized for the Japanese genome by the Tohoku Medical Megabank Organization. Genotype calling was conducted using Genotyping Console software v4.2 (Thermo Fisher Scientific). We used the PSA-PFS at 2years-associated 2 and 46 SNPs with p < 1.0 × 10⁻⁵ and p < 1.0 × 10⁻⁴ that were identified in a previous study [12].

Construction of simple prediction scores

The variables were classified as binary (1 or −1) or quantitative. Quantitative variables were normalized by subtracting the mean value and dividing by the standard deviation. Missing values were set to 0.

The prediction models were constructed using three ML algorithms (point-wise linear algorithm, logistic regression with elastic-net regularization algorithm, and eXtreme Gradient Boosting) and three datasets (clinical, clinical and 2 SNPs, and clinical and 46 SNPs). Details of the clinical dataset are provided in Supplementary Table 1. The point-wise linear (PWL) algorithm is a deep learning-based algorithm that was implemented using PyTorch 1.5.1 and Python 3.7.4 [23]. The PWL algorithm uses a deep (multi-layered) neural network structure that generates a logistic regression model for each sample; i.e., a weight vector tailored to each sample. The importance of each feature is computed using its weight vector. Deep unified networks were used to construct the deep neural network in which the network layers and neurons are connected in a mesh-like structure that reduces the risk of overfitting [24]. The logistic regression with elastic-net regularization (LR) algorithm and eXtreme Gradient Boosting (XGBoost) were used to build the baseline models (implemented using scikit-learn v0.24.2, xgboost 1.0.2, and Python 3.7.4) [25]. The best hyper-parameter of each model was determined by 5-fold cross validation using the discovery cohort [23]. The prediction performance of each model was calculated by area under the curve (AUC) and evaluated using the validation cohort and the models fitted by the best hyper-parameter.

The important features to predict castration resistance were determined based on an importance score that was calculated using the weight vector of the PWL algorithm. The sample-wise importance score was calculated as described previously [26]. Importance score was defined by the rate at which a sample was ranked in the top 10% of features with sample-wise importance scores. Parameters with importance scores ≥0.1 were extracted as important features. An importance score of 0.1 indicated that at least 10% of the samples had parameters that were in the top 10% of important features. Simple prediction scores were constructed using important features and the sign of the median of sample-wise weights in those features. We used the original values for the variables in the simple prediction scores. The prediction performance of the simple prediction scores was evaluated by AUC using discovery and validation cohorts.

Estimation of effect by genetic background among different ethnic populations

Allele frequency data were obtained from the 1000 Genomes Project (https://www.internationalgenome.org/home). Estimated effect was the sum of the value for each SNP calculated as: coefficient × 2 × (minor allele frequency) × (1 − minor allele frequency) + 2 × coefficient × (minor allele frequency)².

Statistical analyses

Statistical analyses were performed using JMP16 software (SAS Institute, Cary, NC, USA). Continuous and categorical data are presented as median with interquartile range and number with percentage, respectively. The association among the categorical data was analyzed by the chi-square test. Survival analysis was performed using the Kaplan–Meier method and log-rank test. Harrell’s C-index was calculated using Stata v18 (College Station, TX, USA) as described previously [7]. All P-values were two-sided, and P-values < 0.05 were considered significant for all the analyses.

Results

Patients assignment

A total of 119 patients were included in the study, and were divided randomly in a 7:3 ratio into discovery (n = 82) and validation (n = 37) cohorts. Clinical parameters of the patients in each cohort are provided in Supplementary Table 1. Several clinical parameters including Gleason score, extent of disease grade, PSA level, and hemoglobin value were different between non-responders and responders in the discovery cohort (Supplementary Table 1). In addition, history of cerebral infarction, total type I procollagen-N-propeptide (P1NP), white blood cell count, and neutrophil count were higher in non-responders compared with their levels in responders in the discovery cohort (Supplementary Table 1).

Predictive ability of castration resistance by three ML algorithms using genetic and clinical data

The ability of three ML algorithms (PWL, LR, XGBoost) to predict castration resistance using genetic and clinical data was evaluated. Using only the clinical data (Supplementary Table 1) to predict castration resistance at 2 years, the AUC, sensitivity, and specificity in the discovery cohort were 0.710–0.785, 0.568–0.704, and 0.644–0.689, respectively (Table 1). In the validation cohort, AUC, sensitivity, and specificity were 0.720–0.786, 0.688–0.875, and 0.579–0.684, respectively (Table 1). When the two SNPs associated with PSA-PFS at 2 years with p < 1.0 × 10⁻⁵ were used together with the clinical parameters, AUC, sensitivity, and specificity in the discovery cohort slightly improved to 0.796–0.810, 0.700–0.754, and 0.689–0.778, respectively (Table 1). In the validation cohort, AUC, sensitivity, and specificity also slightly improved to 0.701–0.878, 0.625–0.750, and 0.684–0.789, respectively (Table 1). Finally, when the 46 SNPs associated with PSA-PFS at 2 years with p < 1.0 × 10⁻⁴ were used together with the clinical parameters, AUC, sensitivity, and specificity in greatly improved in the discovery cohort to 0.962–0.988, 0.600–0.864, and 0.978–0.978, respectively (Table 1). In the validation cohort, AUC, sensitivity, and specificity also greatly improved to 0.984–1.000, 0.875–0.938, and 1.000–1.000, respectively (Table 1).

Table 1 Predictive performance by machine learning methods using the indicated parameters.

Full size table

Model creation to predict castration resistance using genetic and clinical data by ML

The PWL algorithm produced the highest AUC values in the three ML algorithms in the discovery and validation cohorts, with the exception of the clinical model in the discovery cohort (Table 1). Therefore, we created a prediction model for castration resistance using genetic and clinical data by the PWL algorithms.

When critical parameters associated with castration resistance at 2 years were used, 12 clinical parameters were identified (Table 2). Besides the known predictive factors, Gleason score, PSA level, N-category, albumin level, and total testosterone level, other factors including comorbidity with hypertension, total cholesterol level, lymphocyte ratio, blood urea nitrogen level, comorbidity with dyslipidemia, and aspartate aminotransferase level were also identified as critical parameters (Table 2). When the predictive model (clinical model) was created using the formula in Supplementary Table 2, the AUCs in the discovery and validation cohorts were 0.730 (95% CI, 0.610–0.849) and 0.585 (95% CI, 0.383–0.787), respectively (Fig. 1A). When prediction scores calculated by the clinical model were divided quarterly (Q1–Q4), the ranges were −1.42267 to 6.86566 in Q1, 6.901361–8.656931 in Q2, 8.7444–10.01411 in Q3, and 10.06535–13.23641 in Q4. The clinical model correctly predicted 21/27 (77.8%) in Q1 and 16/27 (59.3%) in Q4 to be non-responders and responders, respectively (p = 0.0049, Fig. 1B).

Table 2 Important features in clinical model.

Full size table

**Fig. 1: Predictive models of castration resistance at 2 years created by machine learning using clinical and genetic parameters.**

When the two SNPs (p < 1.0 × 10⁻⁵) were added to the clinical parameters, six clinical parameters and two SNPs were identified to be critical to predict castration resistance at 2 years (Table 3). Besides the known predictive factors, Gleason score and extent of disease grade, other factors including comorbidity with diabetes mellitus, creatine kinase level, total P1NP level, and lymphocyte ratio were also identified as critical parameters (Table 3). When the predictive model (small SNPs model) was created using the formula in Supplementary Table 2, the AUCs in the discovery and validation cohorts improved to 0.857 (95% CI, 0.756–0.959) and 0.852 (95% CI, 0.706–0.998), respectively (Fig. 1C). When scores calculated by small SNPs model were divided quarterly (Q1–Q4), the ranges were −1.73 to 2.71 in Q1, 2.71–4.76 in Q2, 5.01–7.43 in Q3, and 7.6–13.24 in Q4. The small SNPs model discriminated responders and non-responders; 27/28 (96.4%) in Q1 and 27/28 (96.4%) in Q4 were correctly predicted to be non-responders and responders, respectively (p < 0.0001, Fig. 1D).

Table 3 Important features in small SNPs model.

Full size table

When the 46 SNPs (p < 1.0 × 10⁻⁴) were added to the clinical parameters, 4 clinical parameters and 19 SNPs were identified to be critical to predict castration resistance at 2 years (Table 4). Besides the known predictive factors, M-category and Gleason score, other factors including total bilirubin level and glucose level were also identified as critical parameters (Table 4). When the predictive model (large SNPs model) was created using the formula in Supplementary Table 2, the AUCs in the discovery and validation cohorts were prominently improved to 0.920 (95% CI, 0.854–0.986) and 0.978 (95% CI, 0.932–1.000), respectively (Fig. 1E). When scores calculated by the large SNPs model were divided quarterly (Q1–Q4), the ranges were 0.8439–5.716 in Q1, 5.834–7.5162 in Q2, 7.7932–9.5307 in Q3, 9.6555–14.3819 in Q4. The large SNPs model discriminated responders and non-responders; 19/21 (90.5%) in Q1 and 19/20 (95.0%) in Q4 were correctly predicted to be non-responder and responder, respectively (p < 0.0001, Fig. 1F).

Table 4 Important features in large SNPs model.

Full size table

Prognosis stratification by predictive models created by ML in advanced prostate cancer

We applied the three predictive models for prognosis stratification. PFS was significantly stratified by Q1–Q4 groups in all three models (Fig. 2A). The PFS was more prominently stratified in the small SNPs and large SNPs models than it was in the clinical model (Fig. 2A). The C-indices for PFS in the clinical, small SNPs, and large SNPs models were 0.617 (95% CI, 0.556–0.678), 0.727 (95% CI, 0.681–0.774), and 0.730 (95% CI 0.667–0.793), respectively (Supplementary Table 3). The CSS was also significantly stratified by the Q1–Q4 groups in all three models, but was stratified more prominently in the large SNPs model than it was in the clinical and small SNPs models (Fig. 2B). The C-indices for CSS in the clinical, small SNPs, and large SNPs models were 0.678 (95% CI, 0.546–0.809), 0.670 (95% CI, 0.551–0.790), and 0.781 (95% CI 0.671–0.890), respectively (Supplementary Table 3). The OS was significantly stratified by the Q1–Q4 groups only in the large SNPs model (Fig. 2C). The C-indices for OS in the clinical, small SNPs, and large SNPs models were 0.636 (95% CI, 0.520–0.753), 0.621 (95% CI, 0.512–0.731), and 0.703 (95% CI 0.583–0.822), respectively (Supplementary Table 3).

**Fig. 2: Prognosis stratification by predictive models of castration resistance at 2 years created by machine learning using clinical and genetic parameters.**

The preexisting well-known risk model, J-CAPRA risk group using TNM category, Gleason score, and PSA level stratified PFS, but not CSS and OS (Supplementary Fig. 1). The C-indices for PFS, CSS, and OS by J-CAPRA risk group were 0.588 (95% CI, 0.536–0.639), 0.602 (95% CI, 0.512–0.692), and 0.528 (95% CI 0.429–0.627), respectively (Supplementary Table 3).

Allele frequency by ethnicity and estimated effect of important SNPs in the large SNPs model on the response to ADT

The allele frequency of SNPs is known to differ among ethnic populations, which may affect the impact of genotype on outcomes. We investigated the allele frequencies of important SNPs in the large SNPs model. Minor allele frequency data for 16 SNPs were available in the 1000 Genomes Project, and differed among different ethnic groups as shown in Table 5. We estimated the effect of critical SNPs in the large SNPs model on the response to ADT. The estimated effect of the 16 SNPs was 0.94 in East Asians and −1.20 in Europeans, where a high value indicates higher probability of responder (Table 5).

Table 5 Minor allele frequency by ethnics and estimated effect on the response to androgen deprivation therapy.

Full size table

Discussion

The results also showed higher predictive ability when SNPs were used in addition to clinical parameters. Several risk models using clinical parameters have been developed to predict the response and prognosis of ADT. However, their predictive power is modest, as indicated by AUCs of <0.7 [5,6,7]. We also found that the clinical model had limited predictive power even when created by ML, although the C-index was modestly higher than that of previous risk models. The predictive power of the models created by ML was improved by adding small and large numbers of SNPs to the clinical parameters. In particular, the large SNPs model achieved C-indices >0.70 for PFS, CSS, and OS. Considering various previous predictive models failed to achieve C-indices >0.70, achieving higher prediction power of castration resistance at 2 years and prognosis by measuring 19 SNPs in addition to four clinical parameters would be valuable. Currently, intensive treatments have emerged as novel standard treatments for advanced prostate cancer [2]. Therefore, the large SNPs model will be helpful in choosing the best treatment for individual patients. Intensive treatment may be preferable if patient is a non-responder while de-escalated treatment may be preferable if patient is predicted to be a responder.

In addition, genetic parameters in the large SNPs model supported the ethnic differences of response to ADT. Several studies reported that Asians have a higher survival rate after primary ADT compared with that of Caucasians and African Americans [8, 9, 27]. Consistently, the score from allele frequencies of 16 SNPs in the large SNPs model indicated a higher possibility of responders in East Asians compared with that in people with European ancestry.

We identified various clinical and genetic parameters that were associated with response to ADT. Among 19 SNPs in the large SNPs model, rs1931229 was associated with the expression of TSPYL1, which is known to be a CYP17A1 and CYP3A4 regulator, by expression quantitative trait loci (eQTL) analysis, whereas rs941207 and rs2035081 were associated with HSD17B6 expression (data not shown). Because TSPYL1 and HSD17B6 are both involved in androgen synthesis, they are associated with the response to ADT through their role in this pathway [28, 29]. Preexistence of hypertension was associated with favorable response to ADT, whereas preexistence of diabetes mellitus and high glucose level were associated with unfavorable response to ADT. This is consistent with our previous finding that comorbidity with hypertension and diabetes mellitus were associated with longer and shorter survival in primary ADT, respectively [30, 31]. Higher total cholesterol level was also associated with better response to ADT, whereas preexistence of dyslipidemia was associated with poor response to ADT, implying a close relationship between ADT and lipid metabolism.

This study had several limitations. The sample size was relatively small. Although the SNPs models had excellent predictive performance in the Japanese population, future work is needed to explore the generalizability of the predictive performance of these SNPs models in other populations. Primary ADT alone is no longer a standard therapy, and utilized in a combination with other treatments such as androgen receptor signaling inhibitors. Conversely, a strong point of this study is that the clinical data were obtained from patients enrolled in a prospective trial, in which the treatment and testing schedules were subject to strict protocols.

Conclusion

Our results demonstrate that the SNPs models using clinical and genetic parameters created by a PWL algorism produced excellent prediction of castration resistance and prognosis in primary ADT for advanced prostate cancer. These models are expected to be helpful in treatment choice for advanced prostate cancer.

Data availability

The data sets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Shiota M, Eto M. Current status of primary pharmacotherapy and future perspectives toward upfront therapy for metastatic hormone-sensitive prostate cancer. Int J Urol. 2016;23:360–9.
Article PubMed Google Scholar
Blas L, Shiota M, Eto M. Current status and future perspective on the management of metastatic castration-sensitive prostate cancer. Cancer Treat Res Commun. 2022;32:100606.
Article PubMed Google Scholar
Fizazi K, Foulon S, Carles J, Roubaud G, McDermott R, Fléchon A, et al. Abiraterone plus prednisone added to androgen deprivation therapy and docetaxel in de novo metastatic castration-sensitive prostate cancer (PEACE-1): a multicentre, open-label, randomised, phase 3 study with a 2 × 2 factorial design. Lancet. 2022;399:1695–707.
Article CAS PubMed Google Scholar
Smith MR, Hussain M, Saad F, Fizazi K, Sternberg CN, Crawford ED, et al. Darolutamide and survival in metastatic, hormone-sensitive prostate cancer. N Engl J Med. 2022;386:1132–42.
Article CAS PubMed PubMed Central Google Scholar
Akamatsu S, Kubota M, Uozumi R, Narita S, Takahashi M, Mitsuzuka K, et al. Development and validation of a novel prognostic model for predicting everall survival in treatment-naïve castration-sensitive metastatic prostate cancer. Eur Urol Oncol. 2019;2:320–8.
Article PubMed Google Scholar
Zelic R, Garmo H, Zugna D, Stattin P, Richiardi L, Akre O, et al. Predicting prostate cancer death with different pretreatment risk stratification tools: a head-to-head comparison in a nationwide cohort study. Eur Urol. 2020;77:180–8.
Article PubMed Google Scholar
Shiota M, Terada N, Kitamura H, Kojima T, Saito T, Yokomizo A, et al. Novel metastatic burden-stratified risk model in de novo metastatic hormone-sensitive prostate cancer. Cancer Sci. 2021;112:3616–26.
Article CAS PubMed PubMed Central Google Scholar
Fukagai T, Namiki TS, Carlile RG, Yoshida H, Namiki M. Comparison of the clinical outcome after hormonal therapy for prostate cancer between Japanese and Caucasian men. BJU Int. 2006;97:1190–3.
Article CAS PubMed Google Scholar
Cooperberg MR, Hinotsu S, Namiki M, Carroll PR, Akaza H. Trans-Pacific variation in outcomes for men treated with primary androgen-deprivation therapy (ADT) for prostate cancer. BJU Int. 2016;117:102–9.
Article CAS PubMed Google Scholar
Hemminki K, Ji J, Försti A, Sundquist J, Lenner P. Concordance of survival in family members with prostate cancer. J Clin Oncol. 2008;26:1705–9.
Article PubMed Google Scholar
Tam V, Patel N, Turcotte M, Bossé Y, Paré G, Meyre D. Benefits and limitations of genome-wide association studies. Nat Rev Genet. 2019;20:467–84.
Article CAS PubMed Google Scholar
Shiota M, Tatarano S, Kamoto T, Matsuyama H, Sakai H, Igawa T, et al. Genome-wide association studies in advanced prostate cancer: KYUCOG-1401-A study. Endocr Relat Cancer. 2023;30:e230044.
Article CAS PubMed Google Scholar
Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat Methods. 2018;15:233–4.
Article CAS PubMed PubMed Central Google Scholar
Sigala RE, Lagou V, Shmeliov A, Atito S, Kouchaki S, Awais M, et al. Machine learning to advance human genome-wide association studies. Genes. 2023;15:34.
Article PubMed PubMed Central Google Scholar
Koido M. Polygenic modelling and machine learning approaches in pharmacogenomics: Importance in downstream analysis of genome-wide association study data. Br J Clin Pharmacol. 2023. https://doi.org/10.1111/bcp.15913.
Yokomizo A, Shiota M, Morokuma F, Eto M, Matsuyama H, Matsumoto H, et al. GnRH antagonist monotherapy versus a GnRH agonist plus bicalutamide for advanced hormone-sensitive prostate cancer; KYUCOG-1401. Int J Urol. 2024;31:362–9.
Article CAS PubMed Google Scholar
Heidenreich A, Bastian PJ, Bellmunt J, Bolla M, Joniau S, van der Kwast T, et al. EAU guidelines on prostate cancer. Part II: Treatment of advanced, relapsing, and castration-resistant prostate cancer. Eur Urol. 2014;65:467–79.
Article CAS PubMed Google Scholar
Scher HI, Halabi S, Tannock I, Morris M, Sternberg CN, Carducci MA, et al. Design and end points of clinical trials for patients with progressive prostate cancer and castrate levels of testosterone: recommendations of the Prostate Cancer Clinical Trials Working Group. J Clin Oncol. 2008;26:1148–59.
Article PubMed Google Scholar
Cooperberg MR, Hinotsu S, Namiki M, Ito K, Broering J, Carroll PR, et al. Risk assessment among prostate cancer patients receiving primary androgen deprivation therapy. J Clin Oncol. 2009;27:4306–13.
Article PubMed PubMed Central Google Scholar
Kawai Y, Mimori T, Kojima K, Nariai N, Danjoh I, Saito R, et al. Japonica array: improved genotype imputation by designing a population-specific SNP array with 1070 Japanese individuals. J Hum Genet. 2015;60:581–7.
Article CAS PubMed PubMed Central Google Scholar
Shiota M, Fujimoto N, Yamamoto Y, Takeuchi A, Tatsugami K, Uchiumi T, et al. Genome-wide association study of genetic variations associated with treatment failure after intravesical bacillus Calmette-Guérin therapy for non-muscle invasive bladder cancer. Cancer Immunol Immunother. 2020;69:1155–63.
Article CAS PubMed PubMed Central Google Scholar
Shiota M, Miyake H, Takahashi M, Oya M, Tsuchiya N, Masumori N, et al. Effect of genetic polymorphisms on outcomes following nivolumab for advanced renal cell carcinoma in the SNiP-RCC trial. Cancer Immunol Immunother. 2023;72:1903–15.
Article CAS PubMed PubMed Central Google Scholar
Kumagai S, Togashi Y, Kamada T, Sugiyama E, Nishinakamura H, Takeuchi Y, et al. The PD-1 expression balance between effector and regulatory T cells predicts the clinical efficacy of PD-1 blockade therapies. Nat Immunol. 2020;21:1346–58.
Article CAS PubMed Google Scholar
Golas SB, Shibahara T, Agboola S, Otaki H, Sato J, Nakae T, et al. A machine learning model to predict the risk of 30-day readmissions in patients with heart failure: a retrospective analysis of electronic medical records data. BMC Med Inform Decis Mak. 2018;18:44.
Article PubMed PubMed Central Google Scholar
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. 2016. https://doi.org/10.48550/arXiv.1603.02754
Shibahara T, Wada C, Yamashita Y, Fujita K, Sato M, Kuwata J, et al. Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified. PLoS One. 2023;18:e0286072.
Article CAS PubMed PubMed Central Google Scholar
Fujimoto N, Shiota M, Tomisaki I, Minato A. Gene polymorphism-related individual and interracial differences in the outcomes of androgen deprivation therapy for prostate cancer. Clin Genitourin Cancer. 2017;15:337–42.
Article PubMed Google Scholar
Qin S, Liu D, Kohli M, Wang L, Vedell PT, Hillman DW, et al. TSPYL family regulates CYP17A1 and CYP3A4 expression: potential mechanism contributing to abiraterone response in metastatic castration-resistant prostate cancer. Clin Pharmacol Ther. 2018;104:201–10.
Article CAS PubMed Google Scholar
Ishizaki F, Nishiyama T, Kawasaki T, Miyashiro Y, Hara N, Takizawa I, et al. Androgen deprivation promotes intratumoral synthesis of dihydrotestosterone from androgen metabolites in prostate cancer. Sci Rep. 2013;3:1528.
Article PubMed PubMed Central Google Scholar
Shiota M, Fujimoto N, Imada K, Kashiwagi E, Takeuchi A, Inokuchi J, et al. Prognostic impact of genetic polymorphism in mineralocorticoid receptor and comorbidity with hypertension in androgen-deprivation therapy. Front Oncol. 2018;8:635.
Article PubMed PubMed Central Google Scholar
Hirata Y, Shiota M, Kobayashi T, Kashiwagi E, Takeuchi A, Inokuchi J, et al. Prognostic significance of diabetes mellitus and dyslipidemia in men receiving androgen-deprivation therapy for metastatic prostate cancer. Prostate Int. 2019;7:166–70.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Dr. Kaori Yasuda and Dr. Atsushi Doi (Cell Innovator, Fukuoka, Japan) for excellent guidance regarding the GWAS analysis, the Clinical Research Support Center Kyushu (CReS Kyushu, Fukuoka, Japan) for their secretarial and supportive assistance, and Ms. Noriko Hakoda and Ms. Eriko Gunshima for technical assistance (Department of Urology, Kyushu University, Fukuoka, Japan). We thank the patients who volunteered to participate in this trial, as well as the following KYUCOG-1401-A investigators and trial staff: Noriaki Tokuda in the Saga-ken Medical Centre Koseikan (Saga), Kentaro Kuroiwa in the Miyazaki Prefectural Miyazaki Hospital (Miyazaki), Kazuya Kawahara in the Kawahara Nephro-Urology Clinic (Aira), Motonobu Nakamura in the National Hospital Organization Kyushu Cancer Center (Fukuoka), Ken Goto in the Japanese Red Cross Fukuoka Hospital (Fukuoka), Sumitaka Mitsu in the Kagoshima Prefectural Oshima (Amami), Masafumi Nagano in the Fujimoto General Hospital (Miyakonojo), Naotaka Sakamoto in the National Hospital Organization Kyushu Medical Center (Fukuoka), Youji Doman in the Saiseikai Sendai Hospital (Satsumasendai), Hironobu Wakeda in the Chiyoda Hospital (Hyuga), Yasufumi Nabekura in the Kumamoto Urological Hospital (Kumamoto), Kohei Mizuma in the National Hospital Organization Ibusuki Medical Center (Ibusuki), and Hisato Inatomi in the Munakata Suikokai General Hospital (Fukutsu). We thank Margaret Biswas, PhD, from Edanz (https://www.jp.edanz.com/ac) for editing a draft of this manuscript.

Funding

This work was supported by Astellas Investigator Sponsored Research to MS, which had no role in the design of this study and will not have any role during its execution, analyses, interpretation of the data, or decision to submit results. KYUCOG-1401 (UMIN000014243, jRCTs071180035) was supported by Astellas Pharma.

Author information

Authors and Affiliations

Department of Urology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
Masaki Shiota & Masatoshi Eto
Industrial & Digital Business Unit, Hitachi, Ltd., Tokyo, Japan
Shota Nemoto & Ryo Ikegami
Department of Urology, Graduate School of Medical and Dental Sciences, Kagoshima University, Kagoshima, Japan
Shuichi Tatarano
Department of Urology, Faculty of Medicine, Miyazaki University, Miyazaki, Japan
Toshiyuki Kamoto
Department of Urology, Graduate School of Medicine, Yamaguchi University, Ube, Japan
Keita Kobayashi
Department of Urology, Graduate School of Biomedical Sciences, Nagasaki University, Nagasaki, Japan
Hideki Sakai
Department of Urology, School of Medicine, Kurume University, Kurume, Japan
Tsukasa Igawa
Department of Urology, Kumamoto University, Kumamoto, Japan
Tomomi Kamba
Department of Urology, School of Medicine, University of Occupational and Environmental Health, Kitakyushu, Japan
Naohiro Fujimoto
Department of Urology, Harasanshin Hospital, Fukuoka, Japan
Akira Yokomizo & Seiji Naito

Authors

Masaki Shiota
View author publications
Search author on:PubMed Google Scholar
Shota Nemoto
View author publications
Search author on:PubMed Google Scholar
Ryo Ikegami
View author publications
Search author on:PubMed Google Scholar
Shuichi Tatarano
View author publications
Search author on:PubMed Google Scholar
Toshiyuki Kamoto
View author publications
Search author on:PubMed Google Scholar
Keita Kobayashi
View author publications
Search author on:PubMed Google Scholar
Hideki Sakai
View author publications
Search author on:PubMed Google Scholar
Tsukasa Igawa
View author publications
Search author on:PubMed Google Scholar
Tomomi Kamba
View author publications
Search author on:PubMed Google Scholar
Naohiro Fujimoto
View author publications
Search author on:PubMed Google Scholar
Akira Yokomizo
View author publications
Search author on:PubMed Google Scholar
Seiji Naito
View author publications
Search author on:PubMed Google Scholar
Masatoshi Eto
View author publications
Search author on:PubMed Google Scholar

Contributions

M Shiota: Conceptualization, funding acquisition, resources, formal analysis, visualization, writing–original draft. S Nemoto: formal analysis, visualization, writing–original draft. R Ikegami: formal analysis, visualization, writing–original draft. S Tatarano: Resources, writing–review and editing. T Kamoto: Resources, writing–review and editing. K Kobayashi: Resources, writing–review and editing. H Sakai: Resources, writing–review and editing. T Igawa: Resources, writing–review and editing. T Kamba: Resources, writing–review and editing. N Fujimoto: Resources, writing–review and editing. A Yokomizo: Resources, writing–review and editing. S Naito: Resources, writing–review and editing. M Eto: Resources, supervision, writing–review and editing.

Corresponding author

Correspondence to Masaki Shiota.

Ethics declarations

Competing interests

MS received honoraria from Janssen Pharmaceutical, AstraZeneca, Astellas Pharma, Sanofi, and Bayer and research funding support from Daiichi Sankyo. SN and RI are employees of Hitachi Co. Ltd. Toshiyuki Kamoto received research funding support from Janssen Pharmaceutical, Astellas Pharma, Shin Nippon Biomedical Laboratories, Ono Pharmaceutical, Bayer, Sanofi, and Takeda Pharmaceutical. HS received honoraria from Takeda Pharmaceutical and Astellas Pharma. TI received honoraria from Janssen Pharmaceutical and Astellas Pharma. TK received honoraria from AstraZeneca and Merck. AY received honoraria from Janssen Pharmaceutical and Astellas Pharma. ME received honoraria from Takeda Pharmaceutical, Janssen Pharmaceutical, AstraZeneca, and Astellas Pharma and research funding support from Astellas Pharma, Sanofi, and Takeda Pharmaceutical. MS is an Associate Editor at BJC Reports. He was not involved in any aspect of handling of this manuscript or any editorial decisions.

Ethics approval and consent to participate

This study was conducted in accordance with the Declaration of Helsinki and the Japanese Ethical Guidelines for Medical and Health Research Involving Human Subjects. Eligible patients provided written informed consent. This study was approved by the Kyushu University review board (23087-00).

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figure 1

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shiota, M., Nemoto, S., Ikegami, R. et al. Predictive model of castration resistance in advanced prostate cancer by machine learning using genetic and clinical data: KYUCOG-1401-A study. BJC Rep 2, 69 (2024). https://doi.org/10.1038/s44276-024-00093-3

Download citation

Received: 26 June 2024
Revised: 13 August 2024
Accepted: 20 August 2024
Published: 09 September 2024
Version of record: 09 September 2024
DOI: https://doi.org/10.1038/s44276-024-00093-3

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

Integrative single-cell and spatial transcriptomics with explainable AI reveal lethal prognostic axis in prostate cancer

Development of machine learning prognostic models for overall survival of prostate cancer patients with lymph node-positive

Machine-learning predicts time-series prognosis factors in metastatic prostate cancer patients treated with androgen deprivation therapy

Background

Methods

Study population

Clinical data

Genetic data

Construction of simple prediction scores

Estimation of effect by genetic background among different ethnic populations

Statistical analyses

Results

Patients assignment

Predictive ability of castration resistance by three ML algorithms using genetic and clinical data

Model creation to predict castration resistance using genetic and clinical data by ML

Prognosis stratification by predictive models created by ML in advanced prostate cancer

Allele frequency by ethnicity and estimated effect of important SNPs in the large SNPs model on the response to ADT

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval and consent to participate

Additional information

Supplementary information

Supplementary Figure 1

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links