Developing a nomogram model for predicting non-obstructive azoospermia using machine learning techniques

Xiao, Hong; Ding, Yi-lang; Wang, Chao; Yang, Peng; Chen, Qiang; He, Hao-nan; Yao, Ruijie; Huang, Hai-lin; Chen, Xi; Wang, Mao-yuan; Tang, Song-xi; Zhou, Hui-liang

doi:10.1038/s41598-025-88387-5

Published: 14 February 2025

Scientific Reports volume 15, Article number: 5468 (2025)

Abstract

Azoospermia, defined by the absence of sperm in the ejaculate, manifests as obstructive azoospermia (OA) or non-obstructive azoospermia (NOA). Reliable predictive models utilizing biomarkers could aid in clinical decision-making. This study included 352 azoospermia patients, with 152 diagnosed with OA and 200 with NOA. The data were randomly divided into a training set (244 cases) and a validation set (108 cases) for machine learning analysis. The training set was utilized for univariate and multivariate logistic regression to identify key predictors of NOA. Following this, nine machine learning methods were employed to refine the prediction model. A novel nomogram model was developed, and its predictive performance was evaluated using receiver operating characteristic curves, calibration plots, and decision curve analysis. Univariate and multivariate logistic regression analyses identified semen pH and follicle-stimulating hormone (FSH) as positive predictors of NOA, while mean testicular volume (MTV) and inhibin B (INHB) were negatively correlated with NOA. Among nine machine learning methods evaluated, the Gradient Boosting Decision Trees achieved the highest performance with an area under the curve (AUC) of 0.974, whereas Random Forest showed the lowest AUC at 0.953. The nomogram model, incorporating these four factors, demonstrated robust predictive performance with AUCs of 0.984 in the training set and 0.976 in the validation set. Calibration and decision curve analysis confirmed the model’s accuracy and clinical utility.
Optimal cut-off points for biomarkers were identified: FSH at 7.50 IU/L (AUC = 0.96), INHB at 43.45 pg/ml (AUC = 0.95), MTV at 9.92 ml (AUC = 0.91), and semen pH at 6.95 (AUC = 0.71). The novel nomogram model incorporating FSH, INHB, MTV, and pH effectively predicts NOA in patients. This model offers a valuable tool for personalized diagnosis and management of azoospermia.

Introduction

Azoospermia, the absence of sperm in ejaculate, affects approximately 1% of men during their peak reproductive years. It represents one of the most severe forms of male infertility^1,2. Approximately 40% of azoospermia cases result from normal testicular spermatogenic function hindered by reproductive tract blockages³. The remaining cases involve insufficient or absent sperm production within the testes, categorizing the condition into obstructive (OA) and non-obstructive azoospermia (NOA). Distinguishing between OA and NOA is crucial for optimizing treatment strategies, as the two conditions require significantly different approaches. However, current diagnostic practices, particularly those in resource-limited settings, present several challenges.

In clinical practice, distinguishing between obstructive azoospermia (OA) and non-obstructive azoospermia (NOA) typically requires invasive procedures, such as testicular biopsies or sperm retrieval techniques, which may not be accessible in primary andrology or resource-limited settings. Although non-invasive methods, including hormonal assessments and ultrasound imaging, are commonly used, they often lack sufficient sensitivity and specificity to provide a definitive diagnosis. This underscores the urgent need for more accessible and reliable diagnostic tools, particularly in resource-constrained environments, to ensure timely and accurate diagnoses for all patients, regardless of geographic or economic barriers, and to guide optimal treatment decisions.

Treatment options for OA and NOA differ significantly. Microsurgical vasoepididymostomy (MVE) has emerged as a highly effective surgical intervention, leading to successful natural pregnancy in approximately 38% of treated patients⁴. For those who are unsuccessful in natural conception attempts, have limited ovarian reserve in their partner, opt against surgery, as well as OA patients with obstructions at other sites, conception may still be possible through intracytoplasmic sperm injection using percutaneous epididymal sperm aspiration (PESA) or percutaneous testicular sperm aspiration (TESA). In cases of NOA, microsurgical testicular sperm extraction (mTESE) or testicular sperm extraction (TESE) is an effective surgical approach to retrieve viable sperm^1,5,6. The testicular histopathology is considered a crucial predictive factor for the success of sperm retrieval^7,8.

However, testicular histopathology may only reflect the status of most seminiferous tubules in patients, as those with sertoli cell only syndrome (SCOS) can also obtain sperm through mTESE, thereby potentially predicting fertilization success prior to sperm retrieval surgery⁹. For NOA patients, the European Association of Urology guidelines recommend avoiding TESA or fine needle aspiration before mTESE or TESE, while the American Urological Association/American Society for Reproductive Medicine (AUA/ASRM) guidelines discourage pre-sperm retrieval diagnostic testicular biopsy^1,10. In contrast, percutaneous testicular or epididymal sperm aspiration proves effective for OA patients, although outcomes are not guaranteed until sperm are observed. Therefore, avoiding diagnostic testicular biopsy in NOA patients is essential, as the procedure is often uninformative. It highlights the importance of identifying potential NOA cases among azoospermic patients. According to AUA/ASRM guidelines, azoospermia patients with a testicular longitudinal axis < 4.6 cm, FSH > 7.6 IU/l, and/or semen volume > 0.5/1.0 ml are more likely to be diagnosed with NOA¹⁰. Nonetheless, different studies employ various biomarkers and cutoff values^11,12.

In recent years, machine learning has shown promise in clinical settings for improving diagnostic accuracy and predicting patient outcomes. However, predictive models for NOA are notably lacking, especially in primary healthcare institutions with limited diagnostic resources. This study seeks to apply machine learning techniques to basic male parameters for accurate identification of NOA among azoospermia patients. After developing a nomogram, we will perform comprehensive validation to assess its effectiveness. We expect that this approach will improve personalized diagnosis and treatment for patients with azoospermia.

Materials and methods

Study population and observational index

A retrospective study was conducted on 352 patients diagnosed with azoospermia, collected by the Department of Andrology and Sexual Medicine of the First Affiliated Hospital of Fujian Medical University from January 2020 to February 2024. This study received approval from the Medical Ethics Committee of the First Affiliated Hospital of Fujian Medical University (MRCTA, ECFAH of FMU [2019] 213 and [2020] 375) and was conducted in compliance with the principles outlined in the Declaration of Helsinki. Informed written consent was obtained from all participants.

Azoospermia was confirmed in all patients after more than three semen centrifugation procedures (3000 g, 15 min; Centrifuge 5425, Eppendorf, Hamburg, Germany), spaced at two-week intervals, yielded no detectable sperm¹³. The exclusion criteria included patients with hypogonadotropic hypogonadism, individuals under 14 years old, and those with incomplete clinical data. Detailed medical histories were obtained, including inquiries about orchitis, epididymitis, mumps, prior testicular trauma and surgery, and cryptorchidism status. Similarly, potential iatrogenic causes, such as the use of gonadotoxic medications, prior radiation exposure, and environmental exposure, were all taken into consideration. Patients with cryptorchidism had undergone orchiopexy at least six months prior to the study.

Testicular volume (TV) was determined using a Prader’s orchidometer by two experienced andrologists (Song-xi Tang and Hui-liang Zhou). Subsequently, the volumes of the left and right testes were summed and averaged to calculate the mean testicular volume (MTV) for each patient. Ejaculate volume and semen pH were averaged from multiple assessments. The clinical assessments included measuring serum levels of prolactin, follicle-stimulating hormone (FSH), luteinizing hormone (LH), estradiol, testosterone (T), and inhibin B (INHB) between 8:00 a.m. and 10:00 a.m. Varicocele was diagnosed through physical examination conducted by two seasoned andrologists (Hui-liang Zhou and Song-xi Tang) in conjunction with color Doppler ultrasound.

Patients initially diagnosed with NOA underwent karyotype analysis of peripheral blood and Y chromosome microdeletion analysis; whole-exome sequencing was deemed unnecessary. Patients being assessed for OA underwent evaluations utilizing color Doppler ultrasound (GE LOGIQ Fortis, GE Healthcare, Chicago, USA) or magnetic resonance imaging (Magnetom Prisma, Siemens, Munich, Germany) along with physical examinations to determine the site of obstruction.

Histopathological analysis of the testis

Histopathological examination of the testis involved embedding the testicular tissues in paraffin and serially sectioning them at a thickness of 4 μm for routine hematoxylin and eosin staining. The sections were observed under light microscopy (CKX31, Olympus, Tokyo, Japan). The condition characterized by the presence of only Sertoli cells was classified as SCOS. Maturation arrest (MA) was diagnosed when spermatogonia, primary spermatocytes, secondary spermatocytes, or spermatids were present without mature sperm. Hypospermatogenesis referred to testicular pathology where a small amount of mature sperm was observable. Diagnosis was based on the predominant histopathological pattern observed. Patients with testicular pathological conditions such as SCOS, MA, or hypospermatogenesis were classified as NOA, whereas specimens exhibiting normal or near-normal spermatogenesis were classified as OA¹⁴.

Statistical analysis

Statistical analysis was performed using SPSS software version 27.0 (IBM Corp., Armonk, NY, USA). Continuous variables were expressed as means ± standard deviation or medians with interquartile ranges, while categorical variables were presented as frequencies or percentages. The Shapiro-Wilk test was used to assess data distribution, and Levene’s test evaluated variance homogeneity. Non-normally distributed continuous variables were compared using the Mann-Whitney U test. Pearson’s chi-square and Fisher’s exact tests were used to compare rates between groups. Univariate and multivariate logistic regression analyses were conducted to identify independent risk factors. From the entire dataset, 70% was randomly selected for training the models, while the remaining 30% was allocated for testing.
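The random partition described above can be illustrated with a minimal Python sketch. It is not the authors' procedure, which the text does not specify beyond the 7:3 ratio: the function name, the seed, and the use of integer case identifiers are assumptions made for illustration, and the training-set size (244 of 352 cases) is taken from the reported split rather than computed.

```python
import random


def split_train_validation(case_ids, n_train, seed=2024):
    """Randomly partition case identifiers into training and validation lists.

    `seed` is an arbitrary illustrative value; fixing it makes the split repeatable.
    """
    rng = random.Random(seed)
    shuffled = list(case_ids)
    rng.shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


# 352 hypothetical case identifiers; 244 go to training, as reported in the study.
train_ids, validation_ids = split_train_validation(range(352), n_train=244)
print(len(train_ids), len(validation_ids))
```

Every case lands in exactly one of the two sets, so no validation case can leak into model fitting.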

The machine learning algorithms and nomogram were constructed within R version 4.2.3 (R Foundation for Statistical Computing, Vienna, Austria). We utilized nine distinct machine learning algorithms for modeling: Random Forest, Gradient Boosting Decision Trees (GBDT), XGBoost, LightGBM, Naive Bayes, Support Vector Machine (SVM), Logistic Regression, Decision Trees, and Neural Networks. The Random Forest model ensures reproducibility by optimizing the “mtry” hyperparameter using “tuneRF” and applying regularization through parameters such as “ntree = 500,” “nodesize = 5,” and “maxnodes = 30” to mitigate overfitting and enhance stability. The GBDT model optimizes hyperparameters via 5-fold cross-validation, tuning key parameters including “n.trees = 100,” “interaction.depth = 3,” “shrinkage = 0.05,” and “n.minobsinnode = 10” to balance complexity and reduce overfitting. Similarly, the XGBoost model employs 5-fold cross-validation with early stopping to mitigate overfitting. Parameters such as “max_depth = 6,” “eta = 0.1,” “subsample = 0.8,” “colsample_bytree = 0.8,” “min_child_weight = 1,” “lambda = 1,” and “alpha = 0” are tuned to manage complexity, with the optimal number of boosting rounds determined to ensure reproducibility. The LightGBM model follows a similar approach, using 5-fold cross-validation and early stopping (limited to 10 rounds) to prevent overfitting. Parameters such as “num_leaves = 31,” “learning_rate = 0.05,” “feature_fraction = 0.8,” “bagging_fraction = 0.8,” “bagging_freq = 5,” “lambda_l1 = 0.1,” “lambda_l2 = 0.1,” and “min_data_in_leaf = 20” are selected to balance model complexity and reproducibility by determining the optimal number of boosting rounds.
The Naive Bayes model is trained using 5-fold cross-validation, with hyperparameter tuning on the Laplace smoothing parameter (“laplace = 0, 0.5, 1”), kernel usage, and the adjustment factor (“adjust = 1”), ensuring reproducibility with consistent tuning parameters and cross-validation settings. The SVM model uses a radial basis function kernel, tuning hyperparameters such as “C” (cost) and “sigma” (gamma) through 5-fold cross-validation and grid search (“C = 0.1, 1, 10” and “sigma = 0.01, 0.05, 0.1”) to prevent overfitting and ensure reproducibility. The Logistic Regression classifier employs Lasso regularization (“alpha = 1”) and is trained via 5-fold cross-validation (“nfolds = 5”) to select the optimal regularization parameter (“lambda.min”), promoting model generalization and preventing overfitting through automatic hyperparameter tuning. Reproducible results are achieved based on a fixed dataset and consistent parameter settings. The Decision Tree model is tuned through 10-fold cross-validation, adjusting parameters such as “cp = 0.01,” “minsplit = 20,” and “maxdepth = 5” to prevent overfitting and ensure reproducibility by controlling complexity and optimizing generalization. Finally, the Neural Network model uses 10-fold cross-validation, tuning hyperparameters such as the number of hidden units (“size = 5, 10, 15”) and regularization strength (“decay = 0, 0.001, 0.01”) to prevent overfitting with L2 regularization, ensuring reproducibility through consistent parameter settings and cross-validation.
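To make the tuning procedure more concrete, the following Python sketch enumerates the SVM grid quoted above (“C = 0.1, 1, 10” and “sigma = 0.01, 0.05, 0.1”) and partitions a training set of 244 cases into five folds. It only illustrates the bookkeeping of grid search with cross-validation; the models in the study were fitted in R, and the helper name and the contiguous fold assignment are assumptions (in practice the cases would be shuffled first).

```python
from itertools import product


def k_fold_indices(n_cases, k):
    """Split indices 0..n_cases-1 into k folds whose sizes differ by at most one."""
    fold_sizes = [n_cases // k + (1 if i < n_cases % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


# Candidate hyperparameter combinations for the radial-basis SVM, as listed in the text.
svm_grid = [{"C": c, "sigma": s} for c, s in product([0.1, 1, 10], [0.01, 0.05, 0.1])]

folds = k_fold_indices(244, 5)

# Each candidate is fitted once per fold, with that fold held out for scoring.
n_model_fits = len(svm_grid) * len(folds)
print(len(svm_grid), [len(f) for f in folds], n_model_fits)
```

The candidate with the best mean held-out score across the folds would then be refitted on the full training set.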

The predictive performance was assessed using the area under the receiver operating characteristic (ROC) curve. Graphs were created using GraphPad Prism 10.0 (GraphPad Software Inc., San Diego, CA, USA). The calibration curve and decision curve analysis (DCA) were calculated using R software. Statistical significance was set at P < 0.05 for all two-tailed tests.
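For readers less familiar with the AUC, it can be computed as the probability that a randomly chosen NOA case receives a higher predicted score than a randomly chosen OA case, with ties counted as one half. The sketch below implements this pairwise definition in Python; the scores are synthetic values invented for illustration and are not patient data.

```python
def auc_from_scores(scores_positive, scores_negative):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    wins = 0.0
    for p in scores_positive:
        for n in scores_negative:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))


# Synthetic predicted probabilities of NOA (illustrative only).
noa_scores = [0.9, 0.8, 0.7, 0.4]
oa_scores = [0.6, 0.3, 0.2]

auc = auc_from_scores(noa_scores, oa_scores)
print(round(auc, 3))
```

Here 11 of the 12 pairs are ordered correctly, so the AUC is 11/12.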

Results

Basic information

A total of 352 patients with azoospermia were included in the study, comprising 152 (43.18%) with OA and 200 (56.82%) with NOA. Among the 152 OA patients, 48 (31.58%) had concomitant varicocele, including 34 with unilateral varicocele, 14 with bilateral varicocele, and 6 patients with severe varicocele on at least one side. Severe varicocele was defined through clinical physical examination as palpable varicocele, with ultrasound (GE LOGIQ Fortis, GE Healthcare, Chicago, USA) revealing a maximum inner diameter of at least 3.1 mm during quiet respiration, and reflux observed during the Valsalva maneuver lasting for at least 6 s.

Among the NOA patients, there were 30 cases of Klinefelter syndrome, 12 cases of Y chromosome microdeletion, 18 cases with prior surgery for bilateral cryptorchidism, 11 cases of acquired NOA, and 129 cases of idiopathic NOA. Of the 200 NOA patients, 73 (36.50%) were diagnosed with concomitant varicocele. Of these, 44 had unilateral varicocele, 29 had bilateral varicocele, and 9 exhibited severe varicocele on at least one side.

Machine learning methods analysis

The data was randomly divided into a training set and a validation set in a 7:3 ratio. The training set comprised 244 cases, with 141 cases categorized as NOA and 103 cases categorized as OA. The validation set included 108 cases, with 59 categorized as NOA and 49 as OA. Statistical analysis demonstrated no significant difference between the two groups (all P > 0.05; Supplementary Table 1).

After performing both univariate and multivariate logistic regression analyses on the training set, we observed that semen pH and FSH were positively correlated with NOA, while MTV and INHB were negatively correlated with NOA (Table 1). Subsequently, we conducted a binary logistic regression including only these four factors. This analysis revealed that semen pH (OR = 12.30, P = 0.019) and FSH (OR = 1.59, P < 0.001) were positively correlated with NOA. In contrast, MTV (OR = 0.73, P = 0.013) and INHB (OR = 0.98, P = 0.019) were negatively correlated with NOA.
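The odds ratios above are easier to interpret once converted to the scale of the logistic model. The Python sketch below assumes, as is conventional but not stated explicitly in the text, that each OR refers to a one-unit increase in the predictor (1 IU/L for FSH, 1 pg/ml for INHB, 1 ml for MTV, 1 pH unit). Under that assumption the regression coefficient is ln(OR), and a change of several units multiplies the odds by OR raised to that number of units. The function names are illustrative.

```python
import math

# Odds ratios reported for the four-factor logistic regression.
odds_ratios = {"semen_pH": 12.30, "FSH": 1.59, "MTV": 0.73, "INHB": 0.98}


def coefficient(odds_ratio):
    """Logistic regression coefficient implied by a per-unit odds ratio."""
    return math.log(odds_ratio)


def odds_multiplier(odds_ratio, units):
    """Factor by which the odds of NOA change for a `units` increase (assumed per-unit OR)."""
    return math.exp(coefficient(odds_ratio) * units)


fsh_plus_5 = odds_multiplier(odds_ratios["FSH"], 5)      # FSH higher by 5 IU/L
inhb_plus_50 = odds_multiplier(odds_ratios["INHB"], 50)  # INHB higher by 50 pg/ml
print(f"FSH +5 IU/L: odds x {fsh_plus_5:.2f}; INHB +50 pg/ml: odds x {inhb_plus_50:.2f}")
```

Under this reading, an OR of 0.98 for INHB looks small per pg/ml but compounds to a reduction in odds to roughly 0.36 of the original over 50 pg/ml, while a 5 IU/L rise in FSH raises the odds roughly tenfold.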

Table 1 Predictors of non-obstructive azoospermia in azoospermia patients via logistic regression analysis.

Full size table

We further analyzed and optimized this model based on these four factors using nine machine learning methods: Random Forest, GBDT, XGBoost, LightGBM, Naive Bayes Classifier, SVM, Logistic Classifier, Decision Trees, and Neural Networks. The performance of each machine learning model was compared using AUC, sensitivity, and specificity (Table 2). In the validation of the nine machine learning methods, GBDT achieved the highest AUC (0.974), reflecting its superior classification capability. In contrast, Random Forest had the lowest AUC at 0.953, indicating comparatively weaker performance. DeLong’s test further showed a significant difference in AUC only between GBDT and Random Forest (P < 0.05); the AUC values of the other seven models did not differ significantly from that of GBDT (P > 0.05).

Table 2 Comparison of the performance of nine machine learning methods.

Full size table

Establish and validate the nomogram model

Following this, a comprehensive multivariate analysis led to the development of a novel nomogram model (Fig. 1). This model incorporates four independent variables to predict NOA in azoospermia patients. Notably, it identifies FSH as the primary predictor among these variables.
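The way a nomogram assigns points can be sketched from the logistic coefficients. In the Python example below, the odds ratios are those reported in the text for the four-factor regression, but the predictor ranges are hypothetical placeholders chosen only for illustration, and the per-unit interpretation of each OR is an assumption. The predictor whose coefficient multiplied by its range is largest receives the full 0 to 100 point axis, and the others are scaled relative to it. Because the model intercept is not reported here, the sketch stops at points and does not map total points to a probability.

```python
import math

# Odds ratios reported for the four-factor logistic regression.
odds_ratios = {"semen_pH": 12.30, "FSH": 1.59, "MTV": 0.73, "INHB": 0.98}

# HYPOTHETICAL axis ranges (not from the study), used only to illustrate the scaling.
hypothetical_ranges = {
    "semen_pH": (6.0, 8.0),
    "FSH": (0.0, 60.0),     # IU/L
    "MTV": (1.0, 25.0),     # ml
    "INHB": (0.0, 300.0),   # pg/ml
}


def axis_spans(ors, ranges):
    """Maximum change in log-odds each predictor can contribute over its range."""
    return {name: abs(math.log(ors[name])) * (ranges[name][1] - ranges[name][0])
            for name in ors}


def nomogram_points(name, value, ors, ranges):
    """Points for one predictor value; zero points at the lowest-risk end of its range."""
    beta = math.log(ors[name])
    low, high = ranges[name]
    reference = low if beta > 0 else high
    return 100.0 * abs(beta) * abs(value - reference) / max(axis_spans(ors, ranges).values())


spans = axis_spans(odds_ratios, hypothetical_ranges)
widest = max(spans, key=spans.get)
print(widest, round(nomogram_points("FSH", 7.5, odds_ratios, hypothetical_ranges), 1))
```

With these placeholder ranges the FSH axis is the longest, which is consistent with FSH being described as the primary predictor, although the actual axis lengths depend on the observed ranges in the training data.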

The ROC curves show AUCs of 0.984 in the training set and 0.976 in the validation set, demonstrating strong predictive performance of the nomogram model (Fig. 2a-b). Moreover, the calibration curves of the nomogram showed good agreement between predicted and observed outcomes, and the P-values of the Hosmer-Lemeshow test were greater than 0.05, indicating good model fit (Fig. 2c-d). Furthermore, DCA indicated a higher net benefit from employing the predictive model in azoospermia patients (Fig. 2e-f).
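The net benefit plotted in a decision curve follows a standard formula: at a threshold probability pt, net benefit equals TP/n minus (FP/n) multiplied by pt/(1 − pt), and it is compared against the strategies of treating every patient as NOA or none. The Python sketch below applies this formula to synthetic predictions invented for illustration; it does not reproduce the curves in Fig. 2e-f.

```python
def net_benefit(predicted_probabilities, labels, threshold):
    """Net benefit of calling a case NOA when its predicted probability >= threshold."""
    n = len(labels)
    tp = sum(1 for p, y in zip(predicted_probabilities, labels) if p >= threshold and y == 1)
    fp = sum(1 for p, y in zip(predicted_probabilities, labels) if p >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


def net_benefit_treat_all(labels, threshold):
    """Net benefit of labelling every patient as NOA."""
    prevalence = sum(labels) / len(labels)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Synthetic predictions (1 = NOA, 0 = OA); illustrative only.
probabilities = [0.95, 0.90, 0.80, 0.60, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 1, 0, 1, 0, 0, 0]

model_nb = net_benefit(probabilities, labels, threshold=0.5)
treat_all_nb = net_benefit_treat_all(labels, threshold=0.5)
print(model_nb, treat_all_nb)
```

A model is considered useful at a given threshold when its net benefit exceeds both the treat-all value and zero, which is the net benefit of treating none.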

The cut-off points were determined for several biomarkers: FSH at 7.50 IU/L (AUC = 0.96, Sensitivity = 91.50%, Specificity = 93.20%); INHB at 43.45 pg/ml (AUC = 0.95, Sensitivity = 100.0%, Specificity = 82.30%); MTV at 9.92 ml (AUC = 0.91, Sensitivity = 99.00%, Specificity = 75.90%); and semen pH at 6.95 (AUC = 0.71, Sensitivity = 97.90%, Specificity = 42.70%).
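The text does not state the criterion used to select these cut-off points. One common choice, assumed here purely for illustration, is the Youden index (sensitivity + specificity − 1), maximized over candidate thresholds. The Python sketch below applies it to a small set of synthetic FSH-like values that are not patient data; the function name is hypothetical.

```python
def best_cutoff_by_youden(values, labels):
    """Return (cutoff, J, sensitivity, specificity) maximizing the Youden index.

    A case is called positive when its value is >= cutoff (higher value = higher risk).
    Ties in J keep the lowest cutoff.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    best = None
    for cutoff in sorted(set(values)):
        tp = sum(1 for v, y in zip(values, labels) if v >= cutoff and y == 1)
        tn = sum(1 for v, y in zip(values, labels) if v < cutoff and y == 0)
        sensitivity = tp / positives
        specificity = tn / negatives
        j = sensitivity + specificity - 1
        if best is None or j > best[1]:
            best = (cutoff, j, sensitivity, specificity)
    return best


# Synthetic marker values and diagnoses (1 = NOA, 0 = OA); illustrative only.
marker = [3.1, 4.5, 5.2, 6.8, 8.0, 12.4, 18.9, 25.3]
diagnosis = [0, 0, 0, 1, 0, 1, 1, 1]

cutoff, youden_j, sens, spec = best_cutoff_by_youden(marker, diagnosis)
print(cutoff, youden_j, sens, spec)
```

For markers where lower values indicate NOA, such as INHB and MTV, the same search can be run on the negated values.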

In the validation set, NOA prediction rates were computed using the cutoff value of each of the four indicators: INHB < 43.45 pg/ml gave 100.00% (48/48), FSH > 7.5 IU/L gave 96.30% (52/54), MTV < 9.92 ml gave 95.00% (38/40), and pH > 6.95 gave 63.74% (58/91). The semen pH cutoff had a significantly lower NOA prediction rate in the validation set than each of the other three biomarkers (all P < 0.05).
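The prediction rates above can be read as the proportion of patients flagged by each cutoff who were in fact diagnosed with NOA, which is an interpretation of the reported fractions rather than a definition given in the text. The short Python sketch below recomputes the percentages from the reported counts.

```python
# (NOA cases among flagged patients, all flagged patients) in the validation set, as reported.
flagged_counts = {
    "INHB < 43.45 pg/ml": (48, 48),
    "FSH > 7.5 IU/L": (52, 54),
    "MTV < 9.92 ml": (38, 40),
    "pH > 6.95": (58, 91),
}

prediction_rates = {
    rule: round(100.0 * noa / flagged, 2)
    for rule, (noa, flagged) in flagged_counts.items()
}

for rule, rate in prediction_rates.items():
    print(f"{rule}: {rate:.2f}%")
```

The semen pH rule flags 91 of the 108 validation cases, far more than the other rules, which explains why its rate is much lower even though it captures most NOA cases.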

Discussion

Identifying potential NOA patients among azoospermic individuals is crucial for developing effective treatment strategies. Because diagnostic testicular biopsy is discouraged in azoospermic patients, predicting NOA from non-invasive clinical parameters is necessary, although it poses a significant challenge for andrologists. Our study employed nine machine learning algorithms, all of which exhibited robust predictive capabilities for NOA in the validation set, thereby affirming the validity of our model. Among the four basic male fertility indicators (FSH, INHB, MTV, and semen pH), FSH was found to be the most effective predictor.

FSH, a hormone secreted by the anterior pituitary gland, acts on Sertoli cells expressing FSH receptor. It is recognized for its role in stimulating mitotic and meiotic deoxyribonucleic acid synthesis in spermatogonia and preleptotene spermatocytes, and promoting cell survival through its interactions with Sertoli cells¹⁵. FSH serves as a key regulator of Sertoli cell numbers during testicular development, with the quantity of Sertoli cells produced in the perinatal period determining the germ cell count in adult males¹⁶. In adulthood, the size of the testes is influenced by circulating FSH levels, which correlate with the number of spermatogenic cells^15,17. In previous studies, the critical values of FSH ranged from 5.9 to 10.895 IU/l^11,14,18,19,20,21. However, we observed that the two papers from North America reported FSH critical values of 5.9 and 7.6 IU/l, which are the lowest among the six papers^14,21. The remaining studies reported values higher than these two and all involved Asian populations^11,18,19,20. Similarly, Regent et al.¹² described a TV threshold of 12.4 ml for NOA and OA among 45 individuals of European descent. In contrast, the TV threshold derived from our training set of 244 Asian individuals was 9.92 ml. Shamohammadi et al.¹¹ have noted differences in the thresholds for NOA and OA among different populations, suggesting that variations in testicular volume thresholds may stem from ethnic and racial differences. Andrologists should adjust their diagnostic criteria based on racial and regional differences when assessing azoospermic patients, highlighting the necessity of refining azoospermia parameters for each population group.

Other parameters contribute to our nomogram prediction model. INHB, secreted by Sertoli cells, exerts a direct inhibitory effect on FSH. Primarily functioning through negative feedback on the pituitary gland, INHB not only regulates FSH levels but also serves as an indicator of sperm production to some degree²². Prior research^22,23,24 has thoroughly investigated the predictive role of INHB in the outcomes of NOA patients undergoing mTESE, and low levels of INHB and elevated FSH have been associated with reduced sperm count^25,26. Zhang et al.²⁰ described an INHB threshold of 41.5 pg/ml in a retrospective study of 70 Asian individuals. Following calculations for NOA predictions in our validation set, the INHB threshold from our study (43.45 pg/ml) showed no significant statistical difference. This may be attributed to the nearly identical ethnic and regional composition of the populations included in both studies. Our study found that NOA patients often have a “normal” semen pH value, which has been rarely discussed in previous research. The majority of semen components are secretions originating from the seminal vesicle and prostate gland, with semen pH primarily reflecting the acid-base equilibrium between the alkaline secretion from the seminal vesicle and the acidic secretion from the prostate gland^9,27. In individuals with NOA, where the seminal ducts remain unobstructed, ejaculate volume and semen pH typically align closely with normal values. However, in cases of OA, variations in ejaculate volume and semen pH are occasionally observed, depending on the site of the obstruction. In instances of bilateral ejaculatory duct obstruction, the presence of alkaline seminal vesicle fluid in the semen is minimal, consequently yielding a relatively low pH value.
While some patients with bilateral epididymal obstruction may exhibit normal ejaculate volume and pH values, patients with epididymal obstruction due to congenital developmental defects may also have seminal vesicle dysplasia²⁸. According to our predictive model, semen pH, as a fundamental indicator, may offer some value in distinguishing certain types of azoospermia, potentially aiding andrologists in identifying these patients. However, we must acknowledge that, whether in terms of predictive efficacy or accuracy rates for NOA, semen pH performs less effectively than other parameters in the predictive model, particularly FSH.

Many other parameters hold potential in distinguishing OA and NOA. Particularly when FSH and INHB levels are at a borderline threshold, it becomes difficult to definitively categorize the patient as either OA or NOA. In addition to requiring high discriminatory sensitivity and specificity, the selection of biomarkers must also account for convenience, affordability, and accessibility. Jarow et al. found that semen volume is significantly higher in patients with testicular failure compared to those with OA²⁹. Gao et al. also discovered that combining FSH with plasma leptin and serum leptin enhances the discriminatory power between OA and NOA more effectively than using FSH alone¹⁹. Foresta et al. observed fewer intratesticular vessels in NOA patients compared to both OA patients and normal populations using testicular color Doppler ultrasound³⁰. Du et al. reported a significantly higher incidence of seminal duct abnormalities in OA patients (92.2% vs. 2.8%) through scrotal and transrectal ultrasonography of azoospermic patients³¹. Li et al. identified a significantly higher strain ratio in NOA patients compared to OA patients using real-time elastography; however, there is a lack of an effective evaluation of the predictive value of real-time elastography for NOA³². One study evaluated the apparent diffusion coefficient (ADC) using magnetic resonance imaging and found that ADC values in the testes of OA patients were significantly lower than in NOA patients, but the predictive accuracy of this method was limited¹². Hesham et al.’s prospective study revealed that besides ADC, the normalized apparent diffusion coefficient, choline, and lipids from diffusion-weighted magnetic resonance imaging and proton magnetic resonance spectroscopy effectively differentiate between OA and NOA patients. However, this method not only requires advanced medical facilities but also involves high costs and long waiting times³³.
While some tsRNAs have shown potential in differentiating between OA and NOA, their sensitivity and specificity remain inferior to the new model we have developed³⁴. Additionally, there are emerging biomarkers such as seminal plasma cysteine-rich secretory protein 1³⁵ and seminal plasma testis-expressed protein 101³⁶. Furthermore, Fietz et al. proposed that testis-specific proteins L-lactate dehydrogenase C chain, Phosphoglycerate kinase 2, Dipeptidase 3, as well as reproductive cell-enriched heat shock proteins Heat shock-related 70 kDa protein 2 and Heat shock 70 kDa protein 4L, hold promise as biomarkers for spermatogenic function based on seminal plasma proteomic analysis³⁷. In general, there is still a long way to go in distinguishing between OA and NOA.

The limitations of this study are primarily attributed to several factors. Firstly, there is a lack of external validation within patient cohorts, which limits the generalizability and robustness of the predictive models. Secondly, although the classification method used for defining NOA and OA is the most widely accepted in the scientific community, there remains an ongoing debate regarding their precise definitions. Lastly, the model’s applicability is further restricted by the absence of sufficiently reliable and reproducible imaging techniques that could enhance its accuracy and convenience. These limitations highlight the need for future research, including multicenter investigations, to address these issues and improve the model’s broader applicability.

In conclusion, we utilized nine machine learning algorithms and fundamental male fertility parameters to predict NOA. After validation, our predictive model and nomogram demonstrated accurate identification of NOA. Moving forward, we aim to develop an online calculator that incorporates additional factors to further refine our model’s performance in personalized medicine.

Data availability

The data supporting the findings of this study can be obtained upon request from the corresponding author and are not publicly accessible due to privacy or ethical constraints.

References

Minhas, S. et al. European Association of Urology Guidelines on male sexual and Reproductive Health: 2021 update on male infertility. Eur. Urol. 80 (5), 603–620 (2021).
Article PubMed MATH Google Scholar
Takeshima, T. et al. Clinical management of nonobstructive azoospermia: an update. Int. J. Urol. 31 (1), 17–24 (2024).
Article PubMed MATH Google Scholar
Eisenberg, M. L. et al. Male infertility. Nat. Rev. Dis. Primers. 9 (1), 49 (2023).
Article MathSciNet PubMed Google Scholar
Xiao, H. et al. Comparative evaluation of double- and single-armed two-suture longitudinal intussusception techniques in microsurgical vasoepididymostomy: an updated systematic review and meta-analysis. PLoS One. 19 (2), e0298019 (2024).
Article PubMed PubMed Central CAS Google Scholar
Silber, S. J. et al. Normal pregnancies resulting from testicular sperm extraction and intracytoplasmic sperm injection for azoospermia due to maturation arrest. Fertil. Steril. 66 (1), 110–117 (1996).
Article PubMed CAS Google Scholar
Corona, G. et al. Sperm recovery and ICSI outcomes in men with non-obstructive azoospermia: a systematic review and meta-analysis. Hum. Reprod. Update. 25 (6), 733–757 (2019).
Article PubMed MATH Google Scholar
Lantsberg, D., Mizrachi, Y. & Katz, D. J. Micro-testicular sperm extraction outcomes for non-obstructive azoospermia in a single large clinic in Victoria. Aust N Z. J. Obstet. Gynaecol. 62 (2), 300–305 (2022).
Article PubMed Google Scholar
Zhou, L. et al. The expression of Beclin-1 in testicular tissues of non-obstructive azoospermia patients and its predictive value in sperm retrieval rate. Transl. Androl. Urol. 10 (8), 3267–3274 (2021).
Article PubMed PubMed Central MATH Google Scholar
Kalsi, J., Thum, M. Y., Muneer, A., Abdullah, H. & Minhas, S. In the era of micro-dissection sperm retrieval (m-TESE) is an isolated testicular biopsy necessary in the management of men with non-obstructive azoospermia? BJU Int. 109 (3), 418–424 (2012).
Article PubMed Google Scholar
Schlegel, P. N. et al. Diagnosis and treatment of infertility in men: AUA/ASRM guideline part I. Fertil. Steril. 115 (1), 54–61 (2021).
Article PubMed MATH Google Scholar
Shamohammadi, I. et al. Evaluation of azoospermic patients to Distinguish Obstructive from Non-obstructive Azoospermia, and necessity of Diagnostic Testis Biopsy: a retrospective study. Int. J. Fertil. Steril. 16 (3), 156–161 (2022).
Regent, B., Skrobisz, K., Kozak, O., Matuszewski, M. & Studniarek, M. MRI in the evaluation of the azoospermic male. Diagn. Interv. Radiol. 26 (4), 271–276 (2020).
World Health Organization. WHO Laboratory Manual for the Examination and Processing of Human Semen. 5th edition. (2010).
Christman, M. S. et al. Operating characteristics of follicle-stimulating hormone in azoospermic men. Fertil. Steril. 101 (5), 1261–1265 (2014).
Oduwole, O. O., Huhtaniemi, I. T. & Misrahi, M. The roles of luteinizing hormone, follicle-stimulating hormone and testosterone in spermatogenesis and folliculogenesis revisited. Int. J. Mol. Sci. 22 (23), 12735 (2021).
Xu, H. Y., Zhang, H. X., Xiao, Z., Qiao, J. & Li, R. Regulation of anti-Müllerian hormone (AMH) in males and the associations of serum AMH with the disorders of male fertility. Asian J. Androl. 21 (2), 109–114 (2019).
Allan, C. M. et al. Complete Sertoli cell proliferation induced by follicle-stimulating hormone (FSH) independently of luteinizing hormone activity: evidence from genetic models of isolated FSH action. Endocrinology 145 (4), 1587–1593 (2004).
Huang, I. S., Huang, W. J. & Lin, A. T. Distinguishing non-obstructive azoospermia from obstructive azoospermia in Taiwanese patients by hormone profile and testis size. J. Chin. Med. Assoc. 81 (6), 531–535 (2018).
Gao, L. et al. [Leptin level in azoospermic patients and its clinical value]. Zhonghua Nan Ke Xue. 17 (6), 492–497 (2011).
Zhang, W. X., Wang, R. & Li, P. Q. [Concentration of seminal plasma and serum inhibin B: a predictor in the diagnosis of azoospermia]. Zhonghua Nan Ke Xue. 13 (7), 598–600 (2007).
Schoor, R. A., Elhanbly, S., Niederberger, C. S. & Ross, L. S. The role of testicular biopsy in the modern management of male infertility. J. Urol. 167 (1), 197–200 (2002).
Deng, C. et al. Inhibin B-to-anti-Müllerian hormone ratio as noninvasive predictors of positive sperm retrieval in idiopathic non-obstructive azoospermia. J. Clin. Med. 12 (2), 500 (2023).
Huang, X. et al. Combination of serum inhibin B and follicle-stimulating hormone levels can not improve the diagnostic accuracy on testicular sperm extraction outcomes in Chinese non-obstructive azoospermic men. Chin. Med. J. 125 (16), 2885–2889 (2012).
Vernaeve, V. et al. Serum inhibin B cannot predict testicular sperm retrieval in patients with non-obstructive azoospermia. Hum. Reprod. 17 (4), 971–976 (2002).
Jensen, T. K. et al. Inhibin B as a serum marker of spermatogenesis: correlation to differences in sperm concentration and follicle-stimulating hormone levels. A study of 349 Danish men. J. Clin. Endocrinol. Metab. 82 (12), 4059–4063 (1997).
Negri, F. et al. The importance of discordant follicle stimulating hormone and inhibin B levels in primary infertile men: findings from a cross-sectional study. World J. Mens Health (2024).
Harraway, C., Berger, N. G. & Dubin, N. H. Semen pH in patients with normal versus abnormal sperm characteristics. Am. J. Obstet. Gynecol. 182 (5), 1045–1047 (2000).
Cai, Z. & Li, H. Congenital bilateral absence of the vas deferens. Front. Genet. 13, 775123 (2022).
Jarow, J. P., Espeland, M. A. & Lipshultz, L. I. Evaluation of the azoospermic patient. J. Urol. 142 (1), 62–65 (1989).
Foresta, C. et al. Doppler ultrasound of the testis in azoospermic subjects as a parameter of testicular function. Hum. Reprod. 13 (11), 3090–3093 (1998).
Du, J. et al. Differential diagnosis of azoospermia and etiologic classification of obstructive azoospermia: role of scrotal and transrectal US. Radiology 256 (2), 493–503 (2010).
Li, M., Du, J., Wang, Z. Q. & Li, F. H. The value of sonoelastography scores and the strain ratio in differential diagnosis of azoospermia. J. Urol. 188 (5), 1861–1866 (2012).
Hesham Said, A., Ragab, A., Zohdy, W., Ibrahim, A. S. & Abd El Basset, A. S. Diffusion-weighted magnetic resonance imaging and magnetic resonance spectroscopy for non-invasive characterization of azoospermia: a prospective comparative single-center study. Andrology 11 (6), 1096–1106 (2023).
Han, X. et al. Seminal plasma extracellular vesicles tRF-Val-AAC-010 can serve as a predictive factor of successful microdissection testicular sperm extraction in patients with non-obstructive azoospermia. Reprod. Biol. Endocrinol. 20 (1), 106 (2022).
Légaré, C. et al. Cysteine-rich secretory protein 1 in seminal plasma: potential biomarker for the distinction between obstructive and nonobstructive azoospermia. Fertil. Steril. 100 (5), 1253–1260 (2013).
Korbakis, D. et al. Preclinical evaluation of a TEX101 protein ELISA test for the differential diagnosis of male infertility. BMC Med. 15 (1), 60 (2017).
Fietz, D. et al. Proteomic biomarkers in seminal plasma as predictors of reproductive potential in azoospermic men. Front. Endocrinol. (Lausanne). 15, 1327800 (2024).

Funding

This work was supported by the Fujian Provincial Health Technology Project (grant number 2023CXA016) and the Startup Fund for Scientific Research of Fujian Medical University (grant number 2019QH1113).

Author information

Hong Xiao and Yi-lang Ding contributed equally to this work.

Authors and Affiliations

Department of Andrology and Sexual Medicine, First Affiliated Hospital of Fujian Medical University, Fuzhou, 350005, China
Hong Xiao, Yi-lang Ding, Chao Wang, Peng Yang, Qiang Chen, Hao-nan He, Ruijie Yao, Hai-lin Huang, Xi Chen, Mao-yuan Wang, Song-xi Tang & Hui-liang Zhou

Contributions

HX, YLD, CW, SXT, and HLZ made substantial contributions to the concept and design of the study. HX, YLD, CW, PY, QC, HNH, RJY, and MYW participated in data collection. HX, YLD, CW, HLH, XC, SXT, and HLZ participated in analysis and interpretation. HX, YLD, CW, SXT, and HLZ conducted the study. All authors were involved in reading and revising the manuscript. HX, SXT, and HLZ made major revisions to the manuscript. HX and YLD contributed equally to this study as co-first authors. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Hui-liang Zhou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Xiao, H., Ding, Yl., Wang, C. et al. Developing a nomogram model for predicting non-obstructive azoospermia using machine learning techniques. Sci Rep 15, 5468 (2025). https://doi.org/10.1038/s41598-025-88387-5

Received: 16 August 2024
Accepted: 28 January 2025
Published: 14 February 2025
Version of record: 14 February 2025
DOI: https://doi.org/10.1038/s41598-025-88387-5