Abstract
This paper explores the use of optimized convolutional neural networks (CNNs) to classify diseases affecting potato leaves using TensorFlow-2. The dataset, sourced from Kaggle’s Plant Village repository, includes 152 images of healthy potato leaves and 1000 images each of early and late blight. The methodology covers data preparation, model architecture design, training, evaluation, and deployment. During data preparation, the data set was split into training sets (80%) and testing sets (20%), with images resized to 128x128 pixels. The Deep Learning (DL) models built using CNN with 4 different optimizers (ADAM, SGD, RMSPROP, and ADAMAX) and trained using a sparse categorical cross-entropy loss function, include multiple convolutional and pooling layers for feature extraction, and fully connected layers for classification. Early stopping was used to prevent overfitting. Model performance was assessed using accuracy, loss curves, confusion matrix, ROC curve, precision recall curve, classification report, and F1 score. In addition, we have used data augmentation to balance the dataset by increasing healthy potato leaves 6 times and the use of Ensemble Deep Learning (EDL). EDL10 which contains DL1 (CNN + ADAM), DL2 (CNN + SGD), DL3 (CNN + RMSPROP) and DL4 (CNN + ADAMX) performs best with a accuracy score of 97.0%. This highlights the importance of data balancing and the use of the ensemble classification approach for the detection of blight in Potato Leaves.
Similar content being viewed by others
Code availabilitry
All data supporting the findings of this study are publicly available. The complete source code is hosted in a DOI-minting repository and has been archived on Zenodo to ensure long-term accessibility and reproducibility. The code is released under an open-source license. The archived version corresponding to this publication is available at : https://doi.org/10.5281/zenodo.19624017.
Funding
The project was funded by KAU Endowment (WAQF) at king Abdulaziz University, Jeddah, Saudi Arabia. The authors, therefore, acknowledge with thanks WAQF and the Deanship of Scientific Research (DSR) for technical and financial support.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing of interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Jain, A., Dubey, A.K., Singh, S.K. et al. Optimized CNN-based ensemble deep learning approach for potato leaf disease detection with data augmentation. Sci Rep (2026). https://doi.org/10.1038/s41598-026-50480-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-50480-8


