Enhancement of cryptography algorithms for security of cloud-based IoT with machine learning models

Qasem, Mohammed Ali; Motiram, Bokare Madhav; Thorat, Suryakant; Al-Hejri, Aymen M.; Alshamrani, Sultan S.; Alshmrany, Kaled M.

doi:10.1038/s41598-026-45938-8

Download PDF

Article
Open access
Published: 26 March 2026

Enhancement of cryptography algorithms for security of cloud-based IoT with machine learning models

Mohammed Ali Qasem¹,
Bokare Madhav Motiram²,
Suryakant Thorat³,
Aymen M. Al-Hejri⁴,
Sultan S. Alshamrani⁵ &
…
Kaled M. Alshmrany⁶

Scientific Reports volume 16, Article number: 10972 (2026) Cite this article

624 Accesses
Metrics details

Subjects

Abstract

The rapid expansion of cloud-based Internet of Things (IoT) systems has intensified security challenges due to the large-scale transmission of sensitive data from resource-constrained devices to cloud infrastructures. Conventional cryptographic techniques often impose high computational and memory overhead. Consequently, there is a critical need for security frameworks that balance strong data protection with efficient resource utilization while supporting intelligent threat detection. This study proposes an integrated security framework that combines lightweight and hybrid cryptographic algorithms with machine learning (ML) models to secure IoT data transmission in cloud-based environments. Four encryption techniques, XOR, ChaCha20, AES, and a hybrid AES–RSA scheme, are systematically evaluated in terms of memory consumption, CPU usage, and overall resource efficiency using the Overall Resource Consumption Score (ORCS). Secure data transmission is simulated using the MQTT protocol, while ML-based intrusion detection is performed using Random Forest (RF), XGBoost, CatBoost, and ensemble classifiers. Experiments are conducted on two real-world IoT datasets, MQTTEEB-D and CIC IoT 2023 for IoT network traffic. On the MQTTEEB-D dataset, the hybrid AES–RSA scheme achieved a low memory usage of 0.126 KB per traffic with an ORCS of 0.56, while the voting ensemble classifier attained the highest detection accuracy of 92.68%. On the CIC IoT 2023 dataset, comprising 605,839 test records, the hybrid AES–RSA method required 0.374 KB per traffic and achieved an ORCS of 0.5425, whereas the voting ensemble model achieved an accuracy of 81.09%. The findings demonstrate that hybrid cryptography provides an effective balance between security and efficiency for cloud-based IoT systems, while ensemble ML models significantly enhance intrusion detection performance.

Evaluating machine learning approaches for multiple attack classification with improved computational efficiency in IoT networks

Article Open access 14 November 2025

TTEA: designing a quantum-ready and energy-conscious encryption model for secure IoT environments

Article Open access 25 March 2026

ISAAF: an IoT security and attack prevention framework using AI-driven predictive analytics

Article Open access 29 December 2025

Introduction

The rapid proliferation of the Internet of Things (IoT) and the expansion of cloud computing have significantly transformed various industries, enabling enhanced efficiency, scalability, and connectivity^1,2. However, this technological advancement brings with it critical security challenges, particularly in ensuring the confidentiality, integrity, and authenticity of data transmitted between IoT devices and cloud environments^3,4. The sensitivity of the data generated by IoT devices, ranging from healthcare information to personal data, demands robust security measures to mitigate the risk of cyberattacks. As such, cryptography has emerged as a fundamental tool in safeguarding IoT systems against these threats, ensuring the privacy and integrity of data during transmission and storage⁵.

In IoT ecosystems, where devices are often constrained by limited computational resources, traditional cryptographic algorithms designed for powerful computing environments are not suitable due to their high memory and processing power requirements⁶. Lightweight cryptography provides an approach to encryption that ensures the security of IoT systems while minimizing the impact on the devices’ limited resources, such as memory and CPU power⁷. These constraints present a significant challenge, as encryption is necessary for securing data before transmission over potentially insecure networks.

Despite significant advancements in IoT infrastructure, securing large-scale IoT systems remains a major challenge due to the heterogeneity of devices, constrained computational resources, and continuous data transmission to cloud environments⁸. Modern IoT infrastructures generate massive volumes of sensitive data that are frequently transmitted over public or semi-trusted networks, making them attractive targets for cyberattacks. While cloud platforms offer scalability and computational power, they also increase the attack surface, particularly when data encryption and intrusion detection mechanisms are not optimized for resource-limited IoT devices⁹. Therefore, there is a growing need for security solutions that are specifically designed for cloud-based IoT infrastructures, ensuring strong data protection while maintaining low memory and CPU overhead.

Existing IoT security approaches focus on either cryptography or intrusion detection, but not both. Traditional cryptographic algorithms are often too resource-intensive for lightweight IoT devices. Meanwhile, machine learning–based intrusion detection systems require access to unencrypted data, compromising confidentiality. Many studies use small datasets and fail to assess encryption’s impact on memory and CPU, or consider secure transmission protocols like MQTT¹⁰. This highlights a gap in comprehensive frameworks that evaluate encryption, secure cloud transmission, and intelligent attack detection under realistic IoT constraints.

In cloud-based IoT networks, data generated by IoT devices is often transmitted to the cloud for processing, storage, and analysis. Therefore, ensuring the confidentiality and integrity of data while it is being transmitted between IoT devices and the cloud is a critical issue¹¹. Furthermore, the integration of ML algorithms into IoT environments can enhance security by enabling intelligent monitoring of network traffic, detecting anomalies, and identifying potential threats. However, the performance of these ML models is directly influenced by the efficiency of the underlying cryptographic algorithms. This study aims to address the lack of unified security frameworks in cloud-based IoT environments by proposing a framework that integrates lightweight and hybrid cryptographic algorithms (XOR, ChaCha20, AES, and AES-RSA) with ensemble machine learning models. Subsequently assessing their performance regarding resource utilization (memory and CPU consumption) and security effectiveness.

In the first part of this research, a comprehensive analysis is conducted on the MQTTEEB-D and CIC IoT 2023 datasets, which contain real-world IoT network traffic data^12,13. The encryption process is a first phase in this study, then integration of ML models such as Random Forest (RF), eXtreme Gradient Boosting (XGBoost), CatBoost, and ensemble models, are used in mitigating potential security threats in IoT networks. Developing effective malware detection techniques in IoT networks is important to ensure the privacy and security of IoT devices, networks, and end users^14,15. Additionally, explores the integration of the MQTT protocol for secure data transmission from IoT devices to the cloud. MQTT is a lightweight messaging protocol commonly used in IoT applications, offering efficient and reliable communication between devices^16,17.

This work present end-to-end security framework that simultaneously integrates lightweight and hybrid encryption, secure cloud transmission via MQTT, and ensemble machine learning-based intrusion detection within a single pipeline. Specifically, the contribution of this study is threefold.

A composite metric, the Overall Resource Consumption Score (ORCS), is introduced to jointly quantify memory and CPU trade-offs across four encryption algorithms (XOR, ChaCha20, AES, and hybrid AES–RSA), enabling objective, resource-aware algorithm selection for constrained IoT devices.
The MQTT protocol is incorporated as a realistic cloud transmission layer applied after encryption, closely simulating real-world IoT deployment conditions.
The ensemble ML classifiers are trained and evaluated on encrypted IoT traffic rather than raw network data, reflecting the operational reality in which detection systems must function alongside active encryption.
Experiments are conducted on two real-world IoT datasets, MQTTEEB-D and CIC IoT 2023, confirming that the proposed framework generalizes across different IoT traffic scenarios.

The rest of paper structure as follows: Sect. 2 provides a review of the related work. Section 3 outlines the methodology employed in the study. Section 4 presents the results of the research and the discussion. Finally, Sect. 5 concludes the paper by summarizing the key contributions and suggesting directions for future research.

Related work

The idea of hybrid cryptography, which combines encryption methods with AI, is very important for IoT environments that are based in the cloud. The AES and ChaCha20 algorithms were selected based on their NIST certification, specifically the FIPS PUB 197 standard for AES, ensuring their robustness and trustworthiness in modern cryptographic applications, including IoT¹⁸. For example, in the study¹⁹, proposed a healthcare IoT framework that uses fog computing and a hybrid mathematical model that combines Elliptic Curve Cryptography (ECC) and Proxy Re-encryption (PR) with the Enhanced Salp Swarm Algorithm (ESSA). This cuts processing time from 60 milliseconds to 18 milliseconds and increases reliability from 25% to 3%.

On the other hand, using ML to manage cryptographic keys is a big step forward for IoT security. K. Karimunda et al.²⁰, introduced a security framework that integrated the MQTT-based IoT protocol and device communication. A hybrid approach combining elliptic curve cryptography to ensure message confidentiality through encryption and ANNs utilized for anomaly detection and classification. Their framework achieved an accuracy of 90.38% in detecting and classifying anomalous using MQTTset dataset. The limitation is in lack of specific dataset and encryption method that suitable to embedded in IoT devices.

H. Nagarajan, et al.²¹, proposed an AI-driven cryptographic framework to enhance security in smart cloud environments. The framework combined symmetric, asymmetric, and homomorphic encryption methods to counter emerging risks and ensure data integrity, confidentiality, and operational efficiency. AI models are trained to detect threats in real-time to identify anomalies and address new vulnerabilities. The AI-driven achieved a high accuracy of 93.8% and encryption throughput of 520.7 operations per second.

B. Duc Manh et al.²², proposes a privacy-preserving model employing a combination of AI and homomorphic encryption. AI-driven placed at blockchain nodes, data from blockchain nodes is encrypted using homomorphic encryption before being sent to a cloud and training by DNN. The proposed method achieved detection accuracy nearly identical to unencrypted approaches, with a gap of approximately 0.01. Homomorphic encryption generally introduces significant computational overhead compared to operations on unencrypted data.

Darshan Ingle and Divyanka Ingle²³, proposed a novel model called BC-Trans Network ensuring a robust and tamper-proof authentication mechanism. Fully homomorphic encryption was employed on CSE-CIC-IDS2018 dataset and transformer model. The model achieved an accuracy of 99.25%, a precision of 99.53%, a recall of 99.32%, and an F1 score of 99.59%, with detection times of 225.3 s, but the detection for binary normal and abnormal attacks classification.

T. Aljrees et al.²⁴, proposed a paradigm that combines efficient data encryption, the quondam signature algorithm, and federated learning to enhance IoT security. The proposed scheme optimizes time complexity through a synergy of offline phase computations and online phase signature generation. The execution time taken for the proposed encryption algorithm is 0.034 in seconds. The limitations of this study are that it did not address the scalability problems in Internet of Things networks, and the practical challenges in implementing encryption and data transmission across diverse Internet of Things environments.

S. Selvarajan et al.²⁵, introduced a model that operated in data authentication and attack prevention using a lightweight blockchain algorithm, and attack classification using an AI mechanism. Consensus proof-of-work to ensure privacy and sprinter neural network to predict and classify attacks using NSL-KDD, DS2OS, and BOT-IoT, and UNSW-NB15 datasets. The proposed AILBSM framework reduced execution time, achieving a processing time of 0.6 s. The model achieved an overall classification accuracy of 99.8%. The limitation was still for encryption on the IoT devices levels.

M. Elkhodr et al.²⁶, proposed an AI-driven orchestration, advanced cryptographic techniques. The model combined classical and post-quantum algorithms with digital twins using a Markov model and hash-based signature scheme. Simulation results showed processing impact under 0.05% and memory usage under 0.1%, threat detection rates between 85% and 99%. Quantum-resistant cryptography not practically implemented in the simulations due to the absence of mature quantum-computing simulation tools.

M. Jarin et al.²⁷, proposed the use of Elliptic Curve Cryptography (ECC) for encrypting cloud data and transmission, employing the NSL-KDD dataset and ML models such as LightGBM and Random Forest (RF). For binary classification tasks, LightGBM achieved an accuracy of 98.71%, while RF achieved 98.51%. A limitation of the study is that it only evaluates binary classification tasks, without incorporating IoT-embedded algorithms or cloud simulations.

R. Yuvarani and R. Mahaveerakannan²⁸, employed a hybrid cryptographic combining symmetric (AES, Blowfish, Twofish) and asymmetric (ECC, RSA) encryption algorithms to secure IoT cloud banking environments. The proposed algorithm is evaluated through experimental simulations on an IoT cloud banking environment, with analysis against various attacks. The approach achieved 25% improvement in encryption throughput and 30% reduction in computational overhead versus standalone algorithms, but there was no AI training.

N. KASHYAP et al.²⁹, implemented a hybrid cryptosystem (ECC + AES) on Raspberry Pi for secure IoT data transmission to the cloud. The methodology combined ECC for secure key exchange AES for fast encryption and file uploads to AWS S3 buckets, demonstrating improved encryption. They achieved faster encryption mechanism compared to previous algorithms. Testing appears focused only on Raspberry Pi; generalizability to other IoT devices is unclear.

K. S. Prasad et al.³⁰, introduced the CASAE-POADMA methodology by integrated attention-based stacked autoencoders (ASAE) with a Pelican Optimization Algorithm (POA) for the detection and mitigation of cyberattacks. The results validated on benchmark datasets, demonstrate an impressive 99.50% accuracy in detecting and mitigating attacks. However, the validation is limited to these datasets, and there is no real-world IoT network testing included. Table 1 presents the literature summary with key details on cryptographic algorithms, AI techniques for the related studies.

Table 1 The overview summary of related work.

A Correct	B Correct	B Wrong
A Correct	a	b
A Wrong	c	d

Subjects

Abstract

Similar content being viewed by others

Evaluating machine learning approaches for multiple attack classification with improved computational efficiency in IoT networks

TTEA: designing a quantum-ready and energy-conscious encryption model for secure IoT environments

ISAAF: an IoT security and attack prevention framework using AI-driven predictive analytics

Introduction

Related work

Methodology

Research assumptions

Dataset description

MQTTEEB-D dataset

CIC IoT dataset 2023 dataset

Dataset preprocessing and feature selection

Dataset encryption

Light IoT encrypt (XOR)

ChaCha20 encryption algorithm

AES

A Hybrid AES and RSA algorithm

Encryption algorithms hyperparameters

Dataset splitting

AI models

Random forest

XgBoost classifier

CatBoost classifier

Ensemble voting classifier

Ensemble bagging classifier

ML models hyperparameters

Sending data via Cloud-MQTT

Evaluation

Performance metrics

Encryption algorithms metrics for resource consumption

The McNemar’s statistical significance testing

Results and discussion

The results of AI models and cryptographic analysis on the MQTTEEB dataset

Analysis of memory and CPU usage for cryptographic Algorithms

ML performance for training and testing on the cloud

The results of AI models and cryptographic analysis on the CIC IoT 2023 dataset

Analysis of memory and CPU usage for cryptographic Algorithms

ML performance for training and testing on the cloud

Memory and CPU consumption discussion

Scalability analysis

Statistical significance analysis

MQTTEEB-D dataset (n = 44,514)

CIC IoT 2023 dataset (n = 605,839)

Comparison of the proposed work with related previous studies

Limitations and future work

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links