Abstract
Water deficiency is a significant globally challenge that requires the advancement of sustainable and effective desalination methods. Solar stills provide a feasible solution for the production of fresh water in areas dealing with water limitations, particularly in remote locations. However, their efficacy is frequently limited by fluctuating climatic conditions. The intermittent and changing character of solar radiation imposes significant limitations on most applications. Accurate solar radiation forecasting is crucial for estimating the distillate yield of a solar still system. With computer technology developing so quickly, a growing many deep learning models are employed in solar radiation prediction. For this purpose, the article evaluates the freshwater yield of the modified pyramid solar still in Tehran and Zahedan, Iran. Utilizing monthly data from 1984 to 2023 and employing Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Convolutional Neural Network (CNN), and CNN-LSTM algorithms, predictions for solar irradiance and temperature are calculated for the next ten years. The results validated the better performance of the CNN and GRU models for Tehran, while the LSTM model succeeded for Zahedan in forecasting global solar irradiance (GHI) and temperature. The presented models are utilized for predict the monthly output of the studied solar stills. The predicted average annual freshwater yield for the ten years from 2024 to 2033 is calculated to be 2630 L in Tehran and 2710 L in Zahedan.
Similar content being viewed by others
Introduction
Water is regarded as the fundamental basis of life and a crucial part that sustains world, permeating ecosystems, economics, and everyday activities by humans1. It is concerning that fewer than 1% of the Earth water is drinkable and accessible. There is a legitimate concern about a future water scarcity. Up to 2030, the United Nations predicts that 40% of the global social would face water deficiency2. Due to the rapid pace of industrialization and urbanization in the current era, energy and water consumption has risen to unprecedented levels, thereby exacerbating environmental degradation and contributing to increased greenhouse gas emissions3. Despite its essential nature, billions globally encounter the issue of obtaining safe and potable water for drinking. The emergence of novel technology in solar distillation is recognized as a promising advancement in water purification and desalination methods4. Solar stills have demonstrated efficacy in diverse environments, such as remote locations with restricted access to potable water, disaster-stricken regions, and communities that function autonomously from centralized power systems. They represent a prevalent and efficient solar desalination technology that uses solar energy without impacting the environment5. Solar stills are available in numerous configurations, such as traditional, tubular, pyramid, and stepped designs, each having distinct characteristics and operational features6. Beside these conventional systems some newer emerging technologies such as interfacial solar evaporation can be found7. Interfacial solar evaporation plays a crucial role in desalination because of its sustainable, eco-friendly characteristics. Solar-thermal-driven desalination has been recognized as a promising solution to the global freshwater scarcity challenge under the drive for energy conservation and carbon footprint reduction8. Recent advances in interfacial water evaporation technologies have shown significant potential to enhance the performance of solar-driven desalination systems by localizing heat at the air- water interface and minimizing thermal losses. Such approaches can greatly improve evaporation rates and freshwater yield compared to conventional bulk-heating mechanisms. Recent progress in the functionalization of solar-driven steam generation has been reported, indicating its emerging role in broader applications related to water and energy sustainability9,10. Zhang et al.11 proposed a photothermal textile with vertically confined water layers for scalable, high-flux, and long-term-reliable solar desalination. Also, an interfacial solar evaporation platform (ISEP) combined with a microplastics adsorbent has been proposed as a “one stone, two birds” strategy to produce clean water while enhancing microplastics removal12. Furthermore, to improve the efficiency of photothermal evaporation-driven lithium extraction, the development of evaporator structures with high solar conversion and rational design has been emphasized13. Chen et al.14 designed the polymer sponge evaporator to regulate the solar thermal gradient by incorporating copper–carbon core–shell nanoparticles, which possess comparable solar absorptance. This design aimed to investigate the influence of the solar thermal gradient, or localized heating, on evaporation performance. Future integration of interfacial evaporation strategies may significantly improve efficiency and freshwater yield. Also, the integration of desalination units into hybrid drying systems has been highlighted as a means to strengthen their contribution within the water–energy–food nexus, particularly in arid and island regions. Barzigar et al.15 investigated the application of artificial intelligence technologies, which can further enhance system performance by optimizing heat distribution between drying and desalination processes, dynamically adjusting desalination rates in response to projected water demand, and effectively managing thermal energy flows across multiple operational functions. Critical factors to consider such as solar irradiance, ambient temperature, wind speed, desired feed water flow rate (highest possible water levels in the saline water storage tank), feed water temperature (temperature differential between water and glass), humidity, surface area of the basin, angle of inclination of the top cover, and transparency of glass for solar radiation16. Accurate forecasting and precise solar radiation analysis are useful in decreasing risk and facilitating asset utilization in the most economical approach. As they are meteorological characteristics, Intensity of solar irradiance, ambient temperature, and wind speed could not modified; however, by forecasting these values, future production can be assessed across several intervals17.
Artificial Neural Networks (ANN) have been extensively studied and employed in time series predicting because of their ability to learn from input features, infer the latent components of incomplete dataset, and manage non-linearity. Deep learning algorithms have received significant consideration in applications of their proficiency in time series analysis in extracting complicated patterns formed by raw data. Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN), and Gated Recurrent Unit (GRU), are the main models employed for time series analysis for their capability in processing sequential information and modeling long-term horizons18,19,20. According to studies, these algorithms have received special attention for weather forecasting. Three ANN models, mainly Recurrent Neural Network (RNN), LSTM, and GRU, were employed to forecast solar irradiation in some urban areas in Bangladesh. Regarding the three algorithms, the GRU-based algorithms exhibited the greatest efficiency for Mean Absolute Percentage Error (MAPE)21. Deep learning models, involving RNN, LSTM, and GRU, were utilized for solar energy forecasting, revealing that LSTM and RNN outperformed the GRU algorithm significantly because of their capacity to maintain long-term horizons22. Furthermore, it demonstrated that GRU and LSTM considerably enhance accuracy of predicting when utilizing multivariate data in comparison to univariate data23. Convolutional Neural Networks (CNNs) are also effective in time series analysis24. The hybrid technique may effectively increase forecasting performance as it is utilized for time series and provides the capability to extract spatial and temporal properties, making it suitable for numerous applications. In order to address the identified constraints and obstacles, researchers are employing hybrid deep learning models that combine the advantages of many architectures, especially CNN, LSTM, and GRU, among others. Each of these models provides specific benefits. CNN is capable in recognizing spatial patterns, make them valuable for detecting correlations in multivariate meteorological data. Park et al.25 shown that hybrid models integrating domain-specific forecasting methodologies can adjust to sudden weather fluctuations, hence greatly enhancing predicting accuracy. Wang et al.26 presented that the CNN-LSTM model exhibits enhanced accuracy in PV energy Prediction performance in comparison with that of the individual LSTM and CNN models. Kumari et al.27 revealed that the CNN-LSTM hybrid enhances forecasting accuracy in comparison to individual methods like RNN, LSTM, CNN, and GRU. Uluocak et al.28 presented hybrid models that integrate CNN with various neural networks, including GRU and LSTM. The models have demonstrated enhanced efficacy in one-day ahead air temperature forecasts, exceeding individual networks and machine learning algorithms. Finally, Table 1 provides a summary of the recent researches relevant to weather prediction with different forecast horizons and their best evaluation parameters. Advancements have been made in applying algorithms in desalination process, with numerous studies exploring how weather forecasts influence water production of solar still. For this propose, Elsheikh et al.29 utilized an LSTM model to predict the output of a single slope and stepped solar stills. The stepped solar still yields freshwater was enhanced up to 128% in comparison to a conventional solar still. The coefficient of determination for the forecasted findings is 0.97 for the traditional solar still and 0.99 for the stepped solar distiller. Farouk et al.30 concentrated on three essential meteorological variables: sun irradiance, wind speed, and ambient temperature, all of that significant impact the system of distilling of the pyramidal solar. Regression models are developed to predict the impacts of many performance factors by applying Response Surface Methodology (RSM) with a central composite design that includes four parameters and five levels. Alawee et al.6 examined the performance, production, and predicted production of two kinds of solar stills: the traditional tubular solar still and the convex tubular solar still. The results were evaluating the distillation output of solar stills and studying the utilization of an advanced gradient boosting machine learning (ML) model for predicting distillation productivity. Murugan et al.31 developed the optimal ML model designed for forecasting solar still output under particular weather conditions. Various ML models were evaluated, including linear regression, random forests, decision trees, multilayer perceptron, and support vector machines. Migaybil et al.32 introduced an ANN model to predict the water output of solar stills. The main aim referred to implement the transfer learning method in the RI ANN model for traditional solar stills. The water output was forecasted utilizing kernel-based ML approaches33. The diurnal distillation output of a single slope solar still was modeled utilizing a conventional ANN model trained on meteorological data34. The efficiency of the inclined stepped solar distillation still was analyzed via a cascaded forward neural network35. A Harris Hawks optimizer and a multilayer perceptron neural network have been combined to create a hybrid an artificial intelligence approach for predicting water output in an active solar still system combined with an external condenser36.
To our knowledge, there are limited publications in the field of long-term forecasting, that encouraged researchers to investigate novel methodologies for forecasting long-term solar still output. This study employs various algorithms, that includes LSTM, CNN, GRU, and a hybrid algorithm named CNN-LSTM, to predict the output of the studied solar stills located in Tehran and Zahedan, Iran, for the next ten years. Solar irradiance and temperature are forecasted to assess the potential of suitable solar still placement in an allocated region. The results are compared using Root Mean Square Error (RMSE), Mean Square Error (MSE), coefficient of determination (\({\text{R}}^{2}\)), Mean Absolute Error (MAE), and Coefficient of Variance (COV), used as evaluation parameters, the evaluation metrics of the algorithms are compared.
Experimental methodology
Test-setup
Modified pyramidal solar still with rotational mechanism to enhance the condensation and evaporation processes was designed and constructed. The system utilizes a pyramidal configuration operating at a rotational speed of 1 rpm. The experimental setup is equipped with rotating wheels connected to a central shaft, which is driven by a 6 W DC motor. Figure 1and 2 illustrate the experimental configuration. The tests were performed at the University of Science and Technology in Tehran, located at a latitude of 35.44° N and longitude of 51.41° E. Productivity of freshwater was determined hourly and collected utilizing a measuring jar for distilled water to recording total Solar distillation system yield per day2.
Constructed testing apparatus of the pyramidal solar still2.
Illustration of the experimental configuration2.
The factors influencing the performance of the distiller system were recorded and evaluated in the specified time period. The parameters include wind velocity, sun irradiance, ambient temperature, and rotational speed. The configuration of the pyramidal distillation system requires the consideration of parameters including the exposed surface area to solar irradiation. The design specifications feature an absorptive plate of 70 × 70 cm. The basin of water is constructed using 1.5 mm thick galvanized iron, and the cover of pyramid comprises four clear glass panels, each 4 mm in thickness. Feedwater system of the solar still technique consists of a small funnel connected to a floating water buoy in the tank. The data logger, feeding system, and distillated water jug for collection are attached to the exterior perimeter of the solar still. The examination setup connected to a rotating shaft driven. A solar panel was utilized to eliminate the need for fossil-based energy systems. The pyramid-shaped solar still follows the proportions of prior models, featuring a tilt angle of the glass cover that corresponds using the local latitude2.
Measuring tools
Solar distiller performance is often evaluated on numerous critical features, involving solar irradiance, temperature, distillated volume, and air speed. The configuration is supplied with several measurement instruments essential for collecting data and analysis. This equipment is included into the monitoring system and document the important variables through the experiential tests. Appropriate instruments are employed to determine those features via experimentation30. A solar meter is utilized to measure irradiation values at different daily interval. Determinations of temperatures were obtained using thermocouple wires. Furthermore, the anemometer is employed to measure air velocity. The solar distiller yield is assessed utilizing the accurate balance mechanism. Table 2 represents comprehensive details of the instruments employed in the experimental analysis2.
System efficiency
The performance of several solar distiller models has been evaluated using energy and exergy efficiency45. The efficiency depends on the water output of the solar distiller and the heat transfer mechanism within it. The condensation rate indicates the distillate yield of freshwater generated by the solar still system, additionally referred to as the hourly or daily the system productivity. The overall thermal efficiency of a solar still unit in passive and active operation types, as presented by Kabeel et al.46, allows for the hourly distilled yield of the solar still to be defined as follows in Eq. (1):
where \(\eta_{th}\) is presented as the thermal efficiency, \({\text{h}}_{{{\text{fg}}}}\) is the latent heat of vaporization, \({\text{I}}\left( {\text{t}} \right)\) and \(A\) are the average daily solar radiation, the total area respectively. According to Eq. (2), the thermal efficiency over the day is then47:
where \({\text{I}}\left( {\text{t}} \right)\) is the mean daily solar irradiance on the still from sunrise to sunset. T is the time period.
Besides, \(h_{fg}\) is determined by the water temperature (\(T_{W}\)), by Emad et al.48 as follows in Eq. (3):
\({\text{T}}_{{\text{W}}}\) is proportional to the ambient temperature in each season.
Deep neural networks
LSTM model
LSTM, a specialized type of recurrent neural network (RNN), is developed to address the vanishing gradient issue. Its sophisticated architecture allows LSTM networks Aimed at accurately capturing and processing long-term dependencies in sequential data. These networks are extensively utilized across different sectors, involving speech recognition, challenges including sequence modeling and natural language processing. The primary advancement of LSTM is the incorporation of memory cells and gating processes. In the LSTM architecture illustrated in Fig. 3, the forgetting gate eliminates input data while maintaining critical data49. It generates a vector ranging from 0 to 1 to ascertain which data in the cell state Ct−1 should be preserved. Input control gate modifies xt using the sigmoid function and ht−1 using the tanh function to collectively update the cell state. Output control gate comprises two components: The output at time t, ot, along with the updated hidden state ht, with ot determined by xt and oht-1. The detailed operational concept is articulated by Eqs. (4) to (9) 50:
Wf, Wi, Wg and Wo represent the matrix weights of the respective gates, while bf, bi, bg and bo denote the bias terms. The sigmoid activation function, represented by × , and the tanh function are both employed as activation functions in the model.
CNN model
CNNs are classified as a type of feedforward network models that utilize convolutional operations and consist of convolutional layers, pooling layers, and fully connected layers, as seen in Fig. 4. The convolutional layers execute localized operations on small segments of the input information using a convolutional kernel, as indicated in Eq. (10), therefore capturing spatial patterns and characteristics51.
In the above context, ⨂ signifies convolutional computation, F represents a segment of the input information, while w, Hk, and Wk stand for the convolution kernel weight parameter, height, and width, respectively52.
CNN-LSTM model
A common design for analyzing time-series data is CNN-LSTM, which combines CNN and LSTM networks. This study uses a hybrid approach to feature extraction using the CNN layer and subsequently transmitting them to the LSTM layer. As seen in Fig. 5, the time-sequenced data are first fed toward the feature-extracting convolutional layer in the CNN of features, followed by the pooling layer, which extracts hidden data, and the flatten layer, which decreases the feature dimension. For more time-series modeling and prediction, the gathered feature sequences are then fed into the LSTM. The aggregated features are subsequently fed into the fully connected layer for forecasting. CNN extracts spatial features in time series data, whilst LSTM captures temporal dependencies; this combination leverages the potential of both models, enhancing the modelling and predictive capabilities of time series data53.
GRU model
The GRU neural network is a simplified variant of the LSTM, designed to reduce the number of gates while maintaining long-term memory capabilities, effectively addressing the vanishing gradient issue. GRU comprise two gates: the update gate and the reset gate. The former refers the information discarded related to a new input and the update data incorporated, while the next articulates the extent of long-term or prior data discarded by the forget gate. Equations (11) and (13) refer to the reset and forget gates23.
In Eq. (13), the input data xt and the prior time step data ht-1 are linearly combined, with the various matrices ω being multiplied correspondingly on the right side. Subsequently, the reset gate rt and ωhh. ht−1 are multiplied. Finally, the updated information regarding the present condition is computed using an activation function, often the tanh function54.
As per equation (14), the multiplication of zt and ht−1 indicates the conclusive data information retained from the preceding time step. The output and the information retained from the present memory to the long-term memory. correspond to the output ht generated by the final gate unit. Figure 6 illustrates the structure of the GRU.
Evaluation metrics analysis
Various error measuring techniques can assess the precision of models for forecasting. The study suggests utilizing five assessment measures to assess the model accuracy: mean absolute error (MAE), root mean square error (RMSE), square error (MSE), coefficient of determination (R2), mean and coefficient of variance (COV). Equations (15) to (19) are used to determine the five parameters29,43,55,56:
Input data
Case study
Iran contains broad desert landscapes and clear sky, providing advantageous conditions for solar energy, which exhibits significant variability in potential across various regions57. Calculations indicate that the annual solar radiation hours in Iran reach 2800 h. The sun irradiance in Iran is 2.5 times greater than that of European countries. The region contains significant solar energy potential, including approximately 300 days of sunlight annually and an mean solar radiation of 2200 kWh/m258. The Global Horizontal Irradiation (GHI) atlas for Iran (see Supplementary Fig. S1 online), indicating significant sun irradiation values in the southeastern region, so providing it a suitable option for the establishment of a solar distiller59. Tehran, the capital of Iran, is situated in the northern part of the country, located at coordinates latitude of 35.44, longitude of 51.41. July exhibits the peak intensity of sun irradiation, often exceeding 7 kWh/m2/day and December has the lowest solar irradiation level, usually around 2.5 kWh/m2/day. Zahedan is a city situated in southeastern Iran (latitude 29.47, longitude 60.90) that possesses one of the greatest levels of solar radiation, with a low of 3.97 kWh/m2/day in December and a maximum of 8.5 kWh/m2/day in June. Meteorological data for Tehran and Zahedan from 1984 to 2023 (480 months) have been obtained every month from the NASA website60.
Data pre-processing and feature selection
Solar irradiance, temperature, from Tehran and Zahedan were obtained from the NASA website60 for the period between 1984 and 2023. Details of the characteristic can be presented in Table 3. The dataset contains no missing values; nevertheless, preprocessing is still necessary. This preparation phase involves feature engineering and data normalization.
Feature selection is used to determine the most significant characteristics influencing the result in order to reduce computing costs and improve prediction accuracy. One approach of feature selection involves utilizing features that have the strongest association with the output variable61. Pearson correlation is employed in this investigation as the method for feature selection. This coefficient is appropriate for use with continuous input and target parameters in time series data, quantifying relationships on a scale from − 1 to 1. It is typically employed to compute the standard variations of input and target features, indicating the degree of correlation through the coefficient. When the value of the coefficient is between 0 and 1, the target feature rises as the input feature enhanced. On the other hand, When a coefficient ranges between 0 and − 1, it indicates that the target feature decreases as the input variable increases62. The heat map in Fig. 7 and 8 demonstrate the strong correlation between PS, QV2M, T2MDEW, and TS with GHI, T2M, and WS10M for Tehran and Zahedan. Thus, these characteristics have been regarded as input variables in neural network-based multivariate time series forecasting. According to these matrixes, characteristics exhibiting a correlation exceeding 0.7 with solar irradiance and T2M have been identified as a significant element for predicting. As can easily be understood from matrix for Tehran, T2MDEW, T2MWET, RH2M, TS, and PS are highly correlated with GHI and T2M. T2MWET, RH2M and TS exhibit the strongest interaction with sun radiation. and temperature in Zahedan.
Data normalization
In addition, differences in the scale and range of data can reduce the accuracy and efficiency of deep learning approaches. Normalization addresses this issue by scaling all numerical features to a uniform range, thereby ensuring no single variable disproportionately influences the model due to its scale. Research employing solar and wind power prediction models frequently utilizes the min–max scaling technique for normalization purposes. By rescaling the data according to the highest and lowest values, this method normalizes the data by transforming it to have a standard scale. By standardizing the data ranges between 0 and 1, this makes it easier to compare and understand different kinds of data63. The data normalization process is defined by Eq. (20) 64:
The time series data must then be defined as a supervised learning problem. Multistep forecasting could forecast all times during one stage65. Using previous time steps as input features and future time steps as target variables, this study employed the direct approach for multistep ahead data prediction. Figure 9 illustrates that each expected data can be predicted multiple times. To avoid overfitting and underfitting, the dataset is divided into test and train portions in a 30–70 ratio. The methods used in this work to predict temperature and solar radiation is shown in Fig. 10. This methodology comprises four primary processes: (a) data collecting, (b) data preprocessing, (c) training the model, and (d) model evaluation.
Results
The input features are situated in two locations: a. Tehran, b. Zahedan. The results from the assessment of multivariate multi-step forecasts over the defined prediction horizon are presented in Supplementary Tables S1 and S2 online. As mentioned before, two separate situations were tested. The first case is located in Tehran, and the second case is located in Zahedan city. GHI is the primary target variable forecasted by the model, followed by T2M as the secondary output. It should be considered that changes in prediction priorities require hyperparameter reconfiguration. Moreover, increasing data volume and incorporating multivariate features tend to enhance the model forecasting accuracy.
Solar radiation prediction
The results of different models used to predict GHI in Tehran and Zahedan are shown. The CNN model in Tehran showed the best accuracy across all parameters, with the lowest testing RMSE (0.0526), MSE (0.0027), and MAE (0.0402). Additionally, a significant correlation between the predicted and actual values is indicated by its R2 value of 0.9671. With an R2 of 0.9664, the GRU model showed consistent performance across target features despite being significantly less accurate. Both LSTM and CNN-LSTM models demonstrated strong performance, with LSTM slightly surpassing CNN-LSTM in RMSE and MAE metrics. Their results were not as strong as those of the CNN models used alone, though, indicating that simpler models may be enough for predicting GHI in this region.
The LSTM model performed better for Zahedan than the other models, with an R2 of 0.9595 and an RMSE of 0.0547 during testing. The CNN-LSTM and GRU models performed well, with CNN-LSTM yielding slightly better results than GRU, where CNN-LSTM achieved an COV of 10.39% compared to 10.46% for GRU. Interestingly, while these models displayed slightly higher error metrics, their R2 values remained above 95%, indicating strong predictive capability. In contrast to Tehran, the CNN model performed the least effectives. Overall, Tehran yielded slightly better prediction accuracies across all models compared to Zahedan, possibly due to differing environmental conditions.
Temperature prediction
The performance metrics for temperature prediction across models and locations are summarized. Unlike solar radiation predictions, GRU showed the best performance in Tehran, obtaining an RMSE of 0.0458, an MSE of 0.0021, and an R2 of 0.9721 during testing. The CNN model, though marginally less precise, exhibited consistent performance with an R2 score of 0.9719 throughout the target feature. The LSTM and CNN-LSTM models demonstrated competitive results, with LSTM marginally surpassing CNN-LSTM in terms of RMSE (0.0474) and MAE (0.0371). The hybrid CNN-LSTM models offered competitive performance but did not consistently outperform standalone CNN and LSTM models. For instance, CNN-LSTM achieved an R2 of 0.9690 in Tehran, indicating strong predictive capabilities but falling slightly short of LSTM.
In Zahedan, the LSTM model again exhibited strong performance, similar to its results in GHI prediction, with an RMSE of 0.0554 and an R2 of 0.9589. Also, GRU and CNN-LSTM provided comparable performance, GRU had an MSE of 0.0031 and an R2 of 0.9588, closely rivaling CNN-LSTM. However, its MSE was slightly higher than that of CNN-LSTM, which achieved a value of 0.0034. As with GHI prediction, Tehran exhibited slightly better predictive accuracy across all models for temperature, which could be attributed to the quality of input data or environmental factors.
According to Table 4, the CNN algorithm demonstrates the maximum performance for Tehran, while the LSTM model succeeds for Zahedan in GHI prediction, the GRU algorithm demonstrates the superior Tehran performance. The LSTM algorithm demonstrates the best performance for forecasting in Zahedan over a 10-year horizon in prediction in temperature prediction. Hybrid CNN-LSTM models demonstrated strong performance but did not always surpass standalone other models, indicating that hybridization may not necessarily improve prediction accuracy in all cases.
In multi-step direct forecasting, as illustrated in Fig. 9, each observation is predicted many times, and the forecasts for each observation are averaged to supply a visual comparison of method performance on the dataset. Figure 11 illustrates the CNN and GRU performance algorithms in predicting GHI and T2M in Tehran using training data to forecast the next ten years. Figure 12 illustrates forecasting performance of the LSTM model applied to GHI data and T2M in Zahedan employing training data to predict trends over the next ten years, which confirms a reasonable alignment of predicted values with actual data.
Fitting a regression line between observed and predicted values offers an alternative method for evaluating model performance. A strong correlation between the expected and actual data is seen by the distribution of points close to the line y = x. There is a strong correlation between the actual and predicted values of T2M and GHI (see Supplementary Fig. S2 and S3 online).
Further validating the suggested CNN model ability to forecast GHI and the GRU model capacity to predict T2M in Tehran during the subsequent decade to satisfactory accuracy and effectiveness, that was the objective of this research is presented (see Supplementary Fig. S4 online). Also, the suggested LSTM model capacity to forecast GHI and T2M in Zahedan for the last ten years is illustrated (see Supplementary Fig. S5 online).
This kind of prediction is highly beneficial for anyone aiming to construct and evaluate the environmentally friendly and economic feasibility of solar distiller in a particular region. Equations (1) to (3) indicate that two factors, GHI and T2M, are required to forecast the solar still capacity to produce fresh water. Consequently, considering into account the GHI and T2M obtained from the CNN and GRU models for Tehran, along with the LSTM model for Zahedan, the freshwater output of this solar still is computed. Figure 13 and 14 illustrate the yearly freshwater output for the proposed solar still located in Tehran and Zahedan from 1984 to 2033. Also, a comparison of the water productivity in Tehran and Zahedan for the next 10 years is shown (see Supplementary Fig. S6 online). It is important to remember that certain elements, like freezing temperatures, particulate matter, and foggy climate, have been excluded from the numerical estimates of water productivity.
Discussion
The findings of this study indicate that deep learning models, are effective in long-term forecasting of solar radiation and temperature for the selected regions of Tehran and Zahedan. The comparison of model performance across multiple evaluation metrics provides key insights into the suitability of these methods for renewable energy applications.
In Tehran, the CNN and GRU models consistently outperformed GRU, LSTM, and hybrid CNN-LSTM models across for solar radiation and temperature prediction, these models achieved the highest accuracy, as indicated because of its low RMSE, MSE, and MAE values, along with its high R2. This performance advantage is a result of their capability to efficiently capture spatial features within the data, which are particularly important for modeling environmental parameters such as solar radiation and temperature. In Zahedan, LSTM demonstrated better performance than other models, albeit with slightly higher error metrics compared to other models in Tehran. The observed performance difference between the two locations may be attributed to variations in climatic conditions, geographical factors, or differences in the quality and distribution of the input data. This finding highlights the importance of regional calibration when applying forecasting models to diverse geographical settings.
While hybrid CNN-LSTM models provided competitive results, they did not consistently outperform the standalone models. For instance, CNN-LSTM achieved high accuracy in Tehran but was slightly inferior to CNN in terms of RMSE and R2 values. Similarly, CNN-LSTM provided strong predictions in Zahedan but did not surpass the performance of LSTM. This suggests that the added complexity of combining convolutional and recurrent layers may not always yield significant improvements, particularly when the spatial features dominate the data, as in the case of solar radiation and temperature. The evaluation metrics used in this investigation, performed an extensive analysis of model performance. The consistently high R2 values across all models in both locations, exceeding 97% in most cases, indicate that the models effectively captured the variability in solar radiation and temperature data.
The comparative analysis between regions, Tehran consistently exhibited better prediction accuracy than Zahedan across all models and metrics. This trend could be explained by several factors, including a more stable and homogeneous climate in Tehran, which may make it easier for models to learn underlying patterns. Conversely, more variable climatic conditions of Zahedan might introduce additional noise into the data, making predictions more challenging. Additionally, differences in solar irradiance and temperature profiles between the two regions might affect the ability of model to generalize.
The findings of this research provide practical insights for the placement and operation of solar stills in Tehran and Zahedan. The ability to accurately forecast solar radiation and temperature is crucial for improving design efficiency and placement of solar stills to maximize their efficiency. The superior performance of selected models suggests that they can be reliably used for long-term forecasting, aiding decision-making processes for renewable energy projects. Furthermore, the model ability to handle regional variations underscores their potential applicability to other locations with different environmental conditions.
Conclusion
The purpose of this research was to forecast the freshwater output of the pyramid solar still. This research evaluates four models (LSTM, CNN, CNN-LSTM, and GRU) to predict multistep GHI and T2M for two locations in Iran, Tehran and Zahedan, over the next ten years through numerical simulations utilizing real GHI and T2M histories. The direct technique was employed in this work to forecast multistep forward data due to its lack of error propagation problems found in the recursive method. Five evaluation metrics were used to compare the outcomes, RMSE, MSE, MAE, COV, and \({R}^{2}\). The results validated the better performance of the CNN and GRU models for GHI and T2M prediction in Tehran, while the LSTM model succeeded in Zahedan, compared with other models in properly aligning with the real test value using two specified inputs that highlights the importance of location-specific calibration. Hybrid CNN-LSTM models offered competitive results, though they did not consistently outperform standalone CNN models, suggesting that added model complexity may not always yield significant gains. Recurrent models, such as LSTM, performed adequately but were generally less accurate than CNN models, likely due to the spatially dominance of the input data. The aim of the GHI and T2M forecast was to perform a predictability analysis of freshwater yield for the next ten year to inform planning analysis. The difference between the actual and forecasted information proved the model optimal consistency in following the trend and forecasting values.
The study also highlighted regional differences, with Tehran exhibiting slightly better prediction accuracy than Zahedan. The importance of region-specific data analysis is highlighted by these findings and model fine-tuning to account for variations in environmental conditions. This study shows how deep learning models can be used, particularly CNNs, as reliable tools for long-term forecasting of solar still productivity. Accurate predictions of solar radiation and temperature can support the optimal placement and design of solar stills, ultimately enhancing the efficiency of renewable energy systems. Future studies should emphasis on growing the dataset to contain more variety of locations and exploring advanced hybrid or ensemble approaches to further improve predictive accuracy and generalizability. The major conclusions are presented as follows:
-
The predicted monthly variation of GHI and T2M over the next years in Tehran is between 97.8 and 330.8 W/m2, while the temperature is expected to range from 2 to 31.6 °C. The monthly variation range for GHI and T2M during the next ten years in Zahedan is 167.1–354.6 W/m2, whereas the temperature range is between 6.4 and 43.6 ◦C.
-
Between 2024 and 2033, the predicted freshwater yield of the solar still is estimated to reach 2630 L in Tehran and 2710 L in Zahedan.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- ANN:
-
Artificial neural network
- CNN:
-
Convolutional neural network
- COV:
-
Coefficient of variance
- GHI:
-
Global horizontal irradiance \({\text{(W}}/{\text{m}}^{2} )\)
- GRU:
-
Gated recurrent unit
- LSTM:
-
Long short-term memory
- MAE:
-
Mean absolute error
- MAPE:
-
Mean absolute percentage error
- ML:
-
Machine learning
- MSE:
-
Mean square error
- \({\text{R}}^{2}\) :
-
Coefficient of determination
- RMSE:
-
Root mean square error
- RNN:
-
Recurrent neural network
- RSM:
-
Response surface methodology
- T2M:
-
Temperature at two meters (°C)
- \(\text{I}(\text{t})\) :
-
The average daily solar radiation (W/m2)
- \({\text{T}}_{\text{w}}\) :
-
Water temperature (\(^\circ \text{C})\)
- \({h}_{fg}\) :
-
Latent heat of water vaporization
- \({\dot{m}}_{w}\) :
-
Hourly productivity from solar still
- \({\eta }_{th}\) :
-
Thermal efficiency
References
Sahu, R. & Tiwari, A. C. Performance enhancement of single slope solar still using nanofluids at different water depth. Desalin. Water Treat. 317, 100046 (2024).
Baiee, A. A. A., Asiaei, S. & Hosseinalipour, S. M. Experimental study on improving performance and productivity of pyramid solar still using rotation technique. Front. Heat Mass Transf. 22(3), 955–976 (2024).
Fallah, M. et al. Comprehensive review of methodologies and case studies in advanced exergy, exergo-economic, and exergo-environmental analyses. Energy 137606 (2025).
Kemerchou, I. et al. Enhancing solar still efficiency in southeastern Algeria: An experimental case with palm stems. Desalin. Water Treat. 317, 100148 (2024).
Katekar, V. P. & Deshmukh, S. S. A review on research trends in solar still designs for domestic and industrial applications. J. Clean. Prod. 257, 120544 (2020).
Alawee, W. H. et al. Forecasting sustainable water production in convex tubular solar stills using gradient boosting analysis. Desalin. Water Treat. 318, 100344 (2024).
Zhao, Z. & Wang, C. Photothermal textiles achieve sustainable desalination. Device 3(6) (2025).
Dang, C. et al. System integration for solar-driven interfacial desalination. Device 2(5) (2024).
Mao, K., Zhang, Y. & Tan, S. C. Functionalizing solar-driven steam generation towards water and energy sustainability. Nat. Water 3(2), 144–156 (2025).
Zhang, Y. & Tan, S. C. Best practices for solar water production technologies. Nat. Sustain. 5(7), 554–556 (2022).
Zhang, Q. et al. High-flux and anti-salt solar desalination via scalable photothermal textiles with vertically confined water layers. Device 3(6) (2025).
Yu, Z. et al. Microplastic detection and remediation through efficient interfacial solar evaporation for immaculate water production. Nat. Commun. 15(1), 6081 (2024).
Liu, Q. et al. Solar-driven fast and selective extraction of lithium from seawater enabled by unidirectional photothermal conversion and confined crystallization with facile synthesis of nanoarray evaporator. Chem. Eng. J. 506, 159990 (2025).
Chen, X. et al. Understanding solar thermal gradient to improve solar evaporation performance for water collection. Nano Res. Energy 4, e9120152 (2025).
Barzigar, A., Hosseinalipour, S. & Mujumdar, A. S. AI for Enhanced Solar Dryer Performance: Integration of PV/T Panels, Solar Collectors, Energy Storage, Biomass, and Desalination Units 1–2 (Taylor & Francis, 2025).
Kumar, D., Himanshu, P. & Ahmad, Z. Performance analysis of single slope solar still. Int. J. Mech. Robot. Res. 3(3), 66–72 (2013).
Sharshir, S. W. et al. Factors affecting solar stills productivity and improvement techniques: A detailed review. Appl. Therm. Eng. 100, 267–284 (2016).
Chu, Y. et al. Intra-hour irradiance forecasting techniques for solar power integration: A review. Iscience 24(10) (2021).
Skrobek, D. et al. Prediction of sorption processes using the deep learning methods (long short-term memory). Energies 13(24), 6601 (2020).
Skrobek, D. et al. Implementation of deep learning methods in prediction of adsorption processes. Adv. Eng. Softw. 173, 103190 (2022).
Faisal, A. F. et al. Neural networks based multivariate time series forecasting of solar radiation using meteorological data of different cities of Bangladesh. Results Eng. 13, 100365 (2022).
Jebli, I. et al. Deep learning based models for solar energy prediction. Adv. Sci. Technol. Eng. Syst. J. 6(1), 349–355 (2021).
Wojtkiewicz, J. et al. Hour-ahead solar irradiance forecasting using multivariate gated recurrent units. Energies 12(21), 4055 (2019).
Wibawa, A. P. et al. Time-series analysis with smoothed Convolutional Neural Network. J. Big Data 9(1), 44 (2022).
Park, J. et al. Domain hybrid day-ahead solar radiation forecasting scheme. Remote Sens. 15(6), 1622 (2023).
Wang, K., Qi, X. & Liu, H. Photovoltaic power forecasting based LSTM-convolutional network. Energy 189, 116225 (2019).
Kumari, P. & Toshniwal, D. Long short term memory–convolutional neural network based deep hybrid approach for solar irradiance forecasting. Appl. Energy 295, 117061 (2021).
Uluocak, I. & Bilgili, M. Daily air temperature forecasting using LSTM-CNN and GRU-CNN models. Acta Geophys. 72(3), 2107–2126 (2024).
Elsheikh, A. H. et al. Utilization of LSTM neural network for water production forecasting of a stepped solar still with a corrugated absorber plate. Process Saf. Environ. Prot. 148, 273–282 (2021).
Farouk, W. et al. Mathematical modeling and optimization of pyramidal still performance using response surface method and Ag-nanoparticle enhanced phase change material. J. Energy Storage 75, 109569 (2024).
Murugan, D. K. et al. Machine learning approaches for real-time forecasting of solar still distillate output. Environ. Chall. 13, 100779 (2023).
Migaybil, H. H. & Gopaluni, B. A performance neural network model for conventional solar stills via transfer learning. Appl. Energy 375, 124118 (2024).
Zayed, M. E. et al. Predicting the yield of stepped corrugated solar distiller using kernel-based machine learning models. Appl. Therm. Eng. 213, 118759 (2022).
Santos, N. I. et al. Modeling solar still production using local weather data and artificial neural networks. Renew. Energy 40(1), 71–79 (2012).
Abujazar, M. S. S. et al. Productivity modelling of a developed inclined stepped solar still system based on actual performance and using a cascaded forward neural network model. J. Clean. Prod. 170, 147–159 (2018).
Essa, F., Abd Elaziz, M. & Elsheikh, A. H. An enhanced productivity prediction model of active solar still using artificial neural network and Harris Hawks optimizer. Appl. Therm. Eng. 170, 115020 (2020).
Qasem, S. N. & Alzanin, S.M. An effective forecasting approach of temperature enabling climate change analysis in Saudi Arabia. Int. J. Adv. Comput. Sci. Appl. 15(3) (2024).
Aswin, S., Geetha, P. & Vinayakumar, R. Deep learning models for the prediction of rainfall. In 2018 International Conference on Communication and Signal Processing (ICCSP) (IEEE, 2018).
Sabat, N. K. et al. A comparative analysis of univariate deep learning-based time-series models for temperature forecasting of the bhubaneshwar. In 2022 IEEE 2nd International Symposium on Sustainable Energy, Signal Processing and Cyber Security (iSSSC) (IEEE, 2022).
Respaty, W. A. et al. Weather prediction in Jakarta: An analysis of climate data and regional influences using LSTM and GRU. In 2023 International Conference on Data Science and Its Applications (ICoDSA) (IEEE, 2023).
Zafar, R. & Chung, I.-Y. Day-ahead solar irradiance forecasting using a hybrid weather-based attention BiLSTM approach for power system operation scheduling. In 2023 IEEE PES Innovative Smart Grid Technologies Europe (ISGT EUROPE) (IEEE, 2023).
Azizi, N. et al. Deep learning based long-term global solar irradiance and temperature forecasting using time series with multi-step multivariate output. Renew. Energy 206, 135–147 (2023).
Yaghoubirad, M. et al. Deep learning-based multistep ahead wind speed and power generation forecasting using direct method. Energy Convers. Manag. 281, 116760 (2023).
Zhao, X., Huang, L. & Nie. Y. Temperature prediction based on integrated deep learning and attention mechanism. In 2021 IEEE 6th International Conference on COMPUTER and Communication Systems (ICCCS) (IEEE, 2021).
Sharshir, S. et al. Thermal performance and exergy analysis of solar stills–A review. Renew. Sustain. Energy Rev. 73, 521–544 (2017).
Kabeel, A., Hamed, M. H. & Omara, Z. Augmentation of the basin type solar still using photovoltaic powered turbulence system. Desalin. Water Treat. 48(1–3), 182–190 (2012).
Cooper, P. The maximum efficiency of single-effect solar stills. Sol. Energy 15(3), 205–217 (1973).
Ghandourah, E. et al. Performance enhancement and economic analysis of pyramid solar still with corrugated absorber plate and conventional solar still: A case study. Case Stud. Therm. Eng. 35, 101966 (2022).
Wang, W. et al. Self-feedback LSTM regression model for real-time particle source apportionment. J. Environ. Sci. 114, 10–20 (2022).
Wang, D. et al. Forecasting power demand in China with a CNN-LSTM model including multimodal information. Energy 263, 126012 (2023).
Ning, F. et al. Enhanced spectrum convolutional neural architecture: An intelligent leak detection method for gas pipeline. Process Saf. Environ. Prot. 146, 726–735 (2021).
Li, Q. et al. Arctic short-term wind speed forecasting based on CNN-LSTM model with CEEMDAN. Energy 299, 131448 (2024).
Shi, H. et al. A CNN-LSTM based deep learning model with high accuracy and robustness for carbon price forecasting: A case of Shenzhen’s carbon market in China. J. Environ. Manag. 352, 120131 (2024).
Assis, M. V. et al. A GRU deep learning system against attacks in software defined networks. J. Netw. Comput. Appl. 177, 102942 (2021).
Saba, A. I. & Elsheikh, A. H. Forecasting the prevalence of COVID-19 outbreak in Egypt using nonlinear autoregressive artificial neural networks. Process Saf. Environ. Prot. 141, 1–8 (2020).
Wang, Z. et al. LSTM-convolutional-BLSTM encoder-decoder network for minimum mean-square error approach to speech enhancement. Appl. Acoust. 172, 107647 (2021).
Shahsavari, A., Yazdi, F. & Yazdi, H. Potential of solar energy in Iran for carbon dioxide mitigation. Int. J. Environ. Sci. Technol. 16, 507–524 (2019).
Dehghani, M. & Feylizadeh, M. An overview of solar energy potential in Iran. Int. J. Curr. Life Sci. 4(9), 7173–7180 (2014).
Solargis. Available from: https://solargis.com/.
NASA. Available from: https://power.larc.nasa.gov/data-access-viewer/.
Moustafa, E. B., Hammad, A. H. & Elsheikh, A. H. A new optimized artificial neural network model to predict thermal efficiency and water yield of tubular solar still. Case Stud. Therm. Eng. 30, 101750 (2022).
Chakraborty, D. et al. Computational solar energy–Ensemble learning methods for prediction of solar power generation based on meteorological parameters in Eastern India. Renew. Energy Focus 44, 277–294 (2023).
Ahmed, U. et al. Short-term global horizontal irradiance forecasting using weather classified categorical boosting. Appl. Soft Comput. 155, 111441 (2024).
Zhao, Y. & Cen, Y. Data Mining Applications with R (Academic Press, 2013).
Suradhaniwar, S. et al. Time series forecasting of univariate agrometeorological data: A comparative performance evaluation via one-step and multi-step ahead forecasting strategies. Sensors 21(7), 2430 (2021).
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Contributions
S.A. wrote the main manuscript, original draft, Project administration, Methodology, Conceptualization S.M.H. Writing—original draft, Investigation S.A. review & editing, Supervision.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Consent to participate
All participants provided informed consent to take part in the study.
Consent for publication
All participants consented to the publication of data and results arising from the research.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Allahyari, S., Hosseinalipour, S.M. & Asiaei, S. Deep learning approaches for predicting solar radiation and freshwater yield in modified pyramid solar still. Sci Rep 15, 41322 (2025). https://doi.org/10.1038/s41598-025-25094-1
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-25094-1
















