Introduction

Carbon dioxide (CO2) emissions are responsible for greenhouse gases, global warming, and undesirable climate change1. The growing human population, economic progress, industrialization, and rapid development of countries (especially India and China) make this emission dangerous. Acidic gases such as H2S and CO2 are produced beside light hydrocarbons in a large number of petroleum and gas production industries2. Scientists warned that the greenhouse gas levels might reach such a dangerous level that they can not be reversed at all3. Therefore, reducing CO2 emissions is one of the main concerns for continuing our life on the earth in the current century. Different solvents4 are applied for absorbing CO2 gas, such as amine solutions5, amino acid salt6, eutectic solvent7, brine8, and ionic liquids9. However, amine solutions have drawbacks, especially high evaporation loss, accumulation of water in the gas phase, and the release of corrosive side materials. Besides the solvent type, information about the phase behavior of solvents, like solubility parameters, can help researchers to improve the absorption procedure. Ionic liquids are a new group of environmentally friendly solvents. The disadvantages of the amine solutions are not observed in the ionic liquid, especially when generating the pollutant vapors during applications. Moreover, ionic liquids have the potential to dissolve many diverse materials, such as organic, inorganic, and organometallic compounds. The research studies conducted on CO2 capture by ionic liquids have experimental and modeling types. Researchers focused on experimental measurements of CO2 solubility in 1-butyl-3-methylimidazolium chloride10, 1-butyl-3-methylimidazolium acetate11, 1-butyl-3-methylimidazolium methyl sulfate11, 1-methyl-1-propylpyrrolidinium dicyanamide12, 1-ethyl-3-methylimidazolium acetate12, 1-octyl-3-methylimidazolium tetrafluoroborate13, and 1-N-butyl-3-methylimidazolium hexafluorophosphate14. Highlighting the difficulty, economic cost, and reproducibility of an experimental study, some researchers attempted to present a network for anticipating the CO2 mole fraction in ionic liquids. Indeed, machine learning approaches like ANN, Grey Wolf optimization, Whale optimization, and Gravity search optimization algorithms15, multivariate adaptive regression8, and adaptive network-based fuzzy inference systems (ANFIS)16 are appropriate tools for predicting gas solubility. ANN helps anticipate data. ANN is one of the crucial programs17,18 and a part of artificial intelligence that generates a trend of the human nervous system regarding the obtained values in the laboratory. Companies such as petroleum which generate huge amount of CO2 gases can employ proficient chemical engineers in the field of ANN for predicting the CO2 solubility in diverse temperatures, pressures, etc., with use of suggested formulations of ANN program without spending much investment in installing advanced equipment, hiring labors, purchasing materials, and consuming electricity or heat. Thus, ANN can intensely help the industries in terms of costs and economic aspects. Some conducted works in the application of ANN for the absorption of CO2 with ionic liquids have been listed in Table 1. The objective of this work is to model the CO2 solubility with the help of an ANN. For this purpose, two ANN models were utilized, including a multilayer perceptron (MLP) and a radial basis function (RBF). Temperature and pressure were considered as inputs of the ANN in this work. Additionally, the impacts of temperature and pressure on the CO2 mole fraction were studied. The limitations of the previous work can be referred to in this note, that they have not studied only one type of ionic liquid ([Bmim][PF6]) in the adsorption of CO2 gas at different temperatures and pressures. The main purpose of this work is to use artificial intelligence for detecting the best temperature and pressure at which [Bmim][PF6] ionic liquid performance becomes high in trapping CO2 molecules. The other main purpose is finding the most appropriate ANN structure for predicting the solubility of CO2 in [Bmim][PF6] at every temperature and pressure without spending cost and time in the laboratory.

Table 1 The conducted investigations in the absorption of CO2 using the ANN method.

Experimental databank

ANNs are among the data-driven methodologies. This means that they obtain their understanding of the corresponding process from experimental measurements. Therefore, collecting an experimental database is necessary to develop the ANN-based paradigm for predicting CO2 mole fraction in the [Bmim][PF6]. For this aim, a relatively huge database having 371 datasets is gathered from thirteen different experimental studies30,31,32,33,34,35,36,37,38,39,40,41,42. Table 2 presents the work’s independent and dependent variables, their ranges, and the number of collected datasets from different references. In summary, our databank covers temperatures of 283.15 to 546.15 K, pressures of 0.092 to 196.8 bar, and CO2 solubility of 0.001 to 0.721. To provide readers with visual observation, the histogram of temperature, pressure, and CO2 mole fraction in the [Bmim][PF6] is shown in Figs. 1S to 3S (supplementary section), respectively. It is essential to note that the histogram is a popular method to understand how many data points are available in any specific range of a considered domain. The x-axis divides the whole domain of a given variable into several segments, while its y-axis shows the number of available data points in each segment. Figs. 1S and 2S (supplementary section) show that significant parts of the collected experimental database were conducted at small temperatures and pressures. It is not hard to observe that more than 310 datasets cover temperatures < 350 K, and only 51 data points are available for the higher temperatures. Moreover, large numbers of our collected datasets (~ 220 data points) cover pressures smaller than 20 bar. On the other hand, despite the pressure and temperature data points, the collected CO2 solubility measurements have a better distribution. They are almost uniformly distributed over the available domain.

Table 2 Summary of the gathered database for CO2 solubility in [Bmim][PF6] ionic liquid.

Theory and methodology

The fundamentals of ANNs as intelligent mathematical models are tried to be explained in this section. Moreover, the structure of multi-layer perceptron (MLP) and radial basis function (RBF) neural networks is also concisely reviewed. The main differences between these two types of ANN methodology are related to their activation function and training procedure.

ANN theory

ANNs are the learnable mathematical paradigms constructed by simulating the biological neuron operation43. Equations (1) and (2) explain all computational operations conducted by the neurons.

$${\text{net}}\, = \,\sum\limits_{{{\text{i}} = 1}}^{{\text{N}}} {{\text{w}}_{{{\text{mi}}}} } \,{\text{x}}_{{\text{i}}} \, + \,{\text{b}}_{{\text{m}}}$$
(1)
$${\text{out}}_{{\text{m}}} \, = \,{\text{f (net)}}$$
(2)

These equations express that the entry signal (xi) is multiplied by the adjustable parameters, namely, weights (wmi), and then added to another tunable parameter, i.e., bias (bm). This linear mathematical operation should pass through an activation function (f) to generate the output signal (outm). The rationale for model selection is based on the statistical analysis, especially relative deviation and correlation coefficient values, which display the closeness of experimental data to the given model.

MLP network

MLP, which is categorized as a feed-forward neural network. It should be mentioned that the numbers of hidden neurons are often determined by a trial-and-error procedure..

The trend of the calculation process in this model is written in Eq. 3. The learning algorithm formula in the MLP model is written in Eq. (4). η is the learning rate, a small positive value that controls the adjustment magnitude. ytrue is the actual value. ypredict is the predicted value. Also, wi+1 and wi refer to the new and old values of the weights, respectively.

$${\text{Y}} = {\text{f}}_{{2}} \left( {\sum\limits_{{{\text{i}} = {1}}}^{{\text{n}}} {{\text{w}}_{{\text{j}}} \times {\text{f}}_{{1}} \left( {\sum {{\text{h}}_{{{\text{ij}}}} {\text{x}}_{{\text{i}}} + {\text{b}}_{{\text{j}}} } } \right) + {\text{b}}_{{\text{o}}} } } \right)$$
(3)
$${\text{w}}_{{\text{i + 1}}} = {\text{w}}_{{\text{i}}} + \eta \times \left( {{\text{y}}_{{{\text{true}}}} - {\text{y}}_{{{\text{predict}}}} } \right) \, \times {\text{x}}_{{\text{i}}}$$
(4)

The symbols of ‘hij’, ‘bi’, ‘f1’ are the weights vector, biases, and activation function in the hidden layer, and ‘wj’, ‘bo’, ‘f2’ are the weights vector, biases, and activation function in the output layer.

RBF network

This ANN type has Gaussian and linear functions in the hidden and output portions, respectively. Equations (5) and (6) show the mathematical formula of Gaussian and linear activation functions, respectively44.

$${\text{f (net) }} = \, \,{\text{e}}^{{\frac{{ - {\text{net}}^{2} }}{{2\sigma^{2} }}}}$$
(5)
$${\text{f (net) }} = \, \,{\text{net}}$$
(6)

where ’’ refers to the spread of the Gaussian relation.

Statistical analysis

Several statistical criteria are employed to monitor the performance of the developed RBF and MLP. Equations (9) to (12) show the mathematical expressions of different kinds of errors, and the correlation coefficient, respectively45,46.

$${\text{AARD\% }} = \frac{100}{{\text{N}}}\sum\limits_{{\text{n}}}^{{\text{N}}} {\frac{{\left| {{\text{S}}_{{\text{n}}}^{{{\text{act}}}} - {\text{ S}}_{{\text{n}}}^{{{\text{pred}}}} } \right|}}{{{\text{S}}_{{\text{n}}}^{{{\text{act}}}} }}}$$
(7)
$${\text{MSE}} = \frac{1}{{\text{N}}}\sum\limits_{{{\text{n}} = 1}}^{{\text{N}}} {\left( {{\text{S}}_{{\text{n}}}^{{{\text{act}}}} - {\text{ S}}_{{\text{n}}}^{{{\text{pred}}}} } \right)^{2} }$$
(8)
$${\text{RMSE}} = \left( {\frac{1}{{\text{N}}}\sum\limits_{{{\text{n}} = 1}}^{{\text{N}}} {\left( {{\text{S}}_{{\text{n}}}^{{{\text{act}}}} - {\text{ S}}_{{\text{n}}}^{{{\text{pred}}}} } \right)}^{2} } \right)^{0.5}$$
(9)
$${\text{R}}^{2} = 1 - \frac{{\sum\limits_{{{\text{n}} = 1}}^{{\text{N}}} {\left( {{\text{S}}_{{\text{n}}}^{{{\text{act}}}} - {\text{ S}}_{{\text{n}}}^{{{\text{pred}}}} } \right)^{2} } }}{{\sum\limits_{{{\text{n}} = 1}}^{{\text{N}}} {\left( {{\text{S}}_{{\text{n}}}^{{{\text{act}}}} - \, \overline{{{\text{S}}^{{{\text{pred}}}} }} } \right)^{2} } }}$$
(10)

wherein ‘S’ shows CO2 mole fraction in [Bmim][PF6], and ‘N’ refers to the number of available data.

Results and discussion

This study aims to detect the suitable topology of the MLP and RBF networks, compare their predictive performances, and find the most accurate model to estimate CO2 solubility in [Bmim][PF6] ionic liquid. Therefore, the next three sections are devoted to RBF model design, MLP development, and a comparison of their results to find the most accurate model. The more precise model is validated using the collected experimental database. Finally, the proposed model is applied to probe the impact of temperature and pressure on the CO2 mole fraction in [Bmim][PF6].

RBF model developing

During any RBF neural network development, it is essential to find the number of hidden neurons and the spread value (for the Gaussian transfer function of the hidden layer). The minimum number of hidden neurons equals one, while its maximum number is often determined using a rule of thumb47. The number of training datasets should be 2–10 times the number of adjustable parameters, like weights and biases of the ANN model48,49,50,51,52,53,54. In this work, the middle value of this range (i.e., six) is utilized to detect the maximum quantity value of hidden neurons. Our experimental datasets (371 data points) are accidentally classified into training and testing subdivisions (75%:25%). Indeed, we have 278 training and 93 testing datasets. The RBF neural network with two independent variables, K hidden neurons, and one dependent variable poses 4 K + 1 weights and biases. Therefore, available training datasets should be higher than 6 × (4 K + 1). It can be simply concluded that the maximum number of hidden neurons is eleven. Therefore, various RBF neural networks differing in the number of hidden neurons and spread value are designed. For the RBF model with one to eleven hidden neurons, the spread values of 0.0001 to 10 are used. Indeed, 1100 RBF neural networks are developed during this analysis. Figure 1 presents the most accurate results obtained by the RBF model with various numbers of hidden neurons.

Fig. 1
Fig. 1The alternative text for this image may have been generated using AI.
Full size image

The best-obtained results by the RBF neural network with different numbers of hidden neurons.

The RBF neural network’s principle states that this model automatically adds one hidden neuron for each training dataset. Therefore, it is necessary to develop RBF neural networks with more than eleven hidden neurons and monitor their predictive performances. Several new RBF neural networks (with 12 to 278 hidden neurons) with different spread values are developed in this stage, and their accuracy during the training and testing stages is measured. The most accurate results obtained by different RBF structures are summarized in Table S1. It can be observed that the RBF model with the 2-42-1 architecture and spread = 0.1353 presented the most suitable accuracy for predicting CO2 mole fraction in the [Bmim][PF6]. It is worth noting that constructing the RBF model with more than 42 hidden neurons results in network overfitting.

To provide the readers with a visual observation of the overfitting concept, the accuracy of the RBF network with a 2-278-1 structure is presented. Figure 1 illustrates the performance of this RBF neural network for training, testing, and overall datasets. It can be seen that the RBF model with the small spread value shows excellent accuracy for the training datasets, but its performance for the testing subdivision is entirely unacceptable. It can be concluded that using the small spread value results in overfitting the RBF neural networks with 278 hidden neurons. In summary, the RBF neural networks with 278 hidden neurons and spread values higher than one show relatively similar performances (AARD ≈ 0.6%). Comparing the reported results in Figs. 1 and 2 confirmed that increasing the number of hidden neurons of the RBF neural networks produces no significant improvement in accuracy. Indeed, the RBF model with ten hidden neurons presents better results than the RBF model with 278 hidden neurons.

Fig. 2
Fig. 2The alternative text for this image may have been generated using AI.
Full size image

Presented relative deviation by the RBF models with 278 hidden neurons and different spread values.

MLP model development

The number of hidden neurons, transfer functions of hidden and output layers, and training algorithm need to be determined for the MLP model. Like the RBF, the maximum value of hidden neurons for the MLP is eleven. Four different combinations of the tangent and logarithm sigmoid have been utilized as transfer functions in the MLP neural networks in this work. Moreover, nine different optimization algorithms are employed to train the MLP models. More specifically, we developed 7920 MLP neural networks (11 × 4 × 9 × 20) (numbers of hidden neurons × four combinations of transfer function × nine training algorithms × twenty tries per structure) in this study. Table S2 summarizes the most accurate results obtained by different structures of the MLP neural networks.

The reported results in Table S2 confirmed that a network with eleven hidden neurons, tangent and logarithm sigmoid transfer functions in hidden and output layers, trained by the Levenberg–Marquardt algorithm, is the best MLP model among the 7920 designed paradigms.

Suitable ANN model selection

Figure 3 schematically explains a procedure used in this study to find the most suitable model. Concisely, a comparison between the best RBF and MLP models’ accuracy shows that the latter is the most suitable approach for anticipating the CO2 mole fraction in the [Bmim][PF6] ionic liquid.

Fig. 3
Fig. 3The alternative text for this image may have been generated using AI.
Full size image

The employed procedure to find the most accurate paradigm for predicting CO2 mole fraction in the [Bmim][PF6].

Here, the AARD index is only used to monitor the accuracy of the developed smart models. There exist some other statistical indices that are often employed to measure the accuracy of a model. Table 3 reports the accuracy of the best-designed MLP and RBF neural networks in terms of performance parameters for training, testing, and the whole databank.

Table 3 Measurement of the accuracy of the designed best MLP and RBF models using four well-known statistical criteria.

The structure of the developed MLP neural network is presented in Fig. 4. As mentioned earlier, temperature and pressure are independent variables, and CO2 solubility in [Bmim][PF6] ionic liquid is considered a dependent variable. Moreover, this model has the structure of 2-11-1, tangent sigmoid, and logarithmic sigmoid transfer functions in hidden and output layers, respectively. Furthermore, the Levenberg–Marquardt optimization technique is the best training algorithm for adjusting the MLP model hyperparameters. Figure 5 shows that the training algorithm continuously decreases the observed error up to 600 iterations, and the training stage is finished.

Fig. 4
Fig. 4The alternative text for this image may have been generated using AI.
Full size image

(a) The best MLP structure for prediction of the CO2 mole fraction in [Bmim][PF6], (b) The histogram diagram of the experimental data.

Fig. 5
Fig. 5The alternative text for this image may have been generated using AI.
Full size image

Variation of the mean square error of the proposed MLP model during training by the Levenberg–Marquardt algorithm.

Values of weights and biases of the trained MLP model by the Levenberg–Marquardt are presented in Table 4. This type of information is required to use our developed MLP model by others. The reported weights and biases in this table, along with the activation functions, are the only information that needs to be used in our MLP neural network for estimating CO2 mole fraction.

Table 4 Weights and biases of the proposed MLP model.

Proposed MLP validation

The focus of this section is devoted to the validation of the most accurate MLP model using experimental data. For this purpose, a graphical presentation based on the cross-plot is used. Cross-plot is a well-known method to check the performance of a model. It presents the predicted values as a function of their corresponding actual data points. The diagonal line of the cross-plot shows the exact adaptation of the predictions to their corresponding actual values. Figure 6 illustrates the cross-plot of predicted CO2 solubility versus the actual information for the training stage. It can be seen that major parts of the red circle symbols are almost concentrated around the diagonal line.

Fig. 6
Fig. 6The alternative text for this image may have been generated using AI.
Full size image

Cross-plot of predicted versus experimental CO2 solubility during the training stage.

The Leverage analysis of the MLP model was conducted, and its related regression plots are shown in Fig. 7 for the purpose of validating the MLP model with the experimental results. The Leverage approach was used for the MLP model because this model has better performance than the RBF model. According to Fig. 7, the regression coefficient in the validation of experimental results with the predicted data is 0.96269, indicating a good adaptation between the MLP model and the experimental results.

Fig. 7
Fig. 7The alternative text for this image may have been generated using AI.
Full size image

The Leverage analysis for studying the validation process in the MLP model.

The cross-plot for the whole of the datasets (training + testing) is presented in Fig. 8. Figures 6 and 9 also confirmed the moderate behavior of the MLP network for anticipating the experimental database of CO2 solubility in the [Bmim][PF6] ionic liquid.

Fig. 8
Fig. 8The alternative text for this image may have been generated using AI.
Full size image

Performance of the MLP network for the prediction of the 371 experimental data for the CO2 mole fraction.

Fig. 9
Fig. 9The alternative text for this image may have been generated using AI.
Full size image

The MLP predictions versus their associated experimental values for the testing subdivision.

It was mentioned previously that so collected the experimental database was collected from thirteen different research studies. We think it is good to monitor the performance of the designed MLP neural network to predict the reported datasets in the various manuscripts. Figure 10 presents the efficiency of the developed MLP paradigm in terms of deviation percentage for predicting experimental datasets reported in different references. It is clear that the MLP neural network explains the worst results in predicting the experimental data reported by Afzal et al.38. The next highest deviation percentage are devoted to the reported datasets by Yokozeki et al.33 and Shiflett and Yokozeki42, respectively. Our developed MLP approach predicted the experimental measurements in these references with the deviation of 53% and 34%, respectively. Experimental datasets about CO2 solubility in [Bmim][PF6] in all other references are predicted with an deviation of less than 28%. On the other hand, the MLP neural network predicted the experimental measurements by Kodama et al.39 with the highest accuracy, i.e., deviation percentage of 8.27%. It is noteworthy that the value of deviation percentage in this current work has been calculated at 0.02456% which is lower than the other works mentioned above. It is noteworthy that the value of deviation percentage in this current work has been calculated at 0.02456% which is lower than the other works mentioned above, and proves that the ANN structure for this work is better than the other works mentioned above.

Fig. 10
Fig. 10The alternative text for this image may have been generated using AI.
Full size image

Performance of the MLP neural network for estimating experimental datasets reported by different researchers.

Effect of temperature and pressure

The effect of isothermal change in pressure on CO2 solubility in 1-N-butyl-3-methylimidazolium hexafluorophosphate ionic liquid is displayed in Fig. 7. Figure 11 presents experimental data as well as their corresponding predictions by the MLP model. It is obvious that the MLP paradigm correctly predicts the trend of CO2 solubility in [Bmim][PF6] versus isothermal change in pressure. It is worth noting that the MLP model provides the AARD of 0.0859%, 0.00874%, and 0.03088% for temperatures of 283.15, 298.15, and 323.15 K, respectively. These observed deviations confirmed that the MLP neural network presents higher inaccuracy for predicting CO2 solubility at higher temperatures. This may be associated with the smaller number of available experimental datasets for higher temperatures. Moreover, both experimental data and MLP results proved that the CO2 mole fraction in [Bmim][PF6] increases by isothermal enhancing of the operating pressure. This observation is in agreement with the fact that the CO2 mole fraction in liquids increases with increasing pressure.

Fig. 11
Fig. 11The alternative text for this image may have been generated using AI.
Full size image

Difference of CO2 mole fraction in the [Bmim][PF6] with isothermal enhancing of the pressure.

Variation of CO2 solubility in the given ionic liquid as a function of isobar change of temperature from the experimental and predicted values is presented in Fig. 12. This figure shows a relatively good agreement between actual data points and MLP predictions. Our MLP model correctly detects the profile of CO2 solubility versus isobaric change of temperature. On the other hand, experimental and modeling results confirmed that increasing temperature decreases the CO2 mole fraction in the [Bmim][PF6]. The results are also in agreement with the scientific fact that gas solubility in liquids decreases with increasing temperature.

Fig. 12
Fig. 12The alternative text for this image may have been generated using AI.
Full size image

The effect of the isobaric increase in the temperature on the CO2 solubility in the [Bmim][PF6] ionic liquid.

For describing the CO2 solubility mechanism in the ionic liquid, equations that relate Henry’s constant (HCO2) to the pressure and temperature can be helpful. Carroll et al.55 explain the dependency of CO2 solubility on the Henry’s constant on temperature and pressure. Equation (11) is the dependency of CO2 solubility on the Henry’s constant at different pressures. Equation (12) is the dependency of CO2 solubility on the Henry’s constant at different temperatures. According to Eq. (11), pressure increase leads to an increase in the CO2 solubility. Indeed, Pressure increase can increase the Henry’s constant. Besides, pressure increase causes a change in Henry’s constant, and finally a change in the CO2 solubility. Equation (12) also proved that the temperature increase leads to an elevation of Henry’s constant, which means a positive influence on the CO2 solubility in the ionic liquid. This conclusion was also observed in the work of Carroll et al. In our work, the ANN predicts that the temperature (Fig. 12) and pressure (Fig. 11) increase lead to CO2 solubility elevation in the ionic liquid, which is owing to the increase the Henry’s constant.

$${\text{P}}_{{{\text{CO}}_{{2}} }} {\text{ = H}}_{{{\text{CO}}_{{2}} }} \cdot {\text{x}}_{{{\text{CO}}_{{2}} }}$$
(11)
$$\ln {\text{ H}}_{{{\text{CO}}_{{2}} }} = \, - 6.8346 + \, \frac{{1.2817 \times 10^{4} }}{{\text{T}}} - \frac{{3.7668 \times 10^{6} }}{{{\text{T}}^{2} }} \, + \, \frac{{2.997 \times 10^{8} }}{{{\text{T}}^{3} }}$$
(12)

The van’t Hoff formula is written in Eq. (13). The van’t Hoff formula relates the thermodynamic equilibrium constant (K) to the temperature. The values of enthalpy change (ΔH) and entropy change (ΔS) can be calculated with this formula. The graph of ln(K) vs T−1 was plotted in Fig. 13. The slope and intercept of Fig. 13 are \(\frac{{ - \Delta {\text{H}}^{{\text{o}}} }}{{\text{R}}}\) and \(\frac{{ - \Delta {\text{S}}^{{\text{o}}} }}{{\text{R}}}\), respectively. The values of enthalpy change and entropy change were determined at 22.802 kJ · mol−1 and 0.0703 kJ · mol−1 · K−1, respectively. It was concluded that the absorption of CO2 molecules with [Bmim][PF6] is endothermic with an irregular pattern, because of the positive sign of the enthalpy change and also the entropy change, respectively.

$${\text{ln}}\left( {\text{K}} \right){ = }\left( { - \frac{{{\Delta H}^{{\text{o}}} }}{{\text{R}}}} \right) \, \cdot \, \frac{{1}}{{\text{T}}}{ + }\frac{{{\Delta S}^{{\text{o}}} }}{{\text{R}}}$$
(13)
Fig. 13
Fig. 13The alternative text for this image may have been generated using AI.
Full size image

The plot of ln(K) vs T−1.

Simultaneous parameter effect

In this section, the effect of simultaneous changes in temperature and pressure on CO2 solubility in 1-N-butyl-3-methylimidazolium hexafluorophosphate ionic liquid is simulated using the proposed MLP model (Fig. 14). It is observed from Fig. 14 that the CO2 mole fraction in the [Bmim][PF6] increases by enhancing pressure and decreasing temperature. Actually, the maximum CO2 mole fraction in the given ionic liquid is achieved at the highest pressure and lowest temperature (i.e., pressure of 200 bar and temperature of 283 K). The fluctuations that appeared in this figure may be associated with the uncertainty in the MLP predictions, which was about 0.025% in terms of relative deviation.

Fig. 14
Fig. 14The alternative text for this image may have been generated using AI.
Full size image

Three-dimensional simulation graph for the effect of pressure and temperature on CO2 mole fraction in [Bmim][PF6] for (a) MLP model (train function: trainlm, transfer function for first layer: tansig, transfer function for second layer: logsig), (b) MLP model (train function: traincgb, transfer function for first layer: tansig, transfer function for second layer: tansig).

Future recommendations

Some recommendations about the ANN method in the field of CO2 capture can be useful for scientists working in the future. In our opinion, scientists can consider ionic liquids, eutectic solvents, and organic solvents as inputs. Additionally, other variables, such as activity coefficient, fugacity coefficient, acentric factor, and additives, can be defined as inputs. Other separation tools, like adsorbents, can be added to the other inputs, like ionic liquids, for detecting a better option for CO2 capture. Moreover, other outputs, such as absorption efficiency, can also be studied in addition to CO2 solubility. The other recommendation is entering data with the use of Aspen Plus and molecular dynamics (MD) because of a lack of enough experimental data. It is recommended to apply other ANN methods, like SVM, to study the CO2 solubility in [Bmim][PF6] ionic liquid. One of the challenges in the ANN method is entering a high amount of data as inputs, which can be solved by experts in the fields of computer engineering and chemical engineering.

Conclusions

CO2 solubility in [Bmim][PF6] is modeled using MLP and RBF. The MLP model predicted 371 experimental datasets with the least error and the highest adaptation to the experimental data. The structure of the suggested MLP model has eleven hidden neurons, tangent and logarithm sigmoid in hidden and output layers, and trained by the Levenberg–Marquardt algorithm. Also, the results confirmed that the MLP neural network presents higher inaccuracy for predicting CO2 solubility at higher temperatures. Sensitivity analysis on the RBF neural network structure showed that increasing the number of hidden neurons to more than 10 has no positive effect on its accuracy. It is recommended that the other neural networks be used in future work. Also, it is interesting to define more than one ionic liquid as input in an ANN to determine the better ionic liquid for CO2 elimination.