Introduction

Laser-induced breakdown spectroscopy (LIBS) is a technique that analyzes the elemental composition of a material by measuring the emission of excited atoms. In this method, a low-energy pulsed laser with high intensity (on the order of GW/cm²) vaporizes a small portion of the sample surface, creating a micro-plasma through atomization and ionization. The plasma then emits radiation, which a spectrometer collects and disperses, producing emission lines characteristic of the chemical elements present in the sample1,2. Hahn and Omenetto3,4 provided foundational insights into the plasma diagnostics and methodological strategies that define LIBS’s analytical capabilities. The authors demonstrated that LIBS offers several advantages, such as near-instantaneous results, minimal sample preparation, versatility, and operational simplicity. However, accurately quantifying samples with LIBS remains challenging due to several factors. Plasma parameters, such as electron density and temperature, influence line intensities and broadening. These parameters are also sensitive to experimental conditions and sample properties. Traditional methods, such as calibration-free laser-induced breakdown spectroscopy (CF-LIBS), derive concentrations directly from plasma emission spectra under the assumptions of local thermodynamic equilibrium (LTE), stoichiometric ablation, and optically thin plasma conditions. This approach requires careful plasma diagnostics, including determining the electron temperature (Te) via Boltzmann plots of emission line intensities and the electron density (ne) through Stark broadening analysis. Once these parameters are obtained, the relative elemental concentrations are computed by normalizing the transition probabilities and population densities derived from the measured spectra. 
Despite the advantages of CF-LIBS, such as rapid and minimally invasive analysis, the authors highlighted that achieving reliable compositional quantification continues to require careful plasma diagnostics and experimental design3,4. Furthermore, matrix effects, self-absorption, and surface oxidation, which are particularly prevalent in metallic systems such as aluminum alloys, can distort spectral features and reduce reproducibility5.
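The Boltzmann-plot diagnostic mentioned above can be illustrated numerically: under LTE, ln(Iλ/gA) is linear in the upper-level energy E, and Te follows from the fitted slope as Te = −1/(kB·slope). The sketch below uses fabricated line data (the wavelengths, statistical weights g, transition probabilities A, and energies E are illustrative values, not real transitions):

```python
import numpy as np

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K

def boltzmann_temperature(wavelength_nm, intensity, g, A, E_upper_eV):
    """Estimate electron temperature (K) from a Boltzmann plot.

    Plots ln(I*lambda / (g*A)) against the upper-level energy;
    under LTE the slope equals -1/(k_B * T_e).
    """
    y = np.log(intensity * wavelength_nm / (g * A))
    slope, _ = np.polyfit(E_upper_eV, y, 1)
    return -1.0 / (K_B_EV * slope)

# Synthetic check: fabricate line intensities for a 10,000 K plasma
# (hypothetical g, A, E, and wavelength values, not real transitions).
T_true = 10_000.0
E = np.array([3.14, 4.02, 4.83, 5.64])        # eV
g = np.array([2.0, 4.0, 6.0, 4.0])
A = np.array([5e7, 7e7, 4e7, 6e7])            # s^-1
wl = np.array([396.2, 309.3, 257.5, 226.9])   # nm
I = g * A / wl * np.exp(-E / (K_B_EV * T_true))

print(round(boltzmann_temperature(wl, I, g, A, E)))  # 10000
```

Stark-broadening analysis for ne would proceed analogously from measured line widths and tabulated Stark parameters.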

In this context, machine learning (ML) algorithms are a prominent interdisciplinary tool for autonomously identifying relevant patterns in data. ML, a subfield of artificial intelligence (AI)6, applies algorithms to data sets to enable "learning", prediction, and classification. ML can be applied to LIBS data with little subjective intervention7,8. The approach builds on LIBS calibration, which uses empirical correlations between emission line intensities and standards of known composition; the calibration data relate signal to concentration, allowing models to learn multivariate and potentially nonlinear relationships across the entire spectrum and thereby predict elemental composition.

According to Hao et al.9, machine learning can be divided into three categories: unsupervised, semi-supervised, and supervised learning. These categories are based on whether the data are labeled or unlabeled, and each serves different analytical objectives. Unsupervised methods operate on unlabeled data to uncover intrinsic patterns without human intervention or predefined outputs, making them useful for exploratory analysis or when labeled data are scarce. In LIBS, for example, unsupervised methods have been used to cluster spectra from similar samples based on spectral similarity (e.g., K-means clustering10 and hierarchical clustering11) and to reduce the dimensionality of spectral data to improve visualization and facilitate further analysis (e.g., principal component analysis (PCA)2 and ISODATA clustering12). Semi-supervised learning considers both labeled and unlabeled data by iteratively expanding pseudo-labels and making predictions on new data. It is important to note that semi-supervised learning uses labeled data to guide the learning procedure; thus, the method is only accurate when effective unlabeled samples are considered13,14. Supervised learning maps features with predictive power through extensive calibration efforts using fully labeled datasets; the learned patterns are used to train a model that can predict unknown samples. Several methods have been cited in the literature for predicting the concentration of elements in LIBS datasets based on the entire emission spectrum, including multiple linear regression (MLR), partial least squares (PLS), K-nearest neighbors (KNN), support vector machines (SVM), artificial neural networks (ANN), random forest (RF)15,16,17,18,19, and gradient boosting (GB)20.

RF regression has shown promise in processing LIBS datasets due to its ability to handle classification and regression tasks effectively and accurately while minimizing overfitting and keeping processing times short. LIBS datasets combined with the RF algorithm have been used as training sets, demonstrating a relationship between the number of trees in the RF and the accuracy of sample classification21. Feng et al.22 showed that this combination can predict the degree of copper (Cu) pollution in atmospheric sedimentation samples without complicated sample preparation, providing a basis for pollution prevention and control measures. Liu et al.23 evaluated the quantitative analysis of toxic elements in polypropylene (PP) via LIBS coupled with RF regression. Their results showed that different preprocessing methods (normalization and mean centering) and the use of variable importance improved the method's performance for plastic analysis.

Neiva et al.20 evaluated the integration of the SHapley Additive exPlanations (SHAP) algorithm with a GB model to predict carbon (C) concentration in soils analyzed by LIBS, demonstrating a framework for decoding the complex interplay between emission lines and target elemental concentrations. Hu et al.24 compared the Light Gradient Boosting Machine (LGBM) and Standard Deviation (SD) methods on spectral data obtained via LIBS, showing fast detection and quantification of copper (Cu) and zinc (Zn) in aerosols. Compared to RF and GB, the extremely randomized trees (ET) algorithm introduces greater randomization in feature and split selection, making it more suitable for large-scale datasets25. Ghazwani and Begum26 investigated the effectiveness of RF, GB, and ET models for predicting the solubility of the hyoscine drug and the density of solvents in the supercritical processing of pharmaceuticals, reporting R² values above 0.96 with relatively low average percentage errors. Despite this, the application of the ET algorithm to LIBS prediction remains limited. A recent study by Guedes et al.27 proposed a LIBS-based approach for carbon (C) quantification in soil, employing SHAP to guide variable selection and using an ET model for training on a refined dataset. The results demonstrated superior accuracy and robustness across training and validation sets, highlighting the model's effectiveness in handling spectral complexity and its lower susceptibility to data variability. Table 1 presents a comparison of tree-based models for elemental quantification using LIBS.

LIBS presents specific challenges when applied to aluminum alloys, which can affect the stability and accuracy of ML-based quantification. These challenges include the high thermal conductivity of aluminum, low ablation thresholds, and spectroscopic phenomena such as self-absorption of the aluminum (Al) resonance lines and overlap with lines from alloying elements (e.g., Cu, Zn, and Mg)28. Van den Eynde et al.29 explored the use of quantitative LIBS for characterizing aluminum alloys in recycling and manufacturing. The authors showed that accurately determining minor alloying elements in aluminum alloys is difficult due to matrix effects, strong self-absorption of the Al resonance lines (394.4 and 396.15 nm), and spectral overlap with common alloying elements, such as Cu, Zn, and Mg. However, they demonstrated that advanced regression strategies can significantly improve LIBS quantification by reducing prediction errors across multiple alloying elements. Wang et al.30 applied a backpropagation artificial neural network (BP-ANN) to LIBS analysis of Al-Mg alloys, demonstrating that nonlinear modeling of spectral features outperforms conventional calibration methods by capturing the complex relationships between plasma emission and alloy composition. In a recent study, ML was applied to LIBS spectra of five aluminum alloy classes. Nonlinear dimensionality reduction methods were compared with linear alternatives, such as principal component analysis (PCA), to mitigate the effects of high dimensionality and nonlinear relationships. The results showed that combining isometric mapping (IsoMap) nonlinear reduction with a support vector machine achieved 96.67% accuracy31.

The demand for rapid, accurate, and cost-effective characterization of aluminum alloys is increasing in sectors such as aerospace, automotive, and recycling, where precise control of alloy composition is critical for performance and sustainability. LIBS offers unique advantages for on-site analysis, yet its quantitative reliability is compromised by matrix effects, self-absorption, and spectral overlaps inherent to aluminum systems. Machine learning provides a pathway to overcome these limitations by leveraging the full spectral information to predict elemental concentrations without exhaustive plasma diagnostics. In this study, we comparatively evaluate three tree-based regression models—RF, GB, and ET—using a comprehensive LIBS dataset of aluminum and its alloys. Our goal is to explore the potential of these models for accurate and robust quantitative analysis, providing an effective alternative to traditional calibration strategies and deep learning approaches in metallurgical applications.

Table 1 Comparison between tree-based models for LIBS elemental quantification.

Materials and methods

Materials

Certified Reference Materials (CRMs) of commercially pure aluminum (cp-Al) and aluminum alloys (Al-Cu, Al-Zn, and Al-Cu-Zn) with documented elemental compositions were used for the parallel training of the three models (RF, GB, and ET) and for their validation in regression modeling of LIBS data. Table 2 reports the chemical elements and their weight percentages (wt%) in these samples, certified by MBH Analytical Limited.

Table 2 Chemical composition of the samples of cp-Al and its alloys.

The CRM surfaces were prepared for LIBS testing by sequential sanding from #220 to #1200 grit sandpaper, followed by cleaning with ethanol and drying.

Experimental setup of LIBS

Figure 1 shows the experimental setup of laser-induced breakdown spectroscopy (LIBS), which was designed and assembled at CENIM-CSIC. For all LIBS measurements in this study, plasma was generated using a Q-switched Nd:YAG laser (Onteko) with a wavelength of 1064 nm, a pulse duration of 6 ns, a repetition rate of 1 Hz, and a pulse energy of 10 mJ. The laser beam was focused on the sample surface at a focal distance of 50 mm to produce a plasma plume. A photodetector captured the light emitted by the plasma, with delay times optimized for each alloy system to maximize the signal-to-background ratio (SBR) within the 250–850 nm range. For cp-Al and Al-Cu alloys, a delay time of 5 µs was sufficient, since the Al and Cu lines are intense at the early stages of plasma emission and provide a high SBR. However, a longer delay time of 15 µs was required for Al–Zn and Al–Cu–Zn alloys to avoid the detector saturation observed at shorter delays (5 µs) and to improve the detection of Zn emission lines. At this later stage of plasma evolution, the continuum background decreases and the relative line intensities become more distinguishable, enhancing line resolution and signal-to-background ratio. This behavior is consistent with the temporal evolution of LIBS plasmas, in which emission characteristics depend strongly on time due to changes in plasma temperature, electron density, and continuum radiation3,4.

The emitted light is collected by an optical fiber and directed to a spectrometer (Ocean Optics USB4000), which disperses the light and records the spectral data with acquisition software (SpectraGryph 1.2).

In the present study, each CRM sample was analyzed at 50 different positions, with 10 consecutive laser pulses applied at each position. Only the emission spectrum from the tenth pulse was recorded, because the initial pulses were used primarily to clean the surface and stabilize plasma formation. Laser parameters and detection delay times were selected based on preliminary tests and optimization procedures to ensure acquisition of spectra with the highest signal-to-background ratio.

Quantification of chemical elements from LIBS data using RF, GB, and ET models

A total of 500 spectra, obtained from ten CRM aluminum samples with varying concentrations of aluminum (Al), copper (Cu), zinc (Zn), magnesium (Mg), iron (Fe), and silicon (Si), were used to train three tree-based models: RF, GB, and ET. The machine learning analysis was implemented in Python 3.1234 using Google Colab35, a cloud-based Jupyter notebook environment running Ubuntu 22.04 LTS. All computations were performed on a Colab virtual machine without GPU acceleration. Details of the scripts are described in the following sections.

Fig. 1 Schematic diagram of LIBS experimental setup.

Data acquisition and preprocessing of LIBS spectra

Initially, LIBS measurements were performed by applying ten consecutive laser pulses at each analysis position. Only the emission spectrum from the tenth pulse was recorded and saved as an individual CSV file containing paired wavelength-intensity data. Thus, each analysis position yielded one representative spectrum, while the first nine pulses cleaned the surface and stabilized the plasma. For each sample, 50 positions were analyzed, resulting in 50 spectra per sample. Considering ten different reference samples, a total of 500 spectra were obtained. Prior to modeling, the spectra underwent a multi-step preprocessing workflow. High-frequency noise was reduced and peak definition enhanced by applying a Savitzky–Golay filter36 with a fixed window size of 17 and a polynomial order of 2. The evaluated spectral range was 250–850 nm. The Savitzky–Golay filter parameters were selected as a compromise between noise reduction and preservation of emission-line features; they were defined based on preliminary tests evaluating different configurations to ensure consistent peak definition across the analyzed spectral range. Spectral intensities were then interpolated at 0.1 nm intervals and normalized to their maximum intensity to ensure comparability across samples. No feature selection or peak extraction was performed. Instead, the full preprocessed spectrum was used as input for the machine learning models, consisting of interpolated intensity values across the entire 250–850 nm range (~6000 variables). This approach allows the models to capture both strong emission lines and subtle spectral features associated with minor and trace elements.

Finally, a complete dataset was constructed by combining the interpolated and normalized spectral intensities (model features) with the corresponding certified elemental compositions (targets). All processed spectra were consolidated into a single dataset and exported as a CSV file for training and validating a machine learning model.
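The preprocessing workflow described above (Savitzky–Golay smoothing with window 17 and polynomial order 2, interpolation to a 0.1 nm grid over 250–850 nm, and max-normalization) can be sketched as follows; the function name and the synthetic two-line spectrum are illustrative, not part of the published pipeline:

```python
import numpy as np
from scipy.signal import savgol_filter

def preprocess_spectrum(wavelength, intensity,
                        wl_min=250.0, wl_max=850.0, step=0.1):
    """Smooth, resample, and max-normalize one LIBS spectrum."""
    # Savitzky-Golay filter: window 17, polynomial order 2
    smoothed = savgol_filter(intensity, window_length=17, polyorder=2)
    # Common 0.1 nm grid over 250-850 nm -> 6001 feature values
    n_points = int(round((wl_max - wl_min) / step)) + 1
    grid = np.linspace(wl_min, wl_max, n_points)
    resampled = np.interp(grid, wavelength, smoothed)
    # Normalize to the maximum intensity for cross-sample comparability
    return grid, resampled / resampled.max()

# Example with a synthetic noisy spectrum (two Gaussian "lines")
wl = np.linspace(250, 850, 3648)
raw = (np.exp(-((wl - 396.2) / 0.5) ** 2)
       + 0.4 * np.exp(-((wl - 324.8) / 0.5) ** 2)
       + 0.01 * np.random.default_rng(0).normal(size=wl.size))
grid, features = preprocess_spectrum(wl, raw)
print(grid.size, features.max())  # 6001 1.0
```

Stacking the `features` vectors of all 500 spectra row-wise, with the certified compositions as targets, yields the training dataset described above.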

Regression modeling of LIBS data for multi-element analysis

Three regression models based on machine learning methods were trained and evaluated to quantify the elemental composition from LIBS spectra: RF, GB, and ET. RF employs multiple decision trees, each built from a randomly sampled subset of features. The individual strength of the trees and their mutual correlation result in a generalization error that tends to converge33. GB builds predictive models that optimize a loss function using regression trees as base learners. This enables predictions for both regression and classification tasks, even with noisy data37. ET is a method designed for classification and regression tasks. It employs strong randomization to generate highly diversified trees with computational efficiency and robustness while maintaining competitive predictive accuracy38. These models were implemented using the scikit-learn library39, which provides practical tools for developing, training, evaluating, and deploying machine learning models in Python.

The previously generated dataset containing interpolated spectral intensities as input features and corresponding real elemental compositions as target variables was used for model training. The data were randomly split into training (80%) and test (20%) subsets at the spectrum level using a fixed random seed (42). In this approach, spectra from the same sample could be present in both subsets. This strategy was adopted to evaluate model performance at the spectral level, which is representative of typical LIBS measurements involving multiple acquisitions per sample. To assess model generalization and mitigate potential bias associated with this splitting strategy, external validation was performed using independent samples not included in the training dataset, as described in Sect. 2.3.3. This complementary validation ensures that the performance of the model is not overestimated because of the spectrum-level splitting strategy.

The RF and ET models were trained directly. In contrast, the GB model was wrapped in a MultiOutputRegressor to handle multiple outputs simultaneously. Each model was trained using 100 to 400 estimators (trees), and the optimal number of estimators was selected based on the minimization of the Mean Squared Error (MSE) on the test subset. This strategy contributes to controlling model complexity, reducing the risk of overfitting and underfitting during training, while the use of ensemble tree-based models (RF, GB, and ET) provides inherent robustness when handling high-dimensional spectral data. When similar predictive performances were observed across different configurations, the model with lower computational cost (i.e., fewer estimators and reduced training time) was preferred. Training and evaluation of the models were conducted in parallel using the joblib library to optimize computational performance. Model performance was evaluated using element-wise MSE and Root Mean Squared Error (RMSE)40, while the coefficient of determination (R²)41 was used as a global metric at the sample level. Finally, the trained models were saved individually in pickle format for further use in prediction tasks.
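A minimal sketch of this training setup with scikit-learn, using a small synthetic matrix in place of the real 500 × ~6000 spectral dataset (the array sizes and pseudo-concentrations are placeholders):

```python
import numpy as np
from sklearn.ensemble import (RandomForestRegressor,
                              ExtraTreesRegressor,
                              GradientBoostingRegressor)
from sklearn.multioutput import MultiOutputRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error, r2_score

# Synthetic stand-in for the spectral matrix X and the six
# certified concentrations y (wt%); sizes reduced for speed.
rng = np.random.default_rng(42)
X = rng.random((200, 300))
W = rng.random((300, 6))
y = X @ W / W.sum(axis=0) * 10.0

# 80/20 spectrum-level split with fixed seed 42, as in the text
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, random_state=42)

models = {
    "RF": RandomForestRegressor(n_estimators=100, random_state=42, n_jobs=-1),
    "ET": ExtraTreesRegressor(n_estimators=100, random_state=42, n_jobs=-1),
    # GradientBoostingRegressor is single-output, so wrap it
    # to predict all six elements simultaneously
    "GB": MultiOutputRegressor(
        GradientBoostingRegressor(n_estimators=100, random_state=42)),
}

for name, model in models.items():
    model.fit(X_tr, y_tr)
    pred = model.predict(X_te)
    mse = mean_squared_error(y_te, pred)
    print(f"{name}: MSE={mse:.4f} RMSE={np.sqrt(mse):.4f} "
          f"R2={r2_score(y_te, pred):.4f}")
```

Sweeping `n_estimators` from 100 to 400 and keeping the value with the lowest test MSE, then persisting each fitted model with `joblib.dump`, would complete the procedure described above.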

Application of trained models to new LIBS spectra

To estimate the chemical composition of the reference and unknown (predicted) samples (Table 2), the trained models (RF, GB, and ET) were applied to the LIBS spectra collected from these samples. Each input spectrum was preprocessed with smoothing, wavelength filtering, interpolation, and normalization to generate input features, as previously described in Sect. 2.3.1. The three models, saved in pickle format, were loaded using the joblib library to predict the elemental concentrations in weight percent (wt%). Reference samples were used to evaluate the behavior of the models and their consistency, while predicted samples were used to assess the models' ability to predict the composition of new samples.

The predictions for each model were compiled into structured data frames, and the mean composition of each element was calculated across all analyzed spectra (50 files per sample). The results were exported as bar graphs and an Excel file was generated with individual sheets containing the predictions of each model.
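The prediction step (loading a pickled model with joblib and averaging the per-spectrum predictions for one sample) might look like the following sketch; `predict_sample`, the element ordering, and the model path are illustrative assumptions:

```python
import numpy as np
import joblib
import pandas as pd

ELEMENTS = ["Al", "Cu", "Zn", "Mg", "Fe", "Si"]  # assumed output order

def predict_sample(model_path, spectra):
    """Apply a saved model to a stack of preprocessed spectra and
    report the mean predicted composition per element (wt%).

    In the described workflow each sample contributes 50
    preprocessed spectra of ~6000 intensity values each.
    """
    model = joblib.load(model_path)          # pickle saved after training
    preds = model.predict(np.asarray(spectra))
    return pd.Series(preds.mean(axis=0), index=ELEMENTS, name="wt%")
```

Calling this once per sample and per model, then concatenating the resulting Series, reproduces the structured data frames and per-model sheets described above.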

The agreement between predicted and reference values was assessed using Student's t-tests (α = 0.05). The analysis considered both the variability of the predicted means (standard deviation of replicate spectra) and the reported uncertainties of the certified reference values, when available. This approach follows standard statistical procedures for evaluating the accuracy and uncertainty of measurement data42,43. The Student's t-test was not performed on aluminum (Al), since its reference values were obtained by difference (100 wt% minus the sum of the other alloying elements) and thus lacked an experimentally measured standard deviation.
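A simplified sketch of this agreement test, using a one-sample Student's t-test on the replicate predictions (scipy.stats.ttest_1samp); folding in the certified uncertainty, as described above, would instead use a two-sample formulation such as `stats.ttest_ind_from_stats`:

```python
import numpy as np
from scipy import stats

def agrees_with_reference(predictions, certified_value, alpha=0.05):
    """One-sample Student's t-test: do the replicate predictions for
    one element differ significantly from the certified value?
    Returns True when no significant difference is detected.
    """
    t_stat, p_value = stats.ttest_1samp(predictions, certified_value)
    return p_value >= alpha

# Illustrative replicate predictions for one element (not real data)
rng = np.random.default_rng(1)
cu_preds = rng.normal(loc=6.0, scale=0.15, size=50)  # 50 replicate spectra
print(agrees_with_reference(cu_preds, 6.0))
```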

Model validation was carried out at two levels: (i) spectrum-level validation, which involved an 80/20 random split across 500 spectra, and (ii) sample-level validation, which entailed aggregating spectra from individual alloys. Two scenarios were considered at the sample level: in-sample (non-independent) validation, in which the ten samples were included in the training dataset, and external validation, in which the independent samples E114, G77J4, 11694, and 11695 were not used for training. In-sample validation was used to evaluate the physical and chemical consistency of the models at the sample scale. It is important to highlight that this validation is not independent, since spectra from the same samples may appear in both the training and test subsets due to the spectrum-level data split. Therefore, this analysis assesses the internal consistency of the models rather than their generalization capability.

Additionally, model performance was evaluated using standard regression metrics: mean squared error (MSE), mean absolute error (MAE), root mean squared error (RMSE)44, and the coefficient of determination (R²). MSE and RMSE emphasize larger deviations by penalizing squared errors, whereas MAE provides a more direct measure of the average prediction error in weight percent (wt%). R² quantifies the proportion of the variance in the reference composition explained by the model; values closer to 1 indicate higher predictive accuracy. In the context of LIBS analysis, where plasma fluctuations, matrix effects, and spectral overlap introduce significant variability, these metrics allow assessment of both the magnitude of prediction errors (MAE, MSE, and RMSE) and the overall model fit (R²). Together, they provide a framework for evaluating the predictive reliability of regression models applied to spectroscopic data.
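The four metrics can be illustrated on a toy reference/prediction pair (values chosen by hand, not taken from the study):

```python
import numpy as np
from sklearn.metrics import (mean_absolute_error, mean_squared_error,
                             r2_score)

# Toy reference vs. predicted concentrations (wt%)
y_true = np.array([1.0, 2.0, 3.0, 4.0])
y_pred = np.array([1.1, 1.9, 3.2, 3.8])

mae = mean_absolute_error(y_true, y_pred)   # average |error| in wt%
mse = mean_squared_error(y_true, y_pred)    # penalizes large errors
rmse = np.sqrt(mse)                         # back in wt% units
r2 = r2_score(y_true, y_pred)               # fraction of variance explained

print(round(mae, 4), round(mse, 4), round(rmse, 4), round(r2, 4))
# 0.15 0.025 0.1581 0.98
```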

The full pipeline, including preprocessing and training scripts, is archived at Zenodo45.

Results and discussion

Internal spectrum-level validation of ensemble regression models

Figure 2 illustrates the preprocessing workflow applied before model training. It shows a representative LIBS spectrum of a cp-Al sample at four stages: raw acquisition, after Savitzky–Golay smoothing, interpolated at 0.1 nm, and normalized to maximum intensity. This visualization clarifies how spectral features were standardized before machine learning analysis. An advantage of this workflow is that it does not require baseline subtraction or background smoothing. Combining Savitzky–Golay filtering and normalization reduced high-frequency noise, standardized the spectra, and preserved emission-line intensities. This simplified the preprocessing pipeline and avoided the risk of introducing distortions caused by baseline correction methods.

Fig. 2 Example of the LIBS pre-processing workflow applied to a representative spectrum: (a) raw spectrum, (b) Savitzky–Golay smoothing, (c) interpolation at 0.1 nm, and (d) max normalization.

Table 3 presents the performance of the models based on an internal 80/20 split of the dataset, which comprises 500 LIBS spectra from ten reference samples with varying chemical compositions (see Table 2). The evaluation was conducted at the spectrum level, meaning spectra from the same sample could appear in both the training and test subsets. The RF model achieved an MSE of 0.1548 and an R² of 0.9517. The GB model improved the fit, achieving an MSE of 0.0837 and an R² of 0.9715. The ET model reduced the error further, achieving an MSE of 0.0429 and an R² of 0.9880. These results demonstrate that the ET model outperformed the other algorithms by providing the lowest error and the highest coefficient of determination. This superior performance becomes clearer when considering the nature of LIBS data and the limitations of conventional linear approaches. For example, although Partial Least Squares (PLS) regression is widely used in LIBS quantitative analysis due to its simplicity and interpretability, it assumes linear relationships between spectral intensities and elemental concentrations. PLS models typically provide comparable performance under controlled conditions but are limited when dealing with nonlinear behavior and compositional complexity46,47. Studies have shown that machine learning approaches can outperform traditional linear methods in LIBS quantitative analysis, with higher R² values and lower prediction errors48,49. As previously mentioned, the literature highlights these limitations, demonstrating that deviations from linearity are prevalent due to plasma-matter interactions and self-absorption effects1,50.

The ET model's enhanced performance can be attributed to its greater randomness in feature and threshold selection during tree construction. The ET algorithm is also more robust to noise and outliers because of its randomized splitting strategy, and it is less sensitive to feature scaling because tree-based methods rely on threshold-based decisions rather than distance metrics. These characteristics contribute to the ET algorithm's stability when handling high-dimensional, noisy LIBS spectral data; such properties are well documented for ensemble tree methods and underpin their effectiveness in high-dimensional and noisy datasets38. This is particularly advantageous for LIBS data analysis because plasma fluctuations, matrix effects, and background noise introduce nonlinearities. Consequently, the ET model can achieve a more effective balance between bias and variance, resulting in improved generalization performance. Similar findings were reported by Geurts et al.38, who demonstrated that adding randomness to the selection process reduces generalization error in classification and regression tasks.

As previously mentioned, LIBS analysis of aluminum and its alloys presents several challenges, including high thermal conductivity, low ablation thresholds, pronounced self-absorption of the Al resonance lines at 394.4 and 396.15 nm, and spectral overlap with alloying elements such as copper, zinc, and magnesium3,4,29. These effects reduce reproducibility, making it more difficult to extract reliable features for quantitative analysis. The superior performance of the ET algorithm in this study suggests that it is well suited for LIBS applications involving aluminum alloys, as it can handle the nonlinearities and noise that conventional calibration strategies often fail to compensate for in the presence of matrix effects. By incorporating greater randomness in tree construction, the ET model can capture subtle spectral variations associated with minor alloying elements, mitigating distortions introduced by plasma instabilities and line self-absorption.

Table 3 Performance of three trained models for 500 randomly selected LIBS spectra.

Increasing the number of estimators for RF and ET models from 100 to 400 did not result in significant improvements in predictive accuracy. This confirms that merely expanding the ensemble size does not guarantee better performance in LIBS modeling. Similar conclusions were reported by Friedman et al.51, who emphasized that inappropriate hyperparameter tuning, such as a suboptimal learning rate, maximum tree depth, or number of iterations, can result in overfitting or underfitting.

Training the LIBS-ML models took about 68 min for the 500 spectra. This cost can be attributed to fitting three ensemble models (RF and ET with 100 trees each, and GB with 400 trees) on approximately 6000 input features (derived from the 0.1 nm interpolation of each spectrum) within the computational environment of a Google Colab virtual machine.

Validation of regression models by sample

In addition to the internal, spectrum-level validation presented in Table 3, model performance at the sample level was examined under two scenarios: in-sample (non-independent; samples included in the training set) and external (samples E114, G77J4, 11694, and 11695). Together, these analyses provide insight into the consistency and generalization ability of the ensemble regression models.

In-sample (non-independent) validation: samples included in the training dataset

The statistical metrics of the predictive performance of the three regression models are presented in Tables 4 and 5. Table 4 reports MAE and RMSE for each chemical element, while Table 5 summarizes the global metrics (MAE, RMSE, and R²) for the entire dataset. These results align with the graphical trends observed in Figs. 3, 4 and 5. It is important to note that this analysis is not independent validation: due to the spectrum-level data splitting strategy adopted in this study, spectra from the same samples were included in both the training and test subsets. The evaluation in this section is therefore an in-sample (non-independent) validation that reflects the consistency of the models in predicting spectra from samples already present in the training dataset. This analysis assessed whether the models preserve the chemical identity of each alloy when multiple spectra are averaged, which is representative of practical LIBS applications.

Fig. 3 Reference versus predicted compositions for the RF model using in-sample (non-independent) predictions. Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–12 wt%. The dashed line represents the 1:1 correlation.

For aluminum-rich samples (cp-Al), all models exhibited high accuracy. Predictions were closely aligned with reference values and showed minimal dispersion. This behavior was observed across all three cp-Al samples (67990, 11697, and 11628), with deviations below 0.5 wt% for the quantified elements. The GB model showed higher scatter in this case, while the ET model consistently provided the most accurate fits. The RF model exhibited intermediate performance, closely following the ET model for most cp-Al predictions.

For the Al–Cu alloys (samples E113, E115, and E116), all models slightly underpredicted Cu. This tendency was more pronounced in sample E116, whose 6.6 wt% Cu content was underestimated by all algorithms, especially the RF and GB models. For the Al–Cu–Zn ternary alloys, the ET model agreed most closely with the Zn reference values, reproducing both intermediate Zn levels (3.25 wt% in sample G77J2) and high Zn content (11.60 wt% in sample G77J6) with minimal bias. In contrast, the RF and GB models exhibited larger deviations at the highest Zn concentrations. For the cp-Al sample (67990), where trace elements dominate the residuals, the ET model outperformed the others, while the GB model displayed the greatest deviation.

These trends were confirmed by the predicted versus reference plots. Aluminum-rich matrices exhibited strong linearity across all algorithms, whereas secondary alloying elements showed slight but systematic deviations. Linearity was preserved for major and secondary elements up to 12 wt% across all ten samples, confirming the robustness of the models over a wide compositional range. Prediction accuracy decreased for all models for minor constituents (less than 1 wt%), with increased scatter around the 1:1 line. This behavior reflects the known challenges of LIBS at trace levels, including plasma fluctuations, spectral overlap, and matrix effects. The additional randomness of the ET model reduced systematic bias and improved generalization, thereby enhancing its performance, while the RF model remained intermediate between GB and ET.

Fig. 4 Reference versus predicted compositions for the GB model using in-sample (non-independent) predictions. Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–12 wt%. The dashed line represents the 1:1 correlation.

The results demonstrated that all three models can accurately predict the major and secondary constituents of cp-Al and Al-based alloys when evaluated under non-independent conditions. Among the tested approaches, the ET model appears to be the most reliable regressor, providing the most consistent predictions without requiring extensive hyperparameter tuning.
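The workflow evaluated above can be sketched with scikit-learn using the three ensemble regressors at their default settings, mirroring the untuned configuration described for the ET model. The spectra `X` and reference compositions `y` below are random stand-ins for the actual LIBS dataset, not the paper's data.

```python
# Hypothetical sketch: fitting RF, GB, and ET regressors on LIBS-like data.
# X: (n_spectra, n_channels) emission intensities; y: (n_spectra, n_elements)
# reference concentrations in wt%. Both are synthetic placeholders.
import numpy as np
from sklearn.ensemble import (RandomForestRegressor,
                              GradientBoostingRegressor,
                              ExtraTreesRegressor)
from sklearn.multioutput import MultiOutputRegressor

rng = np.random.default_rng(0)
X = rng.random((60, 200))            # 60 spectra, 200 spectral channels
y = rng.random((60, 5)) * 12.0       # 5 elements, 0-12 wt%

models = {
    "RF": RandomForestRegressor(random_state=0),
    # GradientBoostingRegressor is single-output, so wrap it to predict
    # all elements at once, as RF and ET do natively.
    "GB": MultiOutputRegressor(GradientBoostingRegressor(random_state=0)),
    "ET": ExtraTreesRegressor(random_state=0),
}

for name, model in models.items():
    model.fit(X, y)
    y_pred = model.predict(X)        # in-sample (non-independent) predictions
    print(name, y_pred.shape)
```

Note that RF and ET support multi-element output directly, whereas gradient boosting requires one boosted ensemble per element; the `MultiOutputRegressor` wrapper handles that transparently.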

Fig. 5 Reference versus predicted compositions for the ET model using in-sample (non-independent) predictions. Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–12 wt%. The dashed line represents the 1:1 correlation.

External validation: independent samples

The predictive performance of the regression models was also assessed using independent LIBS spectra from validation samples not included in the training set (Table 2). Figures 6, 7, and 8 compare the reference and predicted compositions obtained with the three models.

The results indicate that the three ensemble regression models (RF, GB, and ET) accurately captured the overall compositional trends of the validation samples. In the binary Al–Cu alloy (sample E114), copper was correctly identified as the secondary element at 5–6 wt%, while the minor constituents (Fe, Mg, Si, and Zn) were detected at trace levels below 0.5 wt%. Although the absolute concentrations were correctly identified, the predictions displayed slight, model-dependent dispersion, especially in the low-concentration range.

For the ternary Al–Cu–Zn alloy (sample G77J4), the models correctly identified Cu (approximately 1 wt%) and Zn (5–6 wt%) as the relevant alloying elements. Predictions for Fe, Mg, and Si remained below 2 wt%, consistent with their expected minor role. The ET model showed the closest agreement with the reference values for Zn, while the GB model displayed slightly higher scatter in this range. The three models also maintained high predictive accuracy for commercially pure aluminum (samples 11694 and 11695), with residual trace elements (Fe, Mg, Si, and Zn) predicted at levels below 2 wt%.

The differences between the models were generally subtle for major elements, whereas clearer discrepancies emerged for trace constituents, particularly in the low-concentration regime, where ET showed greater robustness and GB exhibited higher sensitivity to spectral noise. The slightly higher prediction errors observed for the Al–Cu and Al–Cu–Zn alloys (samples E114 and G77J4, respectively) are influenced by alloy type and compositional complexity: binary and especially ternary alloys involve more intricate multi-element interactions, making prediction more difficult. Additionally, the presence of multiple alloying elements and trace constituents increases spectral complexity and reduces the signal-to-noise ratio of minor elements, further contributing to the observed increase in prediction error.

It is important to note that Zn exhibited more pronounced deviations in the external validation samples (Figs. 6, 7, 8). This behavior was not observed in the in-sample (non-independent) validation, where Zn predictions, particularly those of the ET model, were more accurate. These findings suggest that the deviations can primarily be attributed to low Zn concentrations (the trace regime) and to the increased matrix complexity of ternary alloys such as G77J4.

A statistical comparison based on t-tests revealed that the GB model produced the greatest number of cases in which the predictions were statistically indistinguishable from the reference values within the experimental uncertainty (7 out of 24 cases, p > 0.05). The ET and RF models followed, with four and two such cases, respectively. While this suggests that GB predictions more often overlap with the reference values, it is largely a consequence of their higher variance: inflated prediction variances increase the standard errors and, consequently, the p-values, masking systematic deviations. In contrast, the ET model exhibited statistical differences from the reference values in most cases, but this was due to its lower dispersion, which made the test more sensitive to small systematic biases. A complete list of p-values for each sample and element is provided in Supplementary Table S1. This behavior suggests that, although the GB model appears statistically consistent with the reference values, its higher prediction variance makes it unreliable for precise quantitative applications. In LIBS analysis, where accurate estimation of both major and trace elements is critical, low dispersion is generally preferable to statistical indistinguishability driven by uncertainty. Therefore, despite the higher p-values observed for the GB model, the ET model provides more consistent and reliable predictions for practical compositional analysis.
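The t-test comparison described above can be illustrated with a one-sample test checking whether replicate predictions for a single element are statistically indistinguishable from the certified reference value. The replicate values and the 5.0 wt% reference below are illustrative only, not taken from the paper.

```python
# Minimal sketch of the prediction-versus-reference t-test.
# A small spread around the reference yields p > 0.05 (indistinguishable);
# note how a larger spread would inflate the standard error and the p-value
# even in the presence of a systematic offset.
from scipy import stats

reference = 5.0                                  # certified value, wt%
predictions = [4.91, 5.08, 4.97, 5.12, 4.88]     # replicate predictions, wt%

t_stat, p_value = stats.ttest_1samp(predictions, popmean=reference)
if p_value > 0.05:
    print("prediction statistically indistinguishable from reference")
```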

To integrate the results of the semi-external and external validations, we assessed the global predictive performance using the mean absolute error (MAE), the root mean squared error (RMSE), and the coefficient of determination (R²). Figure 9 shows the MAE (Fig. 9a) and RMSE (Fig. 9b) values for all models (RF, GB, and ET) across the validation samples. Although not shown in Fig. 9, the R² values remained consistently close to unity for all models, indicating high predictive accuracy. Overall, the three ensemble models produced low error metrics, with MAE values generally below 0.25 wt% and RMSE values below 0.33 wt%. In LIBS, analyzing these metrics jointly is particularly relevant: MAE weights all deviations equally, whereas RMSE penalizes large deviations more heavily, so together they capture errors at both major and trace levels.
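The three global metrics can be computed directly with scikit-learn; the reference and predicted compositions below are illustrative values, not the paper's data.

```python
# Sketch of the MAE, RMSE, and R² computation used for the global comparison.
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_true = np.array([0.15, 0.42, 1.05, 5.60, 92.78])   # reference, wt%
y_pred = np.array([0.18, 0.39, 1.12, 5.48, 92.83])   # predicted, wt%

mae = mean_absolute_error(y_true, y_pred)            # mean |error|
rmse = np.sqrt(mean_squared_error(y_true, y_pred))   # penalizes large errors
r2 = r2_score(y_true, y_pred)                        # close to 1 for good fits
print(f"MAE={mae:.4f} wt%, RMSE={rmse:.4f} wt%, R2={r2:.4f}")
```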

For the Al-Cu alloys (samples E114 and E116), the ET model generally outperformed the RF and GB models. It achieved the lowest error in sample E116 (MAE = 0.1519 wt%; RMSE = 0.2058 wt%). For the ternary alloys (samples G77J2 and G77J4), the ET model again provided the most accurate results, while the GB model showed higher deviations. The ET model also yielded the best predictions for cp-Al (sample 67990: MAE = 0.0125 wt%; RMSE = 0.0163 wt%), substantially reducing errors compared to the RF and GB models.

Finally, when comparing semi-external and external validation, error metrics remained consistently low in both sets. For the semi-external samples, the ET model achieved an MAE of 0.0125 wt% (cp-Al 67990) and RMSE values below 0.21 wt% for the alloys. For the external samples, the MAE remained in the same range (< 0.25 wt%) and the RMSE values stayed below 0.33 wt%. This indicates that transitioning from training-pool samples to independent ones did not significantly increase predictive error: the models, particularly the ET model, preserved high accuracy and generalization capacity, confirming that they learned the underlying compositional relationships rather than memorizing the training spectra. Accordingly, the high R² values observed are not indicative of overfitting but reflect the ability of the models to generalize to independent samples not included in the training dataset.

Table 4 Predictive performance of the three regression models (RF, GB, and ET) based on MAE and RMSE for each chemical element.
Table 5 Predictive performance of the three regression models (RF, GB, and ET) based on global statistical metrics (MAE, RMSE, and R²).
Fig. 6 Reference versus predicted compositions obtained with the RF model for the validation samples (E114, G77J4, cp-Al 11694, and cp-Al 11695). Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–7 wt%. The dashed line represents the 1:1 correlation.

Fig. 7 Reference versus predicted compositions obtained with the GB model for the validation samples (E114, G77J4, cp-Al 11694, and cp-Al 11695). Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–7 wt%. The dashed line represents the 1:1 correlation.

Fig. 8 Reference versus predicted compositions obtained with the ET model for the validation samples (E114, G77J4, cp-Al 11694, and cp-Al 11695). Axis breaks highlight the ranges (a) 0–1 wt% and (b) 1–7 wt%. The dashed line represents the 1:1 correlation.

Fig. 9 Comparison of prediction errors for the validation samples: (a) mean absolute error (MAE) and (b) root mean squared error (RMSE) obtained for the RF, GB, and ET models.

The results demonstrate that tree-based models are well-suited for rapid, accurate compositional analysis in LIBS applications. The ET model appears to be the most reliable regressor among the tested approaches when used with its default settings without the need for prior hyperparameter tuning. It exhibited the strongest bias control across different compositional regimes. The RF model offered a balanced compromise between accuracy and robustness. The GB model, however, proved to be less robust across broader compositional ranges.

The main limitation observed was that the models produced inaccurate predictions due to the insufficient representation of alloy families in the training dataset. Using only one sample per group does not provide enough variability for the models to capture the corresponding compositional domain, leading to inefficient learning and poor predictions. Our results indicate that reliable performance requires at least three representative samples from each group (e.g., cp-Al, Al–Cu, Al–Cu–Zn), enabling the models to interpolate more effectively across the expected compositional ranges. This finding highlights that ensemble regressors perform best in multi-element systems when supported by adequate diversity in the training data.
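The minimum-diversity requirement identified above can be expressed as a simple pre-training check on the dataset: before fitting, verify that every alloy family contributes at least three samples. The family labels and counts below are hypothetical stand-ins for the paper's sample groups.

```python
# Guard reflecting the training-diversity finding: each alloy family
# (cp-Al, Al-Cu, Al-Cu-Zn) must be represented by at least three samples
# so the models can interpolate within its compositional domain.
import numpy as np

MIN_PER_FAMILY = 3
families = np.array(["cp-Al"] * 3 + ["Al-Cu"] * 4 + ["Al-Cu-Zn"] * 3)

labels, counts = np.unique(families, return_counts=True)
for label, count in zip(labels, counts):
    assert count >= MIN_PER_FAMILY, f"family {label} is underrepresented"
print(dict(zip(labels.tolist(), counts.tolist())))
```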

Conclusion

The present work evaluated the accuracy of determining the elemental composition of cp-Al and its alloys by combining laser-induced breakdown spectroscopy (LIBS) with supervised machine learning models: RF, GB, and ET. The study concluded that tree-based regressors are adequate for LIBS applications, providing rapid, multi-element quantification with high accuracy across a broad range of compositions. The main conclusions are:

  • The ET model showed the best overall performance, providing the most accurate predictions across all alloy types (MAE < 0.25 wt%, RMSE < 0.33 wt%).

  • Model performance strongly depends on dataset diversity. Reliable predictions require at least three representative samples per alloy family to capture compositional variability.

  • All models demonstrated good generalization capability, maintaining low prediction errors for independent samples; however, the ET model showed the most stable behavior when applied to unseen alloys.

  • The GB model achieved more statistically indistinguishable results (p > 0.05); however, this was mainly due to higher prediction variance rather than improved predictive accuracy.